Psychological Testing and Assessment - Van der Molen - Chapters 5-9

Which theories about individual intelligence and performance tests are there? - Chapter 5
How are 'special populations' tested? - Chapter 7
Where do personality tests originate? - Chapter 8
How does the assessment of normality and human strengths work? - Chapter 9

Which theories about individual intelligence and performance tests are there? - Chapter 5

Topic 5A Theories of intelligence and factor analysis

Definitions of intelligence

Intelligence is one of the most researched topics in psychology.

Operational definitions define a concept in terms of the way the test is used. This way of defining is not useful for intelligence. Intelligence tests have been developed for measuring intelligence and not for defining it. In addition, operational definitions block the further development of the understanding of the nature of intelligence. A second problem is that the operational definition does not allow innovation. The old tests serve as a validity criterion for new tests. There must be a correlation between the old and new tests and in newly developed, innovative tests this correlation may weaken. According to the operational definition, they may therefore not be valid.

The real definition explains the true nature of the concept. To arrive at a true definition of intelligence, you have to ask experts. Several experts have come up with a definition. This list of definitions is Western and does not speak of cultural differences. The East thinks that benevolence, humility, freedom from conventional standards in assessment and doing what is right are essential for intelligence. In Africa, the social aspects of intelligence are emphasized. Despite all the differences, there are usually two things that everyone agrees on: intelligence is the capacity to learn from experiences and the capacity to adapt to the environment.

Another approach to understanding a construct is to study the popular meaning. Sternberg and colleagues did this and found out that the behaviors that are thought to belong to intelligence are quite similar to what experts think. The difference lies more in the order of importance.

When laymen think of intelligence, the following aspects are mentioned: practical problem-solving skills, verbal skills, and social competence. Experts mention the following aspects: verbal intelligence, problem-solving skills, and practical intelligence. This shows that public opinion on intelligence can be recognized in the intelligence tests.

The two parts that lay people and experts agree on are also not included in the tests. Firstly, because it is difficult to find items for these and secondly, because there has been little change in the design of the tests since the Binet time. Sternberg and colleagues have presented innovations that are based on their model of intelligence. Another interesting tool is the Everyday Problem Solving Inventory. In this test, participants should indicate how their response would be to everyday problems, such as forgetting your money when you go for lunch with a friend.

Factor analysis

Factor analysis is used to measure the validity of tests. There is confirmatory factor analysis and exploratory factor analysis. The first one checks whether test scores and variables fit in a predicted pattern and is therefore good for validation research. In the second, the relationships between many variables are summarized and it is examined whether they can be reduced to a lesser number of factors. It ensures that a simplified representation is given of a large, complex data set. Each collection of aptitude tests reflects a number of underlying basic skills.

A factor analysis starts with a correlation matrix, a table that shows correlations between different variables. Variables that have a high correlation with each other seem to explain a factor.

The smallest number of factors are determined by means of fast computers. Then a factor matrix can be made, which shows how heavily a variable weighs on a certain factor (values from -1.00 to +1.00).

The factor loads are then 'rotated', the best fitting variable is stopped at the best matching factors. In 'rotation to positive manifold', as many negative factor loads as possible are removed per factor. In 'rotation to simple structure', factor loadings are classified in such a way that each variable has a significant load with as few factors as possible. The aim is to make the factor matrix as clear and analysable as possible.

The interpretation of the factors that come out has to be done by the researcher himself. The researcher looks at what the variables of the factor have in common and then gives that interpretation. There are also disadvantages for factor analysis. Firstly, a certain factor can only come out of the analysis if the tests and measurements also contain that factor. Second, the sample size is important: around n=300 a factor analysis is stable and reliable. Finally, factor analysis is always subject to subjective choices and theoretical prejudices of the researcher.

Galton and sensory keenness

Galton thought that intelligence was explained by keen sensory abilities. The theory, however, had a dead end with regard to psychometry: it could hardly be tested. Later in the 1980s, however, reaction time tests were developed that measured the speed of processing. Even though these tests suggested that speed-of-processing measures could be a useful addition to standardized intelligence tests, these tests as well as their data analysis were not sufficiently standardized. That is why this method has not been developed much further.

Spearman and the g factor

Spearman stated that intelligence was based on a general intelligence factor g and several specific factors s1, s2, s3, etc. He focused primarily on defining the nature of g which he called the energy or power that generally appears in every context. The specific factor s represents a specific group of neurons that were used in certain mental actions. He found that individual differences in g reflected three principles of cognition: apprehension of experience, eduction of relations, and eduction of correlations. The word eduction can be rephrased as figuring things out. Even though Spearman’s speculations about physiology are for the most part dismissed, his notion of a general factor of intelligence remains relevant until today.

Thurstone and primary mental skills

Thurstone used factor analysis and concluded that not a single intelligence factor, but a group of factors, could explain empirical results. That brought him to his 7 primary mental skills: verbal comprehension, word fluency, number, space, associative memory, perceptual speed, inductive reasoning.

Later he argued that probably a general g factor was at the head and Spearman also stated that there are group factors that explain intelligence. Vernon felt that there was a hierarchy with the g factor at the top, then two group factors verbally and practically and then the 7 skills of Thurstone divided over the two factors.

Cattell-Horn-Carroll (CHC) theory

According to this theory, intelligence consists of three hierarchical strata. Stratum III is the general intelligence factor g. Stratum II consists of 8 parts, each with a number of specific skills of stratum I which are in brackets:

fluent intelligence (5 specific skills): requires high thinking and reasoning and is used for new tasks that are not done automatically.
crystallized intelligence (10): the breadth and depth of the knowledge of the individual.
domain-specific knowledge (7): knowledge about specific domains that are not typical of culture.
visual-spatial abilities (11): presenting, retaining and transforming mental representations of images.
auditory processing (13): the accurate observation of auditory information and with it the capacity to analyze, understand and imitate patterns or groups of sounds.
memory (13): the ability to consolidate new information and store it in the long-term memory and recall it later through associations.
cognitive processing speed (7): the speed of performing automatic processes, especially when a lot of attention and concentration is needed.
reaction time (5): the ability to make quick decisions in response to simple stimuli.

The skills of stratum I are always expanded and renewed with research. In addition, this model is widely used because the broad and specific skills are empirically verifiable and they include meaningful and implications for the real world. Many psychometricians consider it having the strongest empirical foundation of any theory of intelligence. Others have proposed a larger list that also includes psychomotor, olfactory, and kinesthetic abilities.

Guilford and structure-of-intellect model

Guilford classified intellectual skills in dimensions of operations, contents and products. Operations are the type of mental actions that the test must contain. Contents refers to the nature of the materials or information that is offered. Products includes the different types of mental structures that the brain has to produce in order to arrive at a correct answer. He then had 5 x 5 x 6 = 150 factors of intelligence respectively.

It is a complicated model, but he has raised the suggestion that divergent production, the creation of several good possibilities for a single stimulus (as opposed to only constructing one correct answer to a stimulus situation), can also be part of intelligence.

Planning, attention and simultaneous and successive theory (PASS)

Luria adopted the theory ‘plans, attention, simultaneity and succession', a general theory about cognitive processing. His approach focuses on the mechanisms that process information and is therefore also called the information processing theory. According to him, analysis provides strong evidence for distinguishing between two basic forms of integrative activity of the cerebral cortex, allowing different aspects of the outside world to be reflected. The theory contains three functional units in the brain, whereby the processing of information takes place from the lower units to the higher units:

The first part includes subcortical areas like the brainstem, midbrain, and thalamus. It allows us to regulate attentive processes and selective attention and to resist distractions.
The second part contains the sensory parts of the cerebral cortex (parietal, temporal, and occipital lobes). Their functions are more lateralized and include simultaneous and successive processing.
The third part is in the frontal lobes and is responsible for planning and engine output.

According to this theory, processing starts with attention and planning is the final step.

Simultaneous processing (of information): this is characterized by performing different mental activities simultaneously. An example for which you need this is for example drawing a cube, you need spatial insight, motor skills etc.
Sequential processing (of information): is necessary for mental activities where a correct sequence of activities is needed. This is necessary, for example, for remembering a series of numbers.

Finding tasks that can test these two ways of processing in their pure way is the challenge for applying this theory to intelligence.

Information processing theories of intelligence

These theories propose models on how people represent and process information mentally. It is useful to make the analogy of a computer. The architectural system (hardware) comprises the biological requirements for information processing, such as coding speed. It is quite impenetrable to change by the environment.

The executive system (software) is easily influenced by the environment and it includes components related to the environment that control the problem-solving capacity and leading functional components. An example is metacognition, thinking about thinking, which is important for learning and intelligence.

Gardner and the theory of multiple intelligences

Gardner drew up a number of criteria for an autonomous intelligence:

Potential isolation due to brain damage
The existence of exceptional individuals in an area
Identifiable core operations
Distinctive developmental history
Evolutionary plausibility
Support from experimental psychology
Support from psychometric studies
Susceptibility to symbol encoding

Gardner thus established seven natural intelligences:

Linguistic
Logical-mathematical
Spatial
Musical
Bodily-kinesthetic
Interpersonal
Intrapersonal

Bodily-kinesthetic intelligence includes the skills used by athletes, dancers, mime artists, etc. Personal intelligences include the ability to access both one's own feelings (intrapersonal) and the ability to notice and distinguish someone else’s moods, temperament, motivation, and intentions (interpersonal). People with musical intelligence can easily learn to play an instrument and make their own compositions.

Savants provide an important insight into the existence of different kinds of intelligences. A savant is someone who is mentally inadequate, but has developed himself in a certain area, such as music or art. The theory is compelling because of its simplicity, but the empirical validity is difficult to investigate.

Sternberg and the triarchic theory of successful intelligence

Sternberg has developed a theory that is called triarchic, because it emphasizes three aspects of intelligence.

Componential / analytical intelligence

Internal mental mechanisms responsible for intelligent behavior.

Metacomponents or executive processes: these control the activities of the other intelligence processes, such as planning.
Performance components: the processes that are used to perform a task or solve problems, such as working memory or syllogistic reasoning.
Knowledge acquisition components: processes that are used in learning, such as the use of certain vocabulary in the right context.

Experiential / creative intelligence

Ability to deal with novelty
Ability to automize information processes

Contextual / practical intelligence

Adaptation to the 'real world' environment: if you possess the skills needed for the culture in which you live.
Selection of an appropriate environment: the ability to leave the environment you are in and to select an environment that is better suited to your talents and needs.
Shaping of the environment: the skill to form the existing environment in such a way that it is more consistent with your needs.

For the theory, Sternberg has made the STAT (Sternberg Triarchic Abilities Test), which also contains creative and practical questions.

Topic 5B Individual tests of intelligence and achievement

Intelligence testing is one of the major achievements of the field of psychology. The score of general intelligence is often not what researchers are interested in. If they intending to determine g, any test would do. Often, however, they are looking for specific intellectual functioning and therefore look at the results of subtests, so it is important that they use the test that best describes what they want to know. There are many intelligence tests and new ones are added every month. The following tests are likely to be used in 95% of the assessment cases.

Orientation

This chapter discusses the following intelligence tests:

Wechsler Adult Intelligence Scale (WAIS-IV)
Wechsler Intelligence Scale for Children (WISC-IV)
Stanford-Binet: Fifth Edition (SB5)

Detroit Tests of Learning Aptitude-4 (DTLA-4)
Cognitive Assessment System-II (CAS-II)
Kaufman Brief Intelligence Test-2 (KBIT-2)

Wechsler Scales of Intelligence

The first intelligence test by Wechsler, the Wechsler-Bellevue Intelligence Scale, was released in 1939. He made sure that the test did not have the shortcomings of already existing tests. The existing tests were not available for adults, were too focused on manipulative words, were too focused on speed rather than accuracy, and the notion of mental age was irrelevant.

He drew up the new formula for IQ:

IQ = obtained score / expected mean score for age

He did this, because according to him IQ remains almost constant (IQ constancy), but intellectual skills can change over time. In addition, Wechsler hoped to use the test for psychiatric diagnoses. For this he distinguished between verbal and performance intelligence.

Verbal> Performance (V>P): gave results for organic brain diseases such as psychosis and emotional disorders.
Performance> Verbal (P>V): showed slight mental retardation in adolescents and adults.

Research has shown that there are many exceptions to this diagnostic rule, but it nevertheless showed that distinguishing between verbal understanding and perceptual reasoning (as we now know it) is useful in many areas, such as studying brain-behavioral relationships.

The later Wechsler tests were particularly successful because of the reliable content and formula of the first test, which meant that examiners, among other things, needed little training to switch to a new version.

The latest versions of Wechsler tests all contain the following common features:

Thirteen to fifteen subtests that are used to measure intra-individual strengths and weaknesses rather than one global score.
Empirically based scores and IQ scales. First it was subdivided into verbal IQ and performal IQ, but the latest versions have index scores in four areas:

Verbal comprehension
Perceptual reasoning
Working memory
Processing speed

A common metric for IQ and index score, being an average IQ of 100 with standard deviation of 15. On a subtest this would be an average of 10 with standard deviation of about 3.
Some of the same subtests for different test versions.

Wechsler subtests: description and analysis

Here the different subtests of the WISC-IV (children) and WAIS-IV (adults) are discussed.

Information: this test tests factual knowledge about people, places and general phenomena. Most of these information questions are known when someone has grown up in a Western culture. This subtest also measures learning and memory skills, because one has to retrieve knowledge from memory. The information test is the best measure for general skills and relies heavily on the factor Verbal Understanding.
Digit span: at digit span, participants get a series of digits which they then have to repeat (Digits Forward), have to repeat backwards (Digits Backwards) or put in the correct numerical order (Digits Sequencing). When the person is right, the next series that is a number longer, up to a maximum of 9 digits. Scores on this test can be influenced by anxiety or fatigue, which is often seen in psychiatric patients. The different types of test measure different skills, because repeating digits backwards requires more working memory and skills than forward, which is basically only repeating them.
Vocabulary: in this test different words are mentioned after which the participant must tell what these words mean. They will then receive full scores if they mention all the characteristics of the word and some of the points if they have forgotten an important part. It is the second best measurement for general intelligence.
Arithmetic: in this test calculation problems are verbally presented that the participants must answer. Some items can be tricky, because it must also be answered within a short time. Arithmetic charges the strongest on the factor memory.
Compehension: this raises questions that do not have to do with factual knowledge, but where it is important to be able to explain some concept. An example is: 'What does the proverb mean: Better a bird in your hand, than ten in the air?' There is still a disagreement about whether this subtest has mainly to do with the social functioning of the participant and whether it therefore measures 'social intelligence'.
Similarities: here the participant's ability is tested to separate important unimportant characteristics from objects, facts and ideas. For example, two objects are given (shirt and socks) after which they are asked where they are equal (common concept). The point is that the participant can distinguish the important equation from the unimportant equality (that they both start with the s).
Letter-number sequencing: this gives the participant a series of letters and numbers, after which he first has to put the numbers and then the letters in the correct order. For example, the series R-3-B-5-Z-1-C should then be 1-3-5-BCRZ. This subtest, together with Digit Span and Arithmetic, measures the working memory.
Picture completion: here images are shown in which the participants have to indicate which important part is missing. The test assumes that the participant has seen the object, so it can be bad for participants from a different culture.
Picture concepts: this subtest is used in children. A row of pictures is shown and the child has to indicate which concept the pictures have in common. With each series the concept becomes more abstract and therefore more difficult to guess. This subtest is only in the WPPSI-IV and the WISC-IV.
Block design: here participants must construct a two-dimensional image with three-dimensional blocks. It requires analysis of spatial relationship, visual-motor coordination and it is less susceptible to memory and previous experiences. In the more difficult series, bonus points can be earned when performing quickly.
Matrix reasoning: here participants have to discover a pattern or relationship in a row of figures (simple) or in a 3x3 matrix (more difficult), where the last figure is missing, which has to be chosen from five choices. It mainly measures fluid intelligence.
Object assembly: this is only a test for toddlers and preschoolers, where they get parts of a picture and have to put them together to get the big picture, like a mini-puzzle. This subtest only occurs in the WPPSI-III.
Coding: this is a subtest where a number is tied to a certain symbol. After a few exercises, a series of numbers is given, after which the corresponding symbols must be given in that order. A number of trials must be done within two minutes. Encoding is unique, because it is the only Wechsler task that teaches a new task and tests it the same way.
Symbol search: this is a subtest that measures processing speed, indicating whether a few symbols on the left are located between a larger group of symbols on the right.
Cancellation: in this case, participants must cross out all animals on a large sheet of paper, standing between all non-animal objects. There are two trials: one randomized with the animals and one with a structured rows and columns of animals. In the WAIS-IV this test is more abstract and only two target stimuli are used.
Visual puzzles: here six parts have to be chosen that form an example picture. It charges the strongest on the factor Perceptual Reasoning. This subtest only occurs in the WAIS-IV.
Figure weights: this is part of the WAIS-IV that measures Perceptual Reasoning. There is a scale with figures that are missing on one side. The participant must choose from six options to keep the balance in balance. This subtest only occurs in the WAIS-IV.

Wechsler Adult Intelligence Scale (WAIS-IV)

The WAIS-IV is a significant revision on the WAIS-III. It has retained many old items, but has two new subtests, a simple structure and emphasizes index scores, allowing a sharper boundary to be drawn between discrete domains of cognitive functions. The WAIS-IV consists of 15 subtests, but only 10 core subtests are needed to calculate the IQ score and the index scores. The subtests are divided over the four index scores as follows, instead of only distinguishing between verbal and non-verbal:

Verbal Comprehension Index

Similarities
Vocabulary
Information

Perceptual Reasoning Index

Block design
Matrix Reasoning
Visual Puzzles

Working memory Index

Digit Span
Arithmetic

Processing speed Index

Symbol Search
Coding

The verbal comprehension index (VCI) is similar to the old verbal IQ concept. Perceptual Reasoning Index (PRI) is related to the former notion of Performance IQ. Verbal Understanding and Perceptual Reasoning are used today, because the concepts fit better in the factor analysis.

The Working Memory Index (WMI) consists of subtests that require attention and direct memory. Relatively low scores can therefore indicate concentration problems. The Processing Rate Index (PSI) consists of subtests that require rapid processing of visual information and is sensitive to many conditions. The WAIS-IV is stricter in its criteria than the previous three WAIS tests. In addition, this successor is useful for both the healthy population and the population with mental problems.

Standardization: the standardization of the WAIS-IV was done in 13 different age groups from 16 years old with a total sample of 2200 adults. The sample was stratified on the variables gender, ethnicity, education and region. It was done with cooperative, healthy, English-speaking people without brain damage.
Reliability: the WAIS-IV has good reliability in terms of split-half reliability for the index scores and the full IQ scores. This means that the WAIS-IV is also a good test for a special population in which only one index score needs to be examined.
Validity: the WAIS-IV has a good criterion validity, because it has high correlations with existing intelligence tests. It also has good convergent and discriminant validity. The validity is also well proven by the strong overlap with the previous three editions of the Wechsler intelligence tests

Wechsler Intelligence Scale for Children-IV (WISC-IV)

Several errors were found in the first version: in the standardization sample there was only one culture group (whites), ambiguities about scoring, inappropriate items for children and no use was made of women or African Americans in the pictorial content of items. This has all been adjusted in the WISC-IV.

The WISC-IV has 10 core subtests and 5 additional subtests. The core subtests are block design, similarities, digit span, picture concepts, coding, vocabulary, letter-number sequencing, matrix reasoning, comprehension and symbol search. The additional subtests are picture completion, cancellation, information, arithmetic and word reasoning.

The additional subtests are not used to calculate the general IQ scores, but may be useful firstly because of the important diagnostic information they provide. Secondly, they can serve as an alternative to a core subtest, when a child is not able to perform such a test, for example due to motor problems.

The standardization of the WISC-IV was good, because the agreement between the standardized sample and the data from the population register was almost perfect. This is the case, because 5.7% of the sample consisted of children who had certain characteristics, such as giftedness, learning disabilities, autism or motor problems.

The reliability of the WISC-IV is good and comparable with previous versions. The coefficients of the entire IQ score are around .90, but for each subtest the coefficients vary from .79 to .90.

The validity of the WISC-IV is also good. The correlations between the WISC-IV and the WISC-III with subtests rank from high in the .70 range to low in the .80 range. The correlation for the overall IQ score is higher, namely .89. The convergent validity and discriminant validity of the WISC-IV was also high.

By means of factor analysis, different indexes can also be ordered for the WISC-IV, also with an average of 100 and standard deviation of 15:

Verbal Comprehension Index

Similarities
Vocabulary
Comprehension

Perceptual Reasoning Index

Block design
Picture Concepts
Matrix Reasoning

Working memory

Digit Span
Letter-Number Sequencing

Processing speed

Coding
Symbol Search

Stanford-Binet Intelligence Scale: Fifth Edition (SB5)

The versions for the fourth version of the Stanford-Binet Test could not be used for an analysis of the subtests. Only an overall IQ score was given.

The SB5 has five factors of intelligence that are divided over two domains, resulting in 10 subtests:

Domains
Non-verbal	Verbal
Non-verbal fluent reasoning	Verbal fluent reasoning
Non-verbal knowledge	Verbal knowledge
Non-verbal quantitative reasoning	Verbal quantitative reasoning
Non-verbal visual-spatial processing	Verbal visual-spatial processing
Non-verbal working memory	Verbal working memory

Before the test is taken, a routing procedure is carried out to estimate the general cognitive skills of the participant. It ensures that the number of items can be reduced during the test without introducing measurement errors. This is possible because the test is based on the item response theory and difficulty levels and other parameters are already known since the development phase.

The SB5 can be used for very young children up to gifted adults. The non-verbal subtests can be used well for people who do not have a good command of the language, such as immigrants or the deaf. An important characteristic of the SB5 is that there are items that can distinguish giftedness and mental retardation from the 'normal' IQ scores. In addition, we also looked at whether the test is fair when it comes to cultural, gender, and religious differences. This is the first test that looks at religion. Hearing-impaired people and people who speak poor English or have communication problems can also make this test. The factor on working memory contains both verbal and non-verbal memory which can offer a solution in research on children with ADHD.

The standardization was done at 4800 persons based on the American population register. The validity of the SB5 is good with correlations above .70. The robust correlations with the SB4 show that the SB5 is valid, but also more usable than the SB4. Because the SB5 is so useful for both extremes of the cognitive spectrum, it will be a good tool to test individual intelligence.

Detroit Tests of Learning Aptitude-4 (DTLA-4)

The DTLA-4 is a test used for children between 6 and 17 years of age. With the test, 16 compositions are calculated, including general intelligence, optimal level, and 14 skills. It contains 10 subtests that are very similar to the Binet-Wechsler tests. The general mental skill composition is calculated using the standardized scores of the 10 subtests.

The optimal level composite is based on the four highest standard scores and shows how well the participant performs under optimal conditions. For the other 14 composite scores, combinations of certain subtests are used that together measure one of the following attributes:

Linguistic (verbal vs. non-verbal)
Attention (attention-enhanced vs. attention-reduced)
Motoric (motor-enhanced vs. motor-reduced)
Smooth intelligence vs . crystallized intelligence (Horn & Cattell)
Simultaneous vs. successive (Das & Naglieri)
Associative vs. cognitive (Jensen)
Verbal vs. Performal (Wechsler)

They are used to give contrasting composites with significant differences that can result in a diagnostic explanation. For example, someone who scores well on attention-reduced tasks, but very badly on attention-enhanced tasks, probably has attention problems.

A problem with this test is that there is not enough empirical evidence for the composites. Another problem is that there are more composites than subtests, so that the composites will have high intercorrelations. So it is a good test for general intelligence but not for the composites. Reliability and validity are okay.

Cognitive Assessment System-II (CAS-II)

The CAS-II is based on the PASS theory of Das & Naglieri discussed earlier. The test is for children from 5 to 17 years. The test consists of 12 subtests and takes 60 minutes. There is a smaller version of 8 subtests but it is not recommended to use them. For each scale certain subtests are available:

Planning

Matched numbers: the child must underline the two identical numbers in a number of rows of 6 numbers.
Planned codes: a code is learned and then this is tested by having to enter the missing codes.
Planned connections: lines must be drawn between points in order of alphabet and number (1-A-2-B-3-C, etc.).

Attention

Expressive attention: this is a kind of Stroop task, in which the color of the words should be mentioned.
Number detection: the child must underline specific digits, but only if they are printed in bold font.
Responsive attention: first, the child underlines letter combinations that are physically the same (TT, not Tt), and then combinations that are the same in name (Bb, not Ba).

Simultaneous processing

Non-verbal matrices: a logical shape must be found in a 3x3 matrix so that the empty box at the bottom right can be entered.
Verbal spatial relations: six pictures with figures are shown and the child has to find the relationship in response to a specific question (e.g.: point out the square to the right of the circle).
Figure memory: a drawing is shown and then the child has to recognize the drawing in a larger and more complex picture.

Successive processing

Word series: a series of two to nine words must be repeated.
Sentence Repetition: sentences must be read aloud and then repeated, when they can no longer be seen.
Sentences questions or speech rates (8 to 17 years): questions are asked about sentences that need to be answered.

The scores of children with ADHD exceeded the scores that were expected based on already known information about the scores of children with ADHD.

The difference between black and white children is minimal when checking for background. In addition, high correlations between scores and school performance were found. The test is therefore useful for special education. It is a promising test that deserves to be used more often. The reliability and validity are very good.

Kaufman Brief Intelligence (KBIT-2)

There are two major drawbacks to the aforementioned tests:

The tests take a lot of time, at least one hour.
The person who takes the test must be well trained to be able to administer the tests properly.

Kaufman then developed an easy and short test to solve these problems. The test consists of a verbal scale containing two types of items: verbal knowledge and riddles and a non-verbal scale consisting of matrix items (2x2 and 3x3 figurative analogies).

This test is useful for people between 4-90 years and can be taken in about 20 minutes. Another M = 100 and SD = 15 is used. Kaufman makes it clear that this test can not replace the traditional tests, it is more of an instrument for screening, for example for the following purposes:

Making a quick estimate of intelligence where accurate scores are not essential.
Estimation of verbal versus non-verbal intelligence.
Re-evaluating the intellectual status of previously tested people.
Screening students who may benefit from placement in gifted programs.
Screening students with a high risk who may need further assessment.
To obtain a quick estimate of intelligence when treating adults in insitutions

Individual tests of achievement

Where the intelligence tests measure the broadness of a person's mental skills, achievement tests measure what a person has learned in school or a course. The focus here is on individual tests, so they are conducted 1 on 1.

Kaufman Test of Educational Achievement-II (KTEA-II)

The KTEA-II is used for children from 4.5 years to 25 years. There is a short version of 3 subtests that can be used for people who are even older than 90. For the mapping of learning problems only the full version is recommended. It consists of eight subtests divided into four areas:

Reading

Letter and word recognition
Reading understanding

Mathematics

Math Concepts and applications
Math computation

Written language

Written expression
Spelling

Oral language

Listening comprehension
Spoken expression

Three composite scores can be calculated: reading, math and written language. In addition, a total score can also be calculated. For diagnostic purposes there are also more subtests for reading skills. For older children this test takes 80 minutes, for younger children 30 minutes.

Nature and assessment of learning disabilities

A learning disability can be described as a serious difference between the general intelligence and specific performances in one of the following areas: spoken expression, listening comprehension, written expression, basic level reading, reading comprehension, mathematical calculation and mathematical reasoning. A serious difference was described as one or more standard deviations between a general intelligence and a specific performance. However, this definition was too strict, so that children who did have a learning disability did not come through selection procedures but did need more help.

The new definition of learning disability described it as intrinsic to the individual, a dysfunction of the central nervous system being the cause, and that it can reach into adulthood. With this definition a better separation was made between skill and performance.

Since 2004 there has been a new approach to identifying learning problems. No more attention is paid only to the discrepancy model, but also to the response to an intervention (RTI). Scientifically developed interventions are being used more frequently. This method is therefore not only about identifying, but also about increasing the capacity of the school system to be able to respond to the different needs of the pupils with disabilities.

The approach consists of several steps and starts at the beginning of the year with a general screening of all children to see who is at risk. The children who stand out will receive an intervention for 8 weeks, and if they do not catch up, another intervention will be used again for 8 weeks. If it still does not work, it will be examined on an individual basis where the problems come from and the child may be placed in special education.

Ultimately, there are five characteristics to indicate a learning disability:

A learning disability is a discrepancy within a person in terms of cognitive functioning. But it is not just a difference between a general IQ score and performance test scores.
The learning disability should not be determined by other limiting conditions, such as retardation or visual impairment.
A learning disability is heterogeneous, so there are many different variants of it.
A learning disability often arises early in childhood and can exist into adulthood. This is investigated by means of longitudinal studies. It is a developmental phenomenon.
People with learning disabilities often experience social and emotional problems that are often just as serious as deficits in academic performance. These problems can also exist in adulthood.

The cause sometimes lies, as with dyslexia, in the brain. In addition, the left hemisphere is limited. Restriction of the right hemisphere involves problems in non-verbal tasks. Often, however, the cause of a learning disability is not known.

It is clear, however, that learning disabilities have a major impact on the lives of the children. That is why individual performance tests are best for measuring this. These tests are taken individually and measure academic skills. This is the point where there are problems. It is often about identifying specific problem areas and this should be taken into account when choosing and administering a test.

How are 'special populations' tested? - Chapter 7

Subject 7A: Infant and preschool assessment

Assessment of the infant capacities

The baby and toddler period (preschool period) lasts from birth to about 6 years old. In this, children develop basic reflexes and motor skills. However, some children lag behind and therefore, parents want to know how far the child is behind and whether their cognitive emotional development is going well. On the other hand, there are also children whose development is far ahead of others. Assessments for infants and toddlers help to investigate this.

The 'Neonatal Behavorial Assessment Scale' (NBAS) is used to investigate the behaviors of newly born children. 28 behaviors are scored on a 9-point scale. Examples are reaction to light, cuddiness and orientation to inanimate visual stimuli. 18 reflexes are also controlled, such as the Babinski reflex, sucking reflex, grasp reflex or the rooting reflex. Finally, the qualities of responsiveness of the weak children are tested with, among other things, the quality of alertness, general irritability, and emotional response.

The NBAS is used as feedback for the parents, so it does not have a real scoring table. The developers want to emphasize that a close parent-child relationship is most important. There are researchers who have issued a scoring system for the NBAS with 7 clusters: habituation, orientation, motor performance, arousal/lability, regulation, autonomic stability and reflexes. The reliability of this scoring system is low, so one has to be careful with interpretation. The development of babies can change quickly and strongly.

The Bayley-III is available for children from 1 to 42 months. The test consists of 5 domains with representative scales, namely:

Cognitive scale: 91 items, including sensory accuracy, perceptive skills, attention, puzzle solving, matching of colors, counting and investigative and manipulating behavior. This scale has no subscales.
Language scale: 48 items, includes receptive and expressive communication, such as recognition of sounds, non-verbal expression, identification of action pictures, naming objects, answering questions and following simple instructions. This scale has three subscores: expressive communication, receptive communication and language control score.
Motor scale: 138 items, includes gross and fine motor skills, such as manipulation of objects, functional use of the hands and motor planning. This scale has three subscores: gross motor skills, fine motor skills and a general motor score.
Social-emotional scale: 35 items, includes interactive and purposeful use of emotions, expressing feelings and connecting ideas and emotions with each other. This scale has no separate subtests.
Adaptive behavior scale: parents fill in how the child scores on components such as communication, health and safety, self-care. This scale has a subscore for each component.

The Bayley-III does not follow an overall score, because this can be misleading due to the many skills that are tested. The test is precisely meant to use scale scores for diagnosis and assessment. The reliability of the test is only reasonable, because it is nevertheless a test for young children, which develop quickly, which makes later test scores different (especially for fine motor skills). The validity is good with correlations between .72 and .79.

The Devereux Early Childhood Assessment Clinical Form (DECA-C) is made as an assessment for children from 2 years to about 6 years with social or emotional problems. The test focuses on the protective factors of (the environment of) the child who can partly absorb the problems. The test also contains problem scales.

The test is based on the 'resilience theory' which states that protective factors exist on three levels: environmental, family and within-child. The protective factors can be formed into a Total Protective Factor Score by means of three scales:

Initiative: investigates the child's ability to think independently and behave according to his needs. Items resemble statements like 'the child gets things by himself'.
Self-control: examines the capacities of the child to experience and express his different emotions in a socially desirable way. Items resemble statements like "the child can control his temperament".
Attachment: examines the formation of strong and long-lasting relationships with parents, family and teachers. Items resemble statements like "the child accepts the comfort of adults when he is upset."

The problem factors can be formed into a Total Problem Factor Score by means of four scales:

Attention problems: examines the child's ability to focus on a task and not be distracted by the environment. Items resemble "the child loses concentration quickly."
Aggression: measures aggressive or destructive behaviors towards other people or things. Items resemble "the child destroys personal property of others".
Withdrawal / depression: investigates self-reflection and emotional / social withdrawal. Items resemble 'the child seems to live in his or her own world'.
Problems with emotional control: measure the problems with controlling negative emotions related to purposeful behavior. Items resemble "The child is upset when things do not go his or her way."

Several studies showed that the reliability of the DECA-C is good. The test also appears to be valid and useful.

Of course, while researching young children one must take into account the troublesome fact that they are still in full development, and thus often do not listen to instructions. This makes the investigation very difficult.

Assessment of preschool intelligence

When examining the intelligence of young children, it must be taken into account that low scores do not directly mean that the child has low cognitive skills. It can also be due to the concentration of the child, or to, for example, environmental factors.

The following three tests are intended to measure intelligence in children.

DAS-II

The Differential Ability Scales-II (DAS-II) can be used for three age categories, but here we focus on preschool children aged 3 to 6 years. The DAS-II consists of ten core subtests, used for measuring cognitive skills, and 10 diagnostic subtests, used for additional information about readiness for school and information processing.

The core subtests correspond with the g factor of intelligence and contain three clusters for verbal power, non-verbal reasoning power and spatial power. From this, a General Conceptual Ability (GCA) and a Special Nonverbal Composite (SNC) can be calculated. The GCA is seen as a precursor of the IQ.

The diagnostic subtests of the DAS-II are only used for clinical analyzes and are less influenced by the g factor. They provide useful information for discovering learning difficulties and the readiness for school, and can therefore also be used to replace core subtests.

The DAS-II is a reliable measuring instrument due to the high reliability coefficients, but as is often the case with research on young children, test-retest reliability studies over several weeks are less high, due to the development of the children. The validity is good, given the high correlations with comparable tests.

WPPSI-IV

The Wechsler Preschool and Primary Scale or Intelligence-IV (WPPSI-IV) is used for children aged 2 to 7 years, but here only the test for older preschool children is discussed: from 4 to 7 years. An advantage of this test is that child-friendly and playful stimulus material is offered.

There are five primary index scales, each with two subtests that summarize the cognitive skills of children. The complete WPPSI-IV includes 13 subtests, but only 6 (the italic) are required to calculate an IQ score.

Verbal comprehension, subtests: information and similarities.
Visual spatial, subtests: block design and object assembly.
Fluid reasoning, subtests: matrix reasoning and picture concepts.
Working memory: subtests: Picture memory and zoo locations.
Processing speed: bug search and cancellation.

In addition, there are four subordinate index scales: vocabulary acquisition, nonverbal, general ability and cognitive proficiency. These index scales can be useful in special circumstances, such as the assessment of deaf children or children with slow processing.

Early SB5

The Stanford-Binet Intelligence Scales for Early Childhood (Early SB5) combines the SB5's sub-tests with new Test Observation Checklists (TOC) and parent reports. These are used to get an idea of the child's behavior during the test. A certain behavior, such as not answering a question, can have multiple meanings in children. The child could actually not know the answer, but the child could also be bored, afraid or distracted.

The TOC consists of a part that looks at the characteristics of the child and a part that looks at specific behaviors of the child. Characteristics of the child include: motor skills, activity levels, attention / distractibility, impulsivity, language.

Specific behaviors include: consistency in performance, mood, frustration tolerance / aggressiveness, change in mental set, motivation, fear of failure, degree of cooperativeness or refusal, anxiety, need for redirection, parental behaviors, representativeness of test behaviors.

Practical utility of infant and pre-school assessment

Test scores obtained in the first two years of life often seem to have minimal predictive validity. The important role that these tests nevertheless play will be discussed after a review of predictive studies.

Predictive validity

It is often found that there is a positive correlation between the preschool results and the results of children when they are at school. However, this correlation is not very strong. The correlation only becomes stronger when the children are 19 months old. Only then do the results have a strong, significant and meaningful correlation with the later IQ. The following rule applies: the older the child is during testing, the stronger the relationship with the later IQ. Around the 8^thyear of life IQ remains fairly stable. The results show that tests with infants have a low predictive validity and with preschool children or toddlers an average.

Practical utility of infant scales

The main reason for tests in infants is screening for developmental disorders, because it requires early intervention and thus ensures better outcomes later in life. Exceptionally low scores on the Bayley test, especially on the mental scale, can predict mental disability. Furthermore, for tested 'risky' infants the predictive validity is higher than for normal children. It is therefore useful to test infants because it can predict disability later in life.

Fagan Test of Infant Intelligence (FTII)

Lewis found that traditional tests overlooked early information processing processes that can predict cognitive functioning. For example, in his test visual habituation for new stimuli is examined in infants aged 3 months. The correlations with results at later ages were average, which means that cognitive functioning can be predicted early on.

Fagan developed his FTII with inspiration from this test. A face is shown to a baby, and later a similar face is shown, or the same face, but then the face looks differently. It is measured how long the baby looks at the new face. The reliability is high, due to a large inter-assessment agreement. The predictive validity of the FTII is strongly doubted because low correlations are found with the results for cognitive functioning in later life. That is why FTII is seen as a screening instrument for developmental delays rather than as a predictor of general intelligence.

Screening for school readiness

To define what school readiness is, five models are used:

Maturationist model: according to this model, age is the best way to determine whether a child is ready to go to school, because it is the best indicator for the development of the child.
Environmental model: here the readiness is based on what skills the child has in social experiences, so parents have to decide on this.
Constructivist model: here a child is ready for school if it can not only interact well with teachers, but also with other knowledgeable peers and adults, so several people have to co-decide.
Cumulative-skills model: here the readiness is determined by looking at how many skills the child already possesses that are needed to learn fundamental subjects, such as math and reading.
Ecological model: here the readiness is seen not only within the child itself, but as an interaction between the child's developments and the child's environments. From this follows a complex and qualitative evaluation.

Screening tests must be quick and simple, but they may have certain errors. There may be false-positive children who fail the test, but who do not really lack anything. And there may be false-negative children who do pass the test, but actually have a developmental deficit. Glascoe and Shapiro (2005) gave five pitfalls in screening tests:

Wait until the problem is observable.
Ignore screening results.
Rely on informal methods.
Use of inappropriate tests.
Assume that services do not exist or are limited.

A good screening test should meet the following criteria:

The primary purpose is screening, so do not investigate, diagnose or predict.
Screening is provided in areas such as motor, language, cognitive, social and emotional functioning.
Test-retest reliability must be at least .70.
Concurrent validity must be at least .70.
Sensitivity and specificity for positives and negatives should be at least .70.
The test should last less than 30 minutes.
The test must be able to screen children from different cultures.
A minimum of expertise is needed to take the test.

DIAL-4

The Developmental Indicators for the Assessment of Learning-4 (DIAL-4) is a fast and efficient screening test for ages 2 to 5 years. It tests five areas. Three of them are important development domains: motor, concepts and language. The other two (self-help and social-emotional) are filled in by the parents and teacher of kindergarten.

The scoring leaves room for subjective interpretations, which reduces the reliability of the test. Using norm scores, it can be said whether a child scores average in an area or if he has a 'potential delay’. The reliability is reasonably good. The content validity and criterion validity were also high. The construct validity was tested to see if the three areas matched the items and that was true.

Existing scepticism is mainly about the practical use of the DIAL-4. Sensitivity figures ranged from .73 to .82, which is high, but that still means that 18 to 27 percent of potentially delayed children are not recognized as such. Specificity numbers ran from .82 to .86, which means that 14 to 18 percent of normal children are screened as potentially delayed.

Denver-II

This test consists of 125 items divided into four areas: personal-social, fine motor adjustments, language skills and rough motor skills. The test does not produce a developmental quotient, but a score that is described as normal, questionable or abnormal. The reliability is very high with numbers above the .90. However, the predictive validity of the Denver-II is questionable, because the sensitivity is high, but the specificity is not high; many normal children were labeled as doubtful or abnormal.

HOME

The Home Observation for Measurement of the Environment (HOME) is used to observe the physical and social environment of the children while they are at home. The HOME followed after the SES. It was stated as a hypothesis that children of a higher social class enjoy a richer and warmer environment. This was indirectly measured with work and education from the parents.

However, the HOME allows one to directly investigate whether certain crucial interactions and experiences are present. It measures the quality and quantity of stimulation and support for cognitive, social and emotional developments that are available at home.

The researcher scores dichotomically whether certain items are present or not. These are subdivided into six subscales:

Emotional and verbal responsivity of the parent
Acceptance of the child’s behavior
Organization of the environment
Provision of appropriate play materials
Parent involvement with child
Variety of stimulation

The reliability of the HOME is reasonable. Some subscales have few items, so for these the reliability can not be very high. The interobserver agreement is high, as are the internal consistencies. The validity of the HOME is also average. The correlations with the SES should be significant, but not fully related, and studies showed that this is indeed the case.

The HOME is not only promising in research, but also as a practical instrument for interventions.

Subject 7B Testing persons with disabilities

Non-language tests

These are tests that have as little written or spoken language between examiner and examinee as possible. The Leiter-R test is a non-verbal test for children aged 2 to 21 who do not need a spoken language and can therefore be used for children who do not speak the language, children with autism, brain damage and speech or hearing problems. Children must match small cards with figures with an example. The Leiter-R test has 20 subtests divided over two test batteries: Visualization & Reasoning and Memory & Attention.

It is a reliable test with numbers above .90. Because the Leiter-R is also only non-verbal, it can be used for different target groups. The empirical studies support the use of the test, because it proved useful for medically weak children, low-functioning children with autism and children with a language deficiency. The validity was also high, namely r= .80 (correlation with another measure of nonverbal intelligence).

Many children enjoy drawing human figures and can do so routinely, so Florence Goodenough (1926) invented the Draw-A-Man test reviewed by Harris in 1963. However, this was not entirely non-verbal, because the instructions were given orally. The goal was to measure intelligent development, so the more detailed the drawing was, the better the score.

Naglieri used this technique and developed thein 1988 Draw A Person test (DAP), adding a quantitative scoring system. The DAP is praised for its clear scoring, strong reliability and good standardization, but the validity is less good. It does not effectively identify children with learning disabilities and developmental defects.

The Hiskey-Nebraska Test of Learning Aptitude (H-NTLA) is a non-linguistic test and is used for children aged 3 to 17 years. It consists of 12 subtests, from which a Deviant Learning Quotient (LQ) is calculated. The reliability is high, but becomes less after a number of years. The validity is reasonable, given the correlations with other performance tests.

The Test of Nonverbal Intelligence-4 (TONI-4) is a non-linguistic test for people aged 6 to 90 years. It consists of 60 abstract figures in which the participants have to discover relationships between a number of figures. The test has three scoring methods: age comparisons, percentile grades and TONI-4 quotients (average 100, sd 15). The reliability is good (> .90). Furthermore, the TONI-4 is recommended as an intelligence test for people with poor language skills.

Nonreading and motor-reduced tests

The Peabody Picture Vocabulary Test-IV (PPVT-4) is useful for people who can not verbally express themselves and for people who have weakened motor conditions. The test has two parallel versions, four of which have practice plates and 228 testing plates that contain pictures. The tester pronounces a stimulus word and the person must designate the picture that best suits it.

The reliability is high, but is still disappointing among children of foreign origin. The validity of the PPVT-4 is good, but only as a measuring instrument for vocabulary and not as a measuring instrument for general intelligence.

Testing persons with visual impairments

People who are legally blind, who see less than 10% in the right eye, need special tests to be properly examined. The Wechsler test that has been converted for this purpose is the Haptic Intelligence Scale for the Adult Blind (HISAB). It consists of six subtests of Digit Symbols, Block Design, Object Assembly and Picture Completion. The remaining two are Arithmetic sums and Matrix Reasoning using a pattern board.

Another instrument is the Blind Learning Aptitude Test (BLAT), for children aged 6 to 16 years. This test consists of six subtests that work with a kind of braille, also consisting of dots and lines. .

The Intelligence Test for Visually Impaired Children (ITVIC) replaces the nonverbal subtests of IQ tests requiring intact vision with tests that require only sensation, so-called haptic subtests. The verbal subtests remain as usual. This is for children from 6 to 15 years.

Testing people who are deaf or hard of hearing

A test in sign language can not always be the solution, because not everyone has the same sign language. Using someone who translates into sign language can be bad for validity because the translator can change the content unconsciously. That is why it is best for the examiner to be proficient in standard sign language, so that the test remains standardized. The Wechsler tests are best used for deaf or hearing-impaired people.

Assessment of adaptive behaviour in intellectual disability

It is difficult to investigate intellectually disabled people. Someone is intellectually disabled if his IQ is below 75. Intellectual disability is indicated on a continuum of mild, medium, severe and very serious. In addition, the person must also have difficulty with adjustments of behavior in conceptual, social and practical skills.

These skills are examined with different measuring scales. The Scale of Independent Behavior-Revisted (SIB-R) consists of 259 items divided over 14 subscales divided into four clusters: motor skills, social and communication skills, personal life skills and social skills. The scores from these four clusters give an Independence Score. In addition, the SIB-R indicates whether there are problem behaviors and how serious they are.

The SIB-R thus gives a good picture of how the person functions in real life at home, at school and in public places. In addition, standardization, reliability and validity are also sufficient.

The Inventory for Client and Agency Planning (ICAP) is used for children and adults who have become dependent from birth or later in life. The focus is on determining which kind of care is needed. The test is completed by a care provider and there is also room to indicate problem behavior and maladaptive behavior. The advantage of this instrument is that there is a Service Score that indicates how high the urgency is for attention, supervision and training for the person. The lower the score, the higher the urgency.

Lastly, there is the Vineland Adaptive Behavior Scale (VABS-II) which provides an evaluation in the following areas: communication, daily living skills, socialization and motor skills. The predictive validity of the scale is good.

Assessment of autism spectrum disorders

Autism is not a single disorder, but multiple disorders in a spectrum. People with autism differ from each other, but all have difficulties with social skills, communication skills and flexible behavior. In addition, people with autism often lack empathy.

A measuring instrument for autism is the Modified Checklist for Autism in Toddlers (M-CHAT) for 16 to 30 months old children. It is a checklist of 23 items that indicates that the child may be autistic. The M-CHAT, however, has a high false-positive ratio, but that can be acceptable for the number of children who are well diagnosed. It is very important that children with autism are detected early to help them at the earliest possible age.

Another frequently used instrument is the Baby and Infant Screen for Children with Autism Traits, BISCUIT-1. It consists of 71 items on a 3-point scale for toddlers aged 17 to 37 months. The BISCUIT-1 is also a good measuring instrument for discovering autism.

Where do personality tests originate? - Chapter 8

Subject 8A Personality theories and projective techniques

Personality: An overview

Personality is a vague concept, but we can distinguish two characteristics. First, each person is consistent to a certain level, we have coherence characteristics and action patterns that occur repeatedly. Second, each person is distinctive to a certain level. Personality describes the differences in behavior between people and the consistency of behavior within a person.

Psychoanalytic theories

Psychoanalysis was a creation by Sigmund Freud. Freud developed his general theory of psychological functioning during the investigation of hysteria. His theory is about the unconscious as a foundation of psychological functioning. He said that our unconscious mind consists of thoughts and wishes that are too unacceptable to enter consciousness. The most significant motivations are therefore not in our consciousness. Freud also thought that these motivations came out in a hidden way in dreams. Early in the twentieth century, a large range of tests has been developed to expose Freud’s unconscious, such as inkblot testing, word association approaches and storytelling tests. These tests could supposedly expose the unconscious by interpreting the ambiguous and unstructured answers of the client in the right way. These tests have also had a major impact on personality tests.

According to Freud, the mind consists of three parts:

Id: is completely in your unconscious mind and is the part that concerns instinctual need, such as eating, drinking, sexual gratification and the avoidance of pain. The id has one goal and that is to meet these needs immediately according to the pleasure principle. This principle means that there is an impulse towards the immediate fulfillment of the need without taking into account values, right or wrong, or morality (this is called the pleasure principle). The id also has no logic or sense of time (we are born with it).
Ego: this is our consciousness. The goal of the ego is to mediate between the id and the reality. It is part of the id and a servant of it. The ego is therefore largely conscious and works according to the reality principle. It seeks realistic and safe ways to discharge the impulses coming from the id (this develops just after birth).
Superego: the ethical part of our personality that develops in the first 5 years of our lives. The ego and superego have to compete with each other in order to obtain gratification or not. The superego is about the social standards of right and wrong that we learn from our parents. The superego is partly conscious, but largely unconscious. It tries to suppress the actions of the id and ego and its biggest weapon is to blame. The ego must therefore not only find a safe and realistic way to meet needs, but also a morally correct way to avoid punishment from the superego. The superego also has to do with the ideal ego. The ego measures itself with this ideal image and tries to get as close to it as possible. The ideal contains our goals and aspirations. When we fail to achieve our ideal, we can feel guilty and inferior.

The ego can do its work by having access to various mental strategies called defense mechanisms. Defense mechanisms have three general characteristics. First, they supress fear that arises from conflicting demands of the id, superego and reality. Fear and defense mechanisms are complementary concepts for Freud. Secondly, defense mechanisms all operate unconsciously, even though they are carried out by the conscious ego. Third, they distort the inner and outer reality, so that fear can be reduced.

Psychotic defense mechanisms are the least healthy because they distort reality extremely. They include the denial of reality and the extreme distortion of reality (delusions). The second group of mechanisms is 'acting out' and consists of unadaptive behaviors, such as aggressive or impulsive behavior. Borderline mechanisms, the third group, ensure that the image of others (or of the self) changes from very good to very bad. They include the splitting of personality traits and projective identification. The fourth group are neurotic mechanisms and they include small changes of reality. Use is made of repression and displacement. Obsessive mechanisms are common and include, for example, the isolation of affect or intellectualization. The last group, 'mature' mechanisms, have the least change of reality and appear as convenient virtues. They include behaviors such as altruism, humor, suppression, anticipation or sublimation.

A test has been issued to score these defense mechanisms: The Defense Mechanism Rating Scale (DMRS). It is scored quantitatively and an Overall Defensive Functioning Score is issued. Instruments such as the DMRS can provide empirical validation for psychoanalytic theories, but their use requires a lot of training and time.

Type theories of personality

Friedman and Rosenman examined psychic variables that increase the risk of cardiovascular disease. They concluded that people with a Type A behavioral pattern are more susceptible to this type of disease. Type A persons are always uncertain, regardless of their performance, often do not care about the feelings of competitors, are hostile and quickly irritated. They also feel a kind of pressure to have things done quickly. They often use multitasking. Several studies have been done to find further evidence that type-A behaviors are associated with an increased risk of CHD, but there are several results. The strongest evidence is found for white working people.

However, several studies have shown that Type A behavior is not a single risk factor for cardiovascular disease, but that it is more specific components of type A behavior that can cause this. The behavior can be measured by means of interviews or questionnaires. Questionnaires are less good, because non-verbal behavior can not be noticed, even though these are often signs of type A behavior.

Phenomenological theories of personality

Rogers has had the most influence within this approach. His contribution to personality theory, known as self-theory, has been extended and is admired by many psychology students. In addition, he helped to make the Q-technique popular.

This technique is a procedure for studying changes in the self-concept. The test consists of many cards where statements arise. The client must then, out of 100 of these cards, pile 9 stacks for which the amount of cards per stack is fixed, to create a kind of normal distribution. The cards that are most fitting to the client must be placed on one side and the cards that describe him least on the other side. The cards that he is indifferent about or can not decide on end up in the middle. The researcher can form the items himself according to the needs of the client. The scoring of this test is done by comparing the distribution with an already established standard. The test can also be scored as Rogers did: comparing an examinee’s self-sort with his or her ideal sort. The discrepancy is then used as an index for adjustment. His clients had to divide the stack twice with the following instructions: 1) self-sort, sort the cards to describe yourself as you see yourself today. 2) Ideal sort, sort the cards now to describe the ideal person, the person as you would like to be.

Behavioral and social learning theories

An important assumption is that many of the behaviors that characterize personality are taught. Behaviorists state that the environment forms and maintains behavior. They fundamentally disagree with the role that cognitions play in behavior. Social learning theories state that we learn expectations and rules about the environment and not just stimulus-response connections. Modern social learning theories state that cognitions influence actions.

Rotter developed the Internal - External Scale (IE Scale) to measure the internal / external locus of control. The locus of control refers to the source of things that happen to people. The IE Scale thus measures whether the participants feel that they have control over events (internal locus of control) or that the control does not lie with them (external locus of control). An internal locus is often more positive than an external locus of control.

Bandura developed the concept of self-efficacy. Self-efficacy is about the extent to which people think they are able to respond / act on certain situations. It explains why good knowledge does not always deliver efficient actions. He has also drawn up measuring scales for self-efficacy, in which participants must indicate how well they find themselves functioning on certain tasks.

Trait conceptions of personality

A trait is every way in which an individual differs from another in the long term. The theories about this differ or they classify personality into categories or dimensions. Cattell distinguished between 'surface traits' and 'source traits'." Surface traits are obvious aspects of personality that are easy to see in behavior. 'Source traits' are the stable and constant sources of behavior. They are less visible, but more important for explanation of behavior. He analyzed underlying personality traits through factor analysis. This resulted in 16 personal traits with which he drew up the Sixteen Personality Factor Questionnaire (16PF).

The five-factor model of personality states that five dimensions form the basis for personality. They have been explained by means of a fundamental lexical hypothesis: terms of personality traits have remained in the language because they provide important information about our interactions with others. In addition, there is also evolutionary support for this theory. The 5 traits contain differences between individuals related to basic evolutionary functions such as survival and reproduction. According to Goldbert, people ask themselves 5 questions when it comes to their interaction with others:

Is X active and dominant or passive and a follower?
Is X agreeable?
Can I rely on X?
Is X crazy or stable?
Is X smart or stupid?

All these evaluations have, directly or indirectly, something to do with survival and reproduction. They also correspond to the 5 character traits.

In English the dimensions can be remembered with the acronym OCEAN:

Openness to Experience
Conscientiousness
Extraversion
Agreblenessen
Neuroticismtests have emerged from

Several personality tests have been inspired by this, such as the NEO-PI-R and the NEO-FFI. There are various comments on trait approaches to personality. Firstly, it is not clear whether they cause behavior or only describe behavior. Secondly, the theories have a low predictive validity.

The projective hypotheses

In projective tests, participants are offered unclear stimuli and must respond with their own constructions. The projective hypothesis states that personal interpretation of ambiguous stimuli reflects the subconscious needs, motives and conflicts of the participant. Projective tests must be able to derive underlying personality processes from this. They are categorized into association tests, construction tests, completion tests, and expression tests.

Association techniques

The best-known association test is the Rorschach test, in which participants see pictures with ink stains and have to say what they see in them. A system has been developed for this called the Rorschach Performance Assessment (R-PAS). It defines exactly how the test should be taken, scored and interpreted. Thus the test is well standardized. In addition, interrater correlations show that the test is reliable.

However, validity is questionable. One study found correlations between subscore complexity and coping skills and thus demonstrates the validity of the test. Others argue that formal scoring is nonsensical, and that the test is only good as an addition to a diagnostic interview. The answers can give insight into personal, illogical and strange associations.

In addition, there are also other valid scoring systems for the Rorschacht test. The Rorschach Prognostic Rating Scale (RPRS) has a complicated point system where points are given with good answers and deducted with bad answers. The RPRS indicates the final scores in terms of how successful a treatment will be.

The TDI is especially useful in patients with mental disorder, ranging from slightly confusing thoughts to bizarre, schizophrenic disorganization. The TDI scores the answers to how strange and incoherent they are.

In spite of its extensive scoring systems, the Rorschach test is very controversial. In 1980, researchers conducted a study into susceptibility to faking. Informed and uninformed students, who pretended to have schizophrenia, were compared to persons who actually had schizophrenia. It turned out that the informed students were diagnosed more often than the real patients (72 versus 48 percent) and 42% of the uninformed students were diagnosed. The test was once developed for children, but is now mainly used in adults.

Completion techniques

In sentence-completion, participants receive 40-100 stems consisting of unfinished sentences and have to complete them. These sentences may contain certain themes that are thus unconsciously described. The tests can be scored subjectively and qualitatively as well as objectively and quantitatively.

The Rotter Incomplete Sentences Blank (RISB) is for high school students, students and adults. In the objective scores, each answer receives a score of 0 (positive addition) to 6 (negative addition). The reliability is good, even for people with little psychological knowledge. The validity is also good, because a certain cut-off score seems to predict the possible pathology of the participant.

A problem is, as with other self-reports, that you only find out what the client wants you to know and a single score can never include the nuances of personal functioning.

Construction techniques

The Thematic Apperception Test (TAT) consists of 30 images on different topics and themes. The images are in black and white and one is blank. Some images are only used for a particular target group, such as adult, adult, male, female or a combination of these.

20 images exist for each target group. The participant must then tell a story about what happens on the image. It is often seen that certain themes recur in the stories. These recurring themes then say something about the person. There are many scoring systems, but usually it comes down to the qualitative interpretation of the test taker. The fact that there is no standardized way makes it possible that this test can only be used as a supplement. It is difficult to say anything about psychometric characteristics of this test because there are so many scoring systems. In general, the TAT has a low test-retest reliability (.28). In addition, client's answers are often interpreted intuitively and subjectively and this can quickly lead to over-diagnosis. The test has been extensively used for research purposes.

The Picture Projective Test (PPT) was an improvement on the TAT, because other images were used that are less obscure and depressing. The images used for this had to meet 4 criteria:

The image had to promise a meaningful projective material.
Most, but not all, images had to contain more than one human character.
Approximately half of the images had to show positive affective expressions in the persons depicted.
Approximately half of the images had to contain people who move and are not simply sitting or lying down.

It also appeared that the participants tell more positive stories at the PPT. In addition, the PPT puts more emphasis on interpersonal themes instead of intra-personal themes and therefore also more emphasis on healthy personality adjustment.

Compared with the TAT, the PPT was better at distinguishing between psychotic patients and normal / depressive participants. However, there must be more research into the psychometric qualities of this test.

For children there is the Children's Apperception Test (CAT). This test consists of 10 plates and is suitable for children between 3 and 10 years. The images are animals in a typical human setting (CAT-A). The idea behind this is that children can identify better with animals than with people. A version with people (CAT-H) is available for older children. There is no fixed way for scoring and there is no statistical information about validity and reliability. The diagnosis is made on the basis of 10 variables that are included in each story:

Main theme
Main hero
Main needs and drives for hero
Conception of environment
Perception of parental, contemporary and junior figures
Conflicts
Anxieties
Defenses
Adequacy or superego
Integration of ego.

Other variants of the TAT have also been developed for ethnic, race and language minorities. The T-TAT is suitable for African-Americans, but there were unintentional changes in facial expressions and situations that make it a new version of the TAT.

The TEMAS is intended for Spanish-American people and consists of 23 colorful images. The test contains 18 cognitive functions, 9 personality functions and 7 affective functions as a theme. Through this test you can also measure various objective indexes such as reaction time. The test does have inconsistent reliability and validity.

Expression techniques

In these tests, the participants are asked to make a drawing. At the DAP you are asked to draw a person. The interpretation is entirely intuitive by the examiner. Thereby, this test is poorly supported empirically. The House-Tree-Person test (HTP) asks the participant to draw a house, tree and person. When interpreting the drawings, the house must represent home life, the tree represents the experience of the environment and the person reflects the interpersonal relationships. However, it is an invalid measuring instrument. All in all, most clinicians no longer use the projective techniques as a stand-alone test, but as a supplement to the clinical interview.

Topic 8B Self-report and behavioral assessment of psychopathology

Previously the protective personality tests were discussed. Structural tests also exist. These tests have specific rules about taking, scoring, interpreting and describing the test and its results. These tests are also called objective, but you can only really speak of that after extensive research. There are three approaches for developing structured personality tests: theory-bounded, factor-analytic and criterion-key. A combination of these methods is often used.

Theory-guided inventories

These types of tests are developed around an existing theory.

Personality Research Form (PRF)

The PRF is based on Murray's theory on manifest need, the needs that a person wants to fulfil. Examples include autonomy, dominance, impulsivity and change. The PRF reflects 20 of this type of needs in 20 personality scales. Each scale has about 20 true-false items. There are multiple versions available with more or fewer items. The striking thing is that the different scales have little overlap. Nevertheless, the reliability and validity of this test is high.

State-Trait Anxiety Inventory (STAI)

The STAI is a self-report for measuring anxiety. The purpose of the test is to distinguish between a temporary state of anxiety (state-anxiety) and a long-term, stable state of anxiety (trait-anxiety). The state-scale contains 20 items about how the participant feels now, at the moment. The characteristic scale contains 20 items about how the participant generally feels. The test-retest reliability is logically low for the state scale and high for the trait scale. Furthermore, other reliability, such as internal consistency, is also high. In addition, the content, convergent, discriminant and construct validity is also high. Therefore the STAI is a widely used tool in the clinical and research sector.

Factor-analytically derived inventories

Eysenck Personality Questionnaire (EPQ)

The EPQ was developed to measure the large dimensions of normal and abnormal personality dimensions. From the analysis three dimensions followed: Psychoticism (P), Extraversion (E), and Neuroticism (N). In addition, a Lie scale (L) was added for validity. A high score on the psychotic scale indicates aggressive and hostile characteristics, impulsiveness, a preference for unusual things and empathic limitations. Antisocial and schizoid patients often have a high score on this scale. A high score on the extraversion scale indicates loud, gregarious, outgoing, fun-loving core traits. A low score points to more introverted characteristics. The scale of neuroticism stands for the emotional dimensions ranging from nervous, poorly adjusted and over-emotional (high score) to stable and self-assured (low score).

There is also a Junior EPQ available for children aged 7 to 15 years. The reliability and validity of the EPQ are high. The EPQ is a very good tool for self-reporting.

Comrey Personality Scales

The CPS is a short self-report and is especially useful for students and other adults. The reliability is high. There is more disagreement about the validity. For example, cross-cultural validity is excellent and it is also an acceptable predictor for clinical purposes. But it lacks correlation with the biographical data.

The test consists of 8 scales with 20 items and 20 items for validity (first 2 scales):

Validity check: test for contradictory answers.
Response bias: test for the tendency to look good.
Trust vs. defensiveness: high score indicates an honest and trustworthy person.
Order vs. lack of compulsion: careful, orderly and organized.
Social conformity vs. rebelliousness: keeping rules, accepting society.
Activity vs. lack of energy: a lot of energy to work and perform hard.
Extraversion vs. introversion: easy going, looking for new friends.
Emotional stability vs. neuroticism: optimistic, confident and relaxed.
Mental toughness vs. sensitivity: are tough and show little emotions.
Empathy vs. egocentrism: helpful, generous and sympathetic.

Criterion-keyed inventories

With the criterion-keyed approach, test items are used only when they can distinguish a criterion group from a control group.

Minnesota Multiphasic Personality Inventory-2 (MMPI-2)

The MMPI is a self-report that is useful in psychiatric research and normal personality research. In the first version, the control norm group was not well representative. In addition, the language use of some items was not objective and too directive. As last, the MMPI was not comprehensive enough.

That is why the MMPI-2 improved in many areas. The test consists of 567 true-false items that contain pathological themes. The test is standardized on standard groups of different mental patients. The MMPI-2 is scored on 4 validity scales, 10 standard clinical scales and many additional scales.

The first validity scale is the Cannot Say score. The score is the number of items that the participant has not checked, or double-checked. The second validity scale is the L scale. These are items that indicate an attitude that is almost never seen in our culture. They are items that everyone would fill in the same way (I never get angry, I love everyone). High scores on the F scale mean seriously unadapted behavior. It seems to show psychopathology, but even patients rarely score high. The K scale is used to discover subtle forms of defensibility. Combinations of F and K scale can be used to discover pretended diseases or fake profiles.

The MMPI-2 is always scored on its ten clinical scales, with possible interpretations.

Hs. Hypochondriasis, bias about physical conditions.
D. Depression, sad feelings and hopelessness.
Hy. Hysteria, immaturity, use of repression or denial
Pd. Psychopathic deviate, authority conflicts and impulsivity.
Mf. Masculity-feminity, male / female interests.
Pa. Paranoia, suspicion and confidence.
Pt. Psychasthenia, fear and obsessive thoughts.
Sc. Schizophrenia, Alienation, unusual thoughts.
Ma. Hypomania, a lot of energy and possibly nervous tensions (agitation).
Si. Social introversion, shyness and introversion

The MMPI-2 can be interpreted and scaled per scale by means of T-scores. In addition, there is a configurational method whereby code types are made of the scales. Two or more scales are above a certain criterion ('elevation') and two or more scales differ significantly from the others ('definition'). These scoring methods can also be done via computer programs.

The MMPI-2 has a nice reliability. Internal consistency coefficients are above .70 and test-retest tests above .50 and .90. A disadvantage, however, is that the inter-correlations of the scales are very high. The validity of the MMPI-2 is also good. The MMPI-2 will therefore remain a leading test instrument for many years.

There is also a computerized way of interpreting and the Minnesota Report is the best. This forms a 16-page long report on the validity of the profile, symptom patterns, interpersonal relationships, diagnostic considerations and considerations for treatment. It also contains multiple tables and figures to illustrate these results. It is basically a good program, but one has to take into account that it is made by people and that there can be errors. There can therefore be erroneous interpretations.

Millon Clinical Multiaxial Inventory III (MCMI-III)

Like the MMPI-2, the MCMI-III is useful for psychiatric research, but the MCMI-III is shorter (175 items) and can also be used in combination with the DSM-IV. The test consists of 5 groups of scales:

Clinical personality patterns: 1. Schizoid, 2a. Avoidant, 2b. Depressive, 3. Dependent, 4. Histrionic, 5. Narcissistic, 6a. Antisocial, 6b. Aggressive / sadistic, 7. Compulsive, 8a. Passive-aggressive / negativistic, 8b. Self-defeating.
Severe personality pathology: S. Schizotypal, C. Borderline, P. Paranoid.
Clinical syndromes: A. Anxiety, H. Somatoform, N. Bipolar: manic, D. Dysthymia, B. Alcohol dependent, R. Post-traumatic stress disorder.
Serious syndromes: SS. Thought disorder, CC. Major depression, PP. Delusional disorder.
Validity indexes: X. Disclosure, Y. Desirability, Z. Debasement.

The development of the scales was done on existing patient groups, so that the scales could distinguish different patients. The intercorrelations were, however, somewhat too high. It was also controversial that he only used patients as a standard group.

Personality Inventory for Children (PIC-2)

The PIC-2 is designed for children from 5 to 19 years old. The test consists of a part that is completed by the child (PIY) and a part that is filled in by the teacher (SBS). The PIC-2 has three validity scales: inconsistency, dissimulation (fake behavior) and resilience. In addition, the test has 9 adjustment scales with 2 or 3 subscales each:

Cognitive impairment: inadequate abilities, poor achievement, developmental delay.
Impulsivity and distractibility: disruptive behavior and fearlessness.
Delinquency: antisocial behavior, no control, disobedient.
Family dysfunction: quarrel with family members, unacceptable behavior towards parents.
Reality distortion: developmental deviation, hallucinations and delusions.
Somatic concern: psychosomatic bias, muscle tension and anxiety.
Psychological discomfort: fear and worry, depression, insomnia.
Social withdrawal: social introversion, isolation.
Social skills deficits: few friends, quarrel with peers.

The PIC-2 is scored via T-scores. The reliability is good with test-retest of .82 or higher and internal consistency of .81 or higher. The validity scores are also high. The PIC-2 is therefore a good instrument that can be used well for behavioral emotional research in children.

Behavioral assessment / research

Behavioral research concentrates on behavior itself and not on underlying characteristics, causes or dimensions of personality. It must be fast, direct and easy and correspond with the treatment. Various instruments are used, such as self-reporting, assessment of parents or (semi) structured interviews. In addition, a new form has become popular in recent years: ecological short-term research. Behavioral research can be part of behavioral therapy, where the goal is to change the duration, frequency or intensity of a certain behavior. The way of behavioral research is therefore often dependent on the goals and procedures of the therapy.

Behavioral therapy

Behavioral therapy can be divided into four categories: exposure methods, cognitive behavioral therapy, self-control procedures and social skills training.

Exposure

This method can be used to treat phobias. Here the patient is systematically exposed to the feared object or situation. This is done through desensitization. The patient learns to remain calm with the object, which happens in small steps and after learning relaxation techniques. Another way is flooding or implosion, where the patient is immediately exposed to the feared object.

The therapist first needs a behavioral assessment to treat a phobia. An example of this is the Behavorial Avoidance Test (BAT), in which the therapist measures how long the patient can tolerate the anxiety stimulus. Scores on this test are strongly related to self-reporting of catastrophic thoughts. That means that there is a cognitive component that plays a part. However, the situational context must always be taken into account.

In a fear survey schedule, participants must indicate the presence and intensity of their fears in relation to different stimuli. However, the validity is not good, so these types of instruments must be used with caution.

Cognitive behavioral therapy

The goal of cognitive behavioral therapies is to change the belief structure of the client. One of these types of therapies is Ellis' rational emotive therapy (RET). Distorted behavior is caused by irrational beliefs, which must be changed by logical arguments and incentives. Second is Meichenbaums self-instruction training. In doing so, the patient is taught to learn coping skills for stressful situations. The third is Beck's cognitive therapy, which is mainly focused on depression. Here the pessimistic cognitive structures of the view of the world, the self-concept and the future are re-structured.

The Beck Depression Inventory (BDI) is a self-report questionnaire that measures many cognitive components of depression. It is a simple and quick to fill in questionnaire, because it consists of only 21 items. The reliability of the test is very good. Only the test-retest results were disappointing in healthy participants, but that is not unexpected, because feelings of depression change a lot in a short time. The validity of the BDI is also very good. The test can therefore be used well in behavioral research and for other clinical settings. The only drawback is that the BDI is too transparent, so that the answers are easy to fake.

Self-Monitoring Procedures

During self-monitoring, the patient is expected to choose his own goals and actively participate in leading, mapping and recording progress towards the final goal of the therapy. The therapist acts as a consultant. It is especially useful in the treatment of depression. Thus Lewinsohn discovered that depression is accompanied by a reduced experience of pleasant events. To counter this, he designed the Pleasant Events Schedule (PES). The first goal of this instrument is to draw up a baseline of pleasant, everyday events. The second goal is to measure progress during therapy. The more frequent the pleasant events, the better the symptoms. The PES is therefore a useful tool for self-control against depression.

Structured interviews

Structured interviews are often based on DSM-IV. This consists of five axes. Axis I includes clinical disorders. Axis II includes personality disorders. Axis III includes general medical conditions. Axis IV includes psychosocial and environmental problems. Axis V includes the 'Assessment of Function', a scale from 1-100 to identify everyday functioning. Problems with the DSM-IV are that taking a diagnosis takes a long time, a fixed method is lacking, and the reliability is changeable.

Semi-structured and structured interviews have been developed for this. The Schedule for Affective Disorders and Schizoprenia (SADS) is a semi-structured diagnostic interview for Axis I disorders. It consists of standard questions and additional questions that can be used to get something clearer. The reliability and validity of the SADS are good.

In addition, the Structured Clinical Interview for DSM-IV (SCID) for the DSM-IV is also available. This is also a semi-structured interview that also contains questions for Axis II of the DSM-IV.

Assessment by systematic direct observation

Observation methods are mainly used in children. The goal is to measure specific behaviors. These have been determined in advance. The observations are done under objective standardized procedures. That is why the time and place are well specified. Finally, the scoring is standardized and is therefore no different for other observers.

One of the ways is simply to count the frequency of the behavior. Another way is to record the duration of the behaviors. A goal of intervention can be to reduce both the frequency and the duration of behaviors. This is also available for specified schedules that save time and effort. An example is the Behavior Observation of Students in Schools (BOSS).

It is important to remember that problems may arise in direct observation. Observer drift is the problem that the observer becomes less vigilant after a while and therefore fails to notice behaviors. Another problem is coding complexity. This happens when too many behaviors have to be observed, or when the behaviors are poorly defined. You also have to take into account the moment you observe. Problems may or may not be present at different times of the day. So you cannot always start from a single observation.

Analogue behavioral assessment

In direct observation, the child is examined in his natural setting, such as a classroom. With analogue behavioral assessment, the children are observed in a standardized environment, but in a way that the child feels at ease. The child performs relevant tasks for the observed behaviors. For example, a child has to do homework in a room that is furnished as a classroom, while the observer observes the child through a mirror window. Analogue behavioral assessment can also be used for parent-child interactions. For adults, the Rapid Couples Interaction Scoring System (RCISS) is used in marriage therapy or other therapies.

Ecological momentary assessment

Patients get a device with them and they have to fill in a very short list at random moments of the day. The answers are immediately forwarded to a central computer. It is a more accurate and reliable way to examine the patient's experience. It can provide insights that are much more difficult to obtain with normal research.

How does the assessment of normality and human strengths work? - Chapter 9

Subject 9A: Assessment within the normal spectrum

When someone wants to examine the personality of a normal person, normal personality tests are more appropriate than tests designed for psychopathology. These tests focus on normality and the strengths of people. They also measure other things such as the locus of control, responsibility, intuition, or attachment style.

Broad band tests of normal personality

A broad band test is a test that measures the full functioning of a person. However, it is difficult to know how personality can best be determined in a single concept.

Myers-Briggs Type Indicator (MBTI)

The MBTI is a self-report selection list based on Carl Jung's personality theories. The test is scored on four polarities: Extraversion-Introversion, Sensing-iNtuiton, Thinking-Feeling, Judging-Perceiving. The personality consists of 4 of the extremes, so 2 ^ 4 = 16 different personalities are possible. One extreme is not better than the other, but they can always be considered opposites of each other.

An extrovert (E) directs his energy outside to people or conversations, whereas an introvert (I) directs his energy into his own world. Someone is 'Sensing' (S) when he trusts his immediate senses, while someone who relies on relationships and opportunities beyond his awareness is using his intuition (N). Basing conclusions on thinking (T), someone who thinks logically and objectively, against Feeling (F), trusting in his personal values and social harmony. Judging (J) includes decisive and conclusive judgments, while Perceiving (P) includes non-binding, flexible and spontaneous judgments. On the basis of the letters someone can, for example, have personality ENFP. In this way, certain professional behaviors can also be attributed to a personality.

The 16 different personalities do not occur to the same extent in the population. Some are more common in certain professions. The test can be used well to estimate whether one is suitable for a specific profession.

The MBTI has good internal reliability, given the split half reliability of .80. The test-retest reliability is somewhat lower if the interval between the tests (a number of weeks versus a number of years) is higher. In the long term, 41% of all 4 letters remained the same and for 38% 3 out of 4 letters remained the same.

The validity for the intuition function is good given the positive correlation with emotional intelligence. The polarities have been compared with the NEO-PI-R that measures personality on the basis of the big five theory. The correlations with this instrument indicated that the validity of the MBTI was confirmed. The disadvantage of the MBTI is that it is a very expensive test.

California Psychological Inventory (CPI)

The CPI is a true-false questionnaire and has a version with 434 questions and one with 260 questions. The questions reflect 20 personality scales and 7 work-related scales on 3 broad dimensions. Three of the 20 personality scales (Good Impression, Communality, Well-being) also give a good impression of the behavior of the person during tests.

The reliability of the CPI is acceptable, with coefficients averaging .76 and test-retest coefficients of .68. The three dimensions of the CPI that follow from the factor analysis are:

Orientation towards people or towards someone’s inner life. Similar to the extraversion-introversion polarity.
Rule-favoring or rule-questioning. Reflects a conventional-unconventional dimension also found in many studies
A 7 point scale called Level of Satisfaction. It is a moderator (positive / negative expression) between the four different lifestyles that follow from the first two vectors:

Performers (extrovert & traditional); good as managers or leaders.
Supporters (introvert & traditional); good in supporting positions.
Innovators (extrovert & innovative); adept for creative changes.
Visualizers (introvert & innovative); only working in art or literature.

The CPI is a good predictor for psychological and physical health, school performance, teacher and police staff effectiveness, and leadership success. In addition, the test can also identify delinquent and criminal adolescents.

NEO Personality Inventory-Revised (NEO PI-R)

The NEO PI-R is based on the five-factor model of personality. The items also reflect the five domains: emotional stability, extraversion, openness, conscientiousness and agreeableness. These five domains each have six facet scales that reflect the characteristics of the domain. In this way the NEO PI-R consists of 30 facets.

The test is good to use for research and for measuring psychopathology. Certain differences on the scales can indicate a disorder. For example, persons with attention disorders often score high on neuroticism and lower on conscientiousness.

The NEO PI-R can also be done on various internet sites. The concept of collaboration describes the conducting of tests via the internet, with test specialists. The geographical location of the test taker and the participant is then no longer of importance. This is possible with this test. The tests on the internet are parallel tests of the original test, but still give a good estimate of the personality. The NEO PI-3 has been developed for children and adolescents. This contains questions that are easier to understand for children.

Stability and change in personality

A question that arises in this subject is whether personality remains stable throughout life or whether it shows certain qualitative changes. Longitudinal studies are often used for this. A problem here is selective attrition. Less healthy participants are faster than healthy, which may lead to an overly optimistic picture of reality. In addition, this research is difficult and expensive.

Another way is cross-sectional research, in which different age groups are compared. This type of research is sensitive to cohort effects. That is the assumption that differences between age groups are the result of differences in nature, development or historical experiences of that time, and are therefore not the result of aging. For example, people who grew up during the Great Depression might show a higher level of neuroticism than people who did not.

Therefore, these two types of research are combined: cross-sequential research. In addition, qualitative case studies must also be examined. For example, someone who used to have an insecure attachment may later develop a healthy personality. This is then, as it were, against the expectations of the theories.

Several researchers try to identify normative changes. These are general developmental patterns that occur in most people. However, there are many contradictory studies. Therefore, the different studies can best be investigated with a meta-analysis. Results can be combined to a common metric, called effect size: The difference in means divided by the pooled group standard deviation. This allows a general measure to be used for studies with different measuring scales.

In this way it was found that the biggest changes mainly take place in the young adulthood, when social role expectations become increasingly important. In another study it was found that the domains agreeableness and conscientiousness in puberty are at the lowest point of the development. In addition, extraversion around the 10^th year of life is the highest and after falling until the 15^th year remains approximately the same over the lifetime. Women score higher on neuroticism than men. Lastly, women at all ages score higher on agreeableness, conscientiousness and extraversion than men.

Assessment of moral judgments

According to Kohlberg there are three levels of moral development: pre-conventional> conventional> post-conventional. Each level has two sublevels. The level of moral reasoning was determined with the help of the Moral Judgment Scale. This consists of a number of moral dilemmas in which the reasoning of the answer is scored.

After much criticism of the reliability and validity of the MJS, Kohlberg and his colleagues developed a new scoring system. This has been validated with a longitudinal study. The Moral Judgment Scale proved to be reliable, internally consistent and valid, because moral development was in line with theory.

The Defining Issues Test (DIT) largely corresponds to the Moral Judgment Scale, but has a simpler and more objective scoring system. Answers to similar moral dilemmas are scored. After that, certain factors that can play a role in the dilemma are given. The participants must indicate how important they consider them. In general it is a good alternative to the Moral Judgment Scale, but marginal comments are as follows: First, some dilemmas are either dated or well known. Secondly, the DIT is biased towards conservatively religious individuals. Because of their beliefs they almost always reason at a conventional level (sub-level 3 or 4). Lastly, a relationship has never been demonstrated between moral development level on the DIT and moral behavior. This shows that the DIT is not a good tool for predicting behavior, but for evaluating moral development.

Assessment of spiritual and religious concepts

Subjects such as spirituality and religion are not studied much in psychology, but are still important to continue to research, given the importance that most people have for religion. Spirituality differs from religion in that it does not have to be experienced with certain institutions, is more personal and is less mediated by a group. Reasons for doing an assessment of spirituality or religion are: understanding the client's worldview, receiving different insights for a problem, investigating whether the spiritual-religious insights the person has can be used with coping, examining which spiritual-religious interventions are useful for therapy, and to recognize spiritual doubts that can be included in the therapy.

One of the first measuring scales was the Allport-Ross Religious Orientation Scale. This investigated the intrinsic and extrinsic expression of religion. Intrinsic expression means that you really live your religion (e.g., to find meaning), extrinsic expression means that you use your religion (e.g., to seek security or social status). This scale was later revised to the Religious Orientation Scale, consisting of 11 extrinsic items and 9 intrinsic items on a 9-point scale.

The Religious Orientation scale also had problems; it was unclear what exactly the intrinsic and extrinsic scales measured and what the relationship between the two scales was. That is why the Religion as Quest scale was developed , where complexity, doubt and tentativeness were seen as ways to be religious. This instrument proved to be more reliable, but there are still doubts about what the scale actually measures. The test is intended to measure the following: willingness to be confronted with existential questions without reducing their complexity, self-criticism and perception of religious doubts such as positivity, and openness to change. There was much criticism of this, that it is rather assessing agnosticism, anti-orthodoxy, religious doubts, or religious conflict. In addition, the test is too concise and factorially too simple.

The Spiritual Well Being Scale measures the spiritual well-being on two dimensions. The Religious Welfare Scale measures well-being in relation to God / higher power and the Existential Welfare Scale measures well-being in relation to life's utility and satisfaction. The scores on the SWB should match the independent measurements of well-being. This has been researched a lot and in general positive results are achieved here. The idea is that the well-being consists of the integral experience of the person who functions as God intended and has a right-minded relationship with him, with others and with himself. Criticism on this scale is that it has too low a ceiling for religious people. It cannot be used to distinguish between people with high levels of spiritual well-being.

The Assessment of Spirituality and Religious Sentiments (ASPIRES) Scale measures spiritual and religious variables on two dimensions: spiritual supernaturality and religious feelings. The spiritual dimension consists of three facets: the ability to pray, belief in a larger reality and connectedness with a larger humanity. The religious dimension has two facets: religious involvement and religious crisis. Validity studies support the reliability and validity of the ASPIRES Scale. In addition, it predicts certain social behaviors and personality traits.

The Faith Maturity Scale (FMS) firstly developed a baseline of data about the vitality of faith in protestant municipalities. In addition, to identify demographic, personal and municipal variables that contribute to the development of faith. Finally, to evaluate the impact of religious education. The FMS consists of 38 items spread over 8 themes, such as trust, experiences and spiritual growth. The items used in the test are mainly used for Protestant religions. Nevertheless, validity and reliability are high. Firstly, different groups (pastors, teachers, young people) were indeed expected to have different scores. Secondly, the mutual correlation between the pastors was also high. Third, the FMS also proved useful, given the high correlations with pro social behaviors.

Access:

Public

Join WorldSupporter!

Join with a free account for more service, or become a member for full access to exclusives and extra support of WorldSupporter >>

Going abroad?

Insure your way around the world

International expat insurances

Travel & Worldsupporter insurances (NL)

Study with summaries

Associate with your Field of Study

Search Summaries or Notes

Start using Summaries

Add a Summary

Search a summary

Contributions: posts

Help other WorldSupporters with additions, improvements and tips

Add new contribution

Spotlight: topics

Check the related and most recent topics and summaries:

WorldSupporter and development goals:

Goal 4: Quality Education

Check how to use summaries on WorldSupporter.org

Online access to all summaries, study notes en practice exams
How and why use WorldSupporter.org for your summaries and study assistance?
Using and finding summaries, notes and practice exams on JoHo WorldSupporter
Quicklinks to fields of study for summaries and study assistance

Online access to all summaries, study notes en practice exams

Check out: Register with JoHo WorldSupporter: starting page (EN)
Check out: Aanmelden bij JoHo WorldSupporter - startpagina (NL)

How and why use WorldSupporter.org for your summaries and study assistance?

For free use of many of the summaries and study aids provided or collected by your fellow students.
For free use of many of the lecture and study group notes, exam questions and practice questions.
For use of all exclusive summaries and study assistance for those who are member with JoHo WorldSupporter with online access
For compiling your own materials and contributions with relevant study help
For sharing and finding relevant and interesting summaries, documents, notes, blogs, tips, videos, discussions, activities, recipes, side jobs and more.

Using and finding summaries, notes and practice exams on JoHo WorldSupporter

There are several ways to navigate the large amount of summaries, study notes en practice exams on JoHo WorldSupporter.

Use the summaries home pages for your study or field of study
Use the check and search pages for summaries and study aids by field of study, subject or faculty
Use and follow your (study) organization
- by using your own student organization as a starting point, and continuing to follow it, easily discover which study materials are relevant to you
- this option is only available through partner organizations
Check or follow authors or other WorldSupporters
Use the menu above each page to go to the main theme pages for summaries
- Theme pages can be found for international studies as well as Dutch studies

Do you want to share your summaries with JoHo WorldSupporter and its visitors?

Check out: Why and how to add a WorldSupporter contributions
JoHo members: JoHo WorldSupporter members can share content directly and have access to all content: Join JoHo and become a JoHo member
Non-members: When you are not a member you do not have full access, but if you want to share your own content with others you can fill out the contact form

Quicklinks to fields of study for summaries and study assistance

Main summaries home pages:

Main study fields:

Business organization and economics, Communication & Marketing, Education & Pedagogic Sciences, International Relations and Politics, IT and Technology, Law & Administration, Medicine & Health Care, Nature & Environmental Sciences, Psychology and behavioral sciences, Science and academic Research, Society & Culture, Tourisme & Sports

Main study fields NL:

Studies: Bedrijfskunde en economie, communicatie en marketing, geneeskunde en gezondheidszorg, internationale studies en betrekkingen, IT, Logistiek en technologie, maatschappij, cultuur en sociale studies, pedagogiek en onderwijskunde, rechten en bestuurskunde, statistiek, onderzoeksmethoden en SPSS
Studie instellingen: Maatschappij: ISW in Utrecht - Pedagogiek: Groningen, Leiden , Utrecht - Psychologie: Amsterdam, Leiden, Nijmegen, Twente, Utrecht - Recht: Arresten en jurisprudentie, Groningen, Leiden

WorldSupporter: what are the features, functionalities and rules on WorldSupporter.org?

WorldSupporter NL: hoe vind je samenvattingen en studiehulp op WorldSupporter.org en JoHo.org

Summaries and Study Assistance - Start

Follow the author: maxuxo

maxuxo

Work for WorldSupporter

JoHo can really use your help! Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world

Working for JoHo as a student in Leyden

Parttime werken voor JoHo

Statistics

916