期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Maximum validity of a test with equivalent items 总被引：1，自引：0，他引：1

Ledyard R Tucker 《Psychometrika》1946,11(1):1-13

It is assumed that a scale of true scores on a function exists and that the probability of answering an item correctly is a curve of the type of the integral of the normal curve. The product moment correlation between the test score and true score is derived for a normal distribution of subjects and a test composed of equivalent items. Numerical examples demonstrate that the maximum correlation between test scores and true scores occurs for a one hundred item test when the point correlation between items is less than three tenths. 相似文献

2.

The relation of the reliability of multiple-choice tests to the distribution of item difficulties

Frederic M. Lord 《Psychometrika》1952,17(2):181-194

Under certain assumptions an expression, in terms of item difficulties and intercorrelations, is derived for the curvilinear correlation of test score on the ability underlying the test, this ability being defined as the common factor of the item tetrachoric intercorrelations corrected for guessing. It is shown that this curvilinear correlation is equal to the square root of the test reliability. Numerical values for these curvilinear correlations are presented for a number of hypothetical tests, defined in terms of their item parameters. These numerical results indicate that the reliability and the curvilinear correlation will be maximized by (1) minimizing the variability of item difficulty and (2) making the level of item difficulty somewhat easier than the halfway point between a chance percentage of correct answers and 100 per cent correct answers. 相似文献

3.

Strong items get suppressed, weak items do not: The role of item strength in output interference

Karl-heinz Bäuml 《Psychonomic bulletin & review》1998,5(3):459-463

An experiment is reported that examines the role of item strength in output interference. Subjects studied two types of categorized item lists: lists in which each category consisted of strong and moderate items, and lists in which each category consisted of weak and moderate items. Different degrees of item strength were accomplished by varying the items’ taxonomic frequency within a category. The subjects either recalled a category’s strong and weak items before its moderate items, or vice versa. The prior recall of the moderate items impaired the later recall of the strong items, but did not impair the later recall of the weak items. This effect of item strength indicates that output interference is caused by a process of retrieval suppression. It additionally suggests that, in order to minimize output-interference effects in recall, a list’s strong items should be recalled before its weak items. 相似文献

4.

Probing item social desirability by correlating personality items with Balanced Inventory of Desirable Responding (BIDR): A validity examination

Chester Kam 《Personality and individual differences》2013

Researchers often include a social desirability measure in personality measures, commonly the Balanced Inventory of Desirable Responding (BIDR), and correlate it with personality items to probe for social desirability of the items. A strong correlation between BIDR scores and a personality item would indicate high item social desirability. The current research assesses the validity of this practice. Results showed that these correlations have high validity only when BIDR scores are calculated as a continuous variable rather than as dichotomized item scores. In addition, self-deception scores have higher validity for detecting item social desirability than do impression management scores. The current research supported the use of the self-deception scores, in particular, to detect highly desirable or undesirable items. 相似文献

5.

The predictive validity of subtle and obvious empirically derived psychological test items under faking conditions 总被引：1，自引：0，他引：1

D L Worthington R S Schlottmann 《Journal of personality assessment》1986,50(2):171-181

The relative contributions of subtle and obvious item endorsements to the prediction of a relevant criterion were assessed under faking and control ("honest") conditions. The MMPI and a nonconformity questionnaire were first administered to 100 male college students. Items on the Pd scale and 101 additional MMPI items that correlated significantly with the nonconformity questionnaire were then rated by 38 other male college students for apparent relationship to psychopathology. From these ratings, a scale (designated PdX) was constructed, which consisted of 21 subtle and 21 obvious items. After a third group of 98 male college students completed the nonconformity questionnaire, they were asked to respond to the items of the Pd and PdX subscales under control, fake-good, and fake-bad instructions. Significant correlations between the nonconformity scale and certain PdX and Pd subscales were found only for the control group. Implications for test construction and for clinical interpretation under faking conditions are discussed. 相似文献

6.

Determination of the optimum number of items to retain in a test measuring a single ability

BEDELL BJ 《Psychometrika》1950,15(4):419-430

相似文献

7.

Learning and discovery in the acquisition of structured material: effects of number of items and their sequence

D J Foss 《Journal of experimental psychology. General》1968,77(2):341-344

相似文献

8.

Social desirability set,interpersonal differences in item desirability,and validity of neuroticism questionnaires

《Acta psychologica》1964

相似文献

9.

Measuring moral reasoning using moral dilemmas: evaluating reliability,validity, and differential item functioning of the behavioural defining issues test (bDIT)

《European Journal of Developmental Psychology》2013,10(5):622-631

ABSTRACT

We evaluated the reliability, validity, and differential item functioning (DIF) of a shorter version of the Defining Issues Test-1 (DIT-1), the behavioural DIT (bDIT), measuring the development of moral reasoning. About 353 college students (81 males, 271 females, 1 not reported; age M = 18.64 years, SD = 1.20 years) who were taking introductory psychology classes at a public University in a suburb area in the Southern United States participated in the present study. First, we examined the reliability of the bDIT using Cronbach’s α and its concurrent validity with the original DIT-1 using disattenuated correlation. Second, we compared the test duration between the two measures. Third, we tested the DIF of each question between males and females. Findings reported that first, the bDIT showed acceptable reliability and good concurrent validity. Second, the test duration could be significantly shortened by employing the bDIT. Third, DIF results indicated that the bDIT items did not favour any gender. Practical implications of the present study based on the reported findings are discussed. 相似文献

10.

Motion-induced overestimation of the number of items in a display

Afraz SR Kiani R Vaziri-Pashkam M Esteky H 《Perception》2004,33(8):915-925

Subjects were asked to report the number of items in a display as the items moved along a circular path around the fixation point. As the rotation speed increased, the apparent number of items also increased. This motion-induced overestimation (MIO) effect was investigated in three experiments. In the first experiment, the effect of rotation speed and set size was explored with an enumeration task. The overestimation error increased with an increase in speed or number of items in the display. In the second experiment, we used an adjustment paradigm to measure the speed threshold of MIO effect onset. Temporal rate of the display, which was defined as product of rotation speed and the number of rotating items, was the determining factor of MIO onset. In the third experiment, moving items were marked with different colours. Surprisingly, the number of perceived items was still overestimated even though the number of perceived colours was not. 相似文献

11.

Transformational and transactional leadership: a meta-analytic test of their relative validity 总被引：19，自引：0，他引：19

Judge TA Piccolo RF 《The Journal of applied psychology》2004,89(5):755-768

This study provided a comprehensive examination of the full range of transformational, transactional, and laissez-faire leadership. Results (based on 626 correlations from 87 sources) revealed an overall validity of .44 for transformational leadership, and this validity generalized over longitudinal and multisource designs. Contingent reward (.39) and laissez-faire (-.37) leadership had the next highest overall relations; management by exception (active and passive) was inconsistently related to the criteria. Surprisingly, there were several criteria for which contingent reward leadership had stronger relations than did transformational leadership. Furthermore, transformational leadership was strongly correlated with contingent reward (.80) and laissez-faire (-.65) leadership. Transformational and contingent reward leadership generally predicted criteria controlling for the other leadership dimensions, although transformational leadership failed to predict leader job performance. 相似文献

12.

Using mixture distribution models to test the construct validity of the Physical Self-Description Questionnaire

Maike Tietjens Philipp Alexander Freund Dirk Büsch Bernd Strauss 《Psychology of sport and exercise》2012,13(5):598-605

相似文献

13.

Reliability and validity of the n-Achievment test

KRUMBOLTZ JD FARQUHAR WW 《Journal of consulting psychology》1957,21(3):226-228

相似文献

14.

A comparative analysis of the empirical validity of past- and present-oriented biographical items

Lawrence S. Kleiman Robert H. Faley 《Journal of business and psychology》1990,4(4):431-437

This study sought to determine whether changing the time orientation or biodata items from past to present would result in a reduction of the items' validity. It was predicated on the notion that the traditionally employed measures of past performance were potentially unfair, especially to minority applicants. Administered to 192 members of the Air National Guard, the set of biodata items measuring present behavior was found to have validity coefficients which are at least comparable, if not superior, to the set measuring past behavior. 相似文献

15.

The Bender-Gestalt test in differential diagnosis of adolescents with learning difficulties

John B. Mordock Senior Clinical School Psychologist

Patricia Ann Terrill

Ellen Novik 《Journal of School Psychology》1969,7(4):11-14

This study assessed the efficiency of the Haine and Koppitz scoring systems used with the Bender-Gestalt Test (B-G) in terms of their ability to differentiate between adolescents with and without central nervous system (CNS) impairment who were achieving below age-expectations. Utilizing a population of 84 adolescents enrolled in a residential treatment center, both the Haine and Koppitz systems with the Bender-Gestalt differentiated 25 Ss with CNS impairment from 59 Ss wihout such impairment. The results indicated, however, that neither scoring system was useful in individual classification when the B-G was used alone or in combination with intelligence test results. 相似文献

16.

Difficulty and validity of analogies items in relation to major field of study

DOPPELT JE 《The Journal of applied psychology》1951,35(1):30-33

相似文献

17.

Optimizing the validity of personality assessments: The importance of aggregation and item content

Sampo V. Paunonen 《Journal of research in personality》1984,18(4):411-431

The predictive validity of a psychological measure can be improved by minimizing measurement errors through increases in the length of the assessment (aggregation) and, for an assessment of finite length, by making use of objective strategies for choosing from all available component measures. Two prominent considerations in selecting individual measures to be aggregated involve standards of (a) item content (construct approach) and (b) item/criterion association (empirical approach). Personality trait scales of different lengths were assembled for this study in order to represent features of the construct and empirical methods of selection. It was observed that (a) although reliability and validity generally increased with test length, aggregation beyond a certain point can fail to be expedient; and (b) although the prediction performance of empirically derived measures initially surpassed that of construct based assessments, the superiority of the empirical scales did not generalize to trait criteria that were not used as a basis for item selection. The data are interpreted as providing support for a theory-based program of test development where substantive considerations involving item content play a major role. The findings are also viewed as encouragement for conventional conceptualizations about organized dimensions of behavior. 相似文献

18.

Salience of norms and order of questionnaire items: their effect on responses to the items

NAKAMURA CY 《Journal of abnormal psychology》1959,59(1):139-142

相似文献

19.

A confirmatory evaluation of the profile of mood states: Convergent and discriminant item validity

John R. Reddon Roger Marceau Ronald R. Holden 《Journal of psychopathology and behavioral assessment》1985,7(3):243-259

The Profile of Mood States was administered to samples of 182 college males, 179 college females, and 257 prison inmates. College males and females did not differ significantly from each other in terms of scale elevation but differed from prison inmates on all scales except Fatigue-Inertia. The college samples differed from the published normative college samples, suggesting the importance of using local norms. A confirmatory item factor analysis suggested convergent item validity with the scoring key and similarity of structure across samples. Discriminant item validity, however, suggested that a smaller number of mood scales would offer a more justifiable interpretation of this inventory.This study was supported by the Alberta Hospital Edmonton, the Solicitor General of Canada, and Social Sciences and Humanities Research Council of Canada Grant 410-80-0576-XI. 相似文献

20.

Comments on Bellack, Hersen, and Turner's paper on the validity of role-play test

James P. Curran 《Behavior Therapy》1978,9(3):462-468

相似文献