Similar literature
 Found 20 similar articles; search took 31 ms
1.
This paper proposes two unidimensional item response theory (IRT) models for analysing normative forced‐choice personality items. Both models are derived from a common theoretical framework and arise as a result of different assumptions regarding the mechanism of choice. The simplest mechanism gives rise to the one‐parameter normal‐ogive model. The second mechanism gives rise to a new IRT model, which is closely related to the Coombs–Zinnes probabilistic unfolding model. The second model is compared theoretically to the normal‐ogive model in terms of item characteristic curves and amount of item information. Next, procedures for estimating the respondent and the item parameters in the second model are described. Finally, both models are empirically compared by using two well‐known personality measures.

2.
Computerized adaptive testing under nonparametric IRT models   Total citations: 1, self-citations: 0, citations by others: 1
Nonparametric item response models have been developed as alternatives to the relatively inflexible parametric item response models. An open question is whether it is possible and practical to administer computerized adaptive testing with nonparametric models. This paper explores the possibility of computerized adaptive testing when using nonparametric item response models. A central issue is that the derivatives of item characteristic curves may not be estimated well, which eliminates the availability of the standard maximum Fisher information criterion. As alternatives, procedures based on Shannon entropy and Kullback–Leibler information are proposed. For a long test, these procedures, which do not require the derivatives of the item characteristic curves, become equivalent to the maximum Fisher information criterion. A simulation study is conducted to study the behavior of these two procedures, compared with random item selection. The study shows that the procedures based on Shannon entropy and Kullback–Leibler information perform similarly in terms of root mean square error, and perform much better than random item selection. The study also shows that item exposure rates need to be addressed for these methods to be practical. The authors would like to thank Hua Chang for his help in conducting this research.
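The entropy-based selection idea in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes ability is represented on a small discrete grid, that each item's characteristic curve is available only as tabulated probabilities on that grid (so no derivatives are needed), and the names `expected_entropy` and `select_item` are hypothetical.

```python
import math

def posterior(prior, likelihoods):
    """Normalise prior * likelihood over a discrete ability grid."""
    unnorm = [p * l for p, l in zip(prior, likelihoods)]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def entropy(dist):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in dist if p > 0)

def expected_entropy(prior, icc):
    """Expected posterior entropy after administering one item.

    icc[k] is the tabulated P(correct | ability grid point k);
    only these values are used, never a derivative of the curve.
    """
    p_correct = sum(p * c for p, c in zip(prior, icc))
    p_wrong = 1.0 - p_correct
    h = 0.0
    if p_correct > 0:
        h += p_correct * entropy(posterior(prior, icc))
    if p_wrong > 0:
        h += p_wrong * entropy(posterior(prior, [1.0 - c for c in icc]))
    return h

def select_item(prior, item_bank, administered):
    """Pick the unused item that minimises expected posterior entropy."""
    candidates = [i for i in range(len(item_bank)) if i not in administered]
    return min(candidates, key=lambda i: expected_entropy(prior, item_bank[i]))

# Hypothetical three-point ability grid and three tabulated items.
prior = [1 / 3, 1 / 3, 1 / 3]
item_bank = [
    [0.5, 0.5, 0.5],  # flat curve: carries no information about ability
    [0.1, 0.5, 0.9],  # steep curve
    [0.4, 0.5, 0.6],  # shallow curve
]
print(select_item(prior, item_bank, administered=set()))  # prints 1
```

In this toy bank the steep item is chosen first because it reduces uncertainty about ability the most, while the flat item leaves the entropy at its prior value.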

3.
A topic of continuing interest in the measurement area is response acquiescence. A recent study has demonstrated the feasibility of studying acquiescence or, more importantly, content/acquiescence correlation in the MMPI. Utilizing the components of variance approach, this study found that the variance due to acquiescence in scores on the Pt and Hy scales was small relative to content variance, but that the correlation between acquiescence and content may be substantial for the Pt scale. The present paper describes a general statistical procedure for investigating content variance, variance due to non-content characteristics of items, and the covariances of content and various item characteristics. The data from a previous paper are reanalyzed, using alternative covariance structure models. Maximum likelihood procedures which allow for a statistical test for parameters of interest are used. The results point to the significance of the content-acquiescence correlation in the Pt scale, but not in the Hy scale. The previous findings are verified statistically, and procedures which hold promise for other investigations into the properties of behavioral tests are described.

4.
A cubic spline method for smoothing equipercentile equating relationships under the common item nonequivalent populations design is described. Statistical techniques based on bootstrap estimation are presented that are designed to aid in choosing an equating method/degree of smoothing. These include: (a) asymptotic significance tests that compare no equating and linear equating to equipercentile equating; (b) a scheme for estimating total equating error and for dividing total estimated error into systematic and random components. The smoothing technique and statistical procedures are explored and illustrated using data from forms of a professional certification test.

5.
Psychometric characteristics of the Adaptive Behavior Inventory for Children (ABIC) are analyzed through five statistical procedures (internal consistency, item difficulty, correlations of item-total correlations, concurrent validity, and construct validity) using data on 436 elementary age children from three racial-ethnic and two social class groups. Data from these five statistical procedures are reported for nine demographic characteristics: children's race, social class, sex, age, birth order, health, family size, family structure, and urban acculturation. Few systematic differences are apparent on internal consistency, correlation of item-total correlations, and construct validity. Some differences are apparent on item difficulty and concurrent validity. On item difficulty the ABIC scores are higher for middle-SES, older, and first- or second-born children, and for children from families whose structures are more typical. Regarding concurrent validity, lower correlations are noted for Mexican American and Black children, for healthier children, and for less acculturated children. ABIC-achievement correlations generally are too low to be of practical value. The results are interpreted in terms of possible test bias on the ABIC.

6.
The major purpose of this paper is to describe the components of testing systems that address the primary goal of producing tests tailored to each individual’s needs and abilities. The principal components of a tailored testing procedure are: item calibration, item selection, and ability estimation. Each of these components is defined and discussed. Alternative methods for each component are presented, along with an indication of their relative complexities. Finally, the particular methods used for each component in the tailored testing procedure at the University of Missouri-Columbia are described.

7.
To date, exposure control procedures that are designed to control item exposure and test overlap simultaneously are based on the assumption of item sharing between pairs of examinees. However, examinees may obtain test information from more than one examinee in practice. This larger scope of information sharing needs to be taken into account in refining exposure control procedures. To control item exposure and test overlap among a group of examinees larger than two, the relationship between the two indices needs to be identified first. The purpose of this paper is to analytically derive the relationships between item exposure rate and each of the two forms of test overlap, item sharing and item pooling, for fixed‐length computerized adaptive tests. Item sharing is defined as the number of common items shared by all examinees in a group, while item pooling is the number of overlapping items that an examinee has with a group of examinees. The accuracy of the derived relationships was verified using numerical examples. The relationships derived will lay the foundation for future development of procedures to simultaneously control item exposure and item sharing or item pooling among a group of examinees larger than two.
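The link between exposure counts and item sharing can be illustrated with a small sketch. It does not reproduce the paper's derivation: it relies only on the counting identity that an item administered to m examinees is common to exactly C(m, g) of the possible groups of size g, and it checks that identity by brute force. The function names and the toy data are hypothetical.

```python
from itertools import combinations
from math import comb

def administration_counts(tests, pool_size):
    """Number of examinees who received each item.

    tests is a list of sets of item ids, one set per examinee.
    """
    counts = [0] * pool_size
    for test in tests:
        for item in test:
            counts[item] += 1
    return counts

def exposure_rates(tests, pool_size):
    """Proportion of examinees who received each item."""
    n = len(tests)
    return [c / n for c in administration_counts(tests, pool_size)]

def mean_item_sharing(tests, pool_size, group_size):
    """Average number of items common to every member of a group,
    taken over all groups of the given size, using counts only."""
    counts = administration_counts(tests, pool_size)
    return sum(comb(c, group_size) for c in counts) / comb(len(tests), group_size)

def mean_item_sharing_brute_force(tests, group_size):
    """Same quantity, by enumerating every group explicitly."""
    groups = list(combinations(tests, group_size))
    return sum(len(set.intersection(*g)) for g in groups) / len(groups)

# Hypothetical fixed-length tests (3 items each) drawn from a 5-item pool.
tests = [{0, 1, 2}, {0, 1, 3}, {0, 2, 3}, {0, 1, 2}]
print(exposure_rates(tests, pool_size=5))
print(mean_item_sharing(tests, pool_size=5, group_size=3))  # prints 1.5
```

Dividing the result for `group_size=2` by the test length gives the familiar pairwise test overlap rate, so the same counts cover both the pairwise case and larger groups.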

8.
9.
A Bayesian procedure is developed for the estimation of parameters in the two-parameter logistic item response model. Joint modal estimates of the parameters are obtained and procedures for the specification of prior information are described. Through simulation studies it is shown that Bayesian estimates of the parameters are superior to maximum likelihood estimates in the sense that they are (a) more meaningful since they do not drift out of range, and (b) more accurate in that they result in smaller mean squared differences between estimates and true values. The research reported here was performed pursuant to Grant No. N0014-79-C-0039 with the Office of Naval Research.

10.
A nonparametric item response theory model—the Mokken scale analysis (a stochastic elaboration of the deterministic Guttman scale)—and a computer program that performs this analysis are described. Three procedures of scaling are distinguished: a search procedure, an evaluation of the whole set of items, and an extension of an existing scale. All procedures provide a coefficient of scalability for all items that meet the criteria of the Mokken model and an item coefficient of scalability for every item. Four different types of reliability coefficient are computed both for the entire set of items and for the scalable items. A test of robustness of the found scale can be performed to analyze whether the scale is invariant across different subgroups or samples. This robustness test serves as a goodness of fit test for the established scale. The program is written in FORTRAN 77. Two versions are available, an SPSS-X procedure program (which can be used with the SPSS-X mainframe package) and a stand-alone program suitable for both mainframe and microcomputers.
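The coefficient of scalability mentioned in this abstract can be sketched for dichotomous items as follows. This is not the FORTRAN 77 program described above; it is a minimal illustration of Loevinger's H as commonly used in Mokken scaling, namely the sum of observed inter-item covariances divided by the sum of the largest covariances the item marginals allow. The function name `scalability_h` is hypothetical, and the sketch assumes at least two items whose proportions correct are strictly between 0 and 1.

```python
from itertools import combinations

def scalability_h(data):
    """Scale-level H for a list of rows of 0/1 item responses."""
    n = len(data)
    k = len(data[0])
    # Proportion of positive responses per item.
    p = [sum(row[j] for row in data) / n for j in range(k)]
    cov_sum = 0.0
    max_cov_sum = 0.0
    for i, j in combinations(range(k), 2):
        p_both = sum(row[i] * row[j] for row in data) / n
        cov_sum += p_both - p[i] * p[j]
        # Largest covariance attainable with these marginals:
        # everyone who passes the harder item also passes the easier one.
        max_cov_sum += min(p[i], p[j]) * (1.0 - max(p[i], p[j]))
    return cov_sum / max_cov_sum

# A perfect Guttman pattern: items ordered from easiest to hardest.
guttman = [
    [0, 0, 0],
    [1, 0, 0],
    [1, 1, 0],
    [1, 1, 1],
]
print(scalability_h(guttman))  # prints 1.0
```

A perfect Guttman pattern yields H = 1, which matches the description of the Mokken model as a stochastic elaboration of the deterministic Guttman scale; statistically independent items yield H = 0.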

11.
The frequently observed superior memory for the first items on a list is referred to as primacy. The aetiology of this effect in terms of cognitive processes and their neural substrate is subject to an ongoing debate. However, the brain areas generally involved in successful encoding are well described by subsequent memory studies in which activity during encoding is correlated with memory performance. We employed an object-location association paradigm to differentiate the neural correlate of the primacy from the subsequent memory effect. Activity in the intraparietal sulcus predicted memory performance across all encoding positions. Increased activity in the inferior parietal lobe and angular gyrus resulted exclusively in a more efficient encoding of the first item presented. These areas are part of the ventral frontoparietal network involved in stimulus driven attention. Our results implicate the relatively elevated attention to the first item, probably due to its contextual distinctiveness, as a major contributor to the primacy effect.

12.
While there is an ample amount of consumer behavior research that recruits processing fluency as an explanatory construct, the question of how best to measure the fluency experience has received little attention. Therefore, there is a lack of consistency in measuring the construct, particularly with regard to the use of single‐item versus multi‐item measures. The current research, thus, aims to investigate how processing fluency can be consistently measured across different experimental fluency manipulations and which type of measure has the highest validity. Based on classic scale development procedures, we propose a reliable and valid multi‐item measure and compare this measure against a single‐item measure in terms of predictive validity. We show that both measures mediate the effect of five established fluency manipulations and that the single‐item measure is sufficient. In addition to providing a measure for future research that can be adapted to different empirical settings, we provide empirical evidence on the replicability of fluency effects and on the theoretical conjecture that people have a uniform fluency experience across different manipulations of fluency.

13.
A new instrument, the Chinese Adolescent Self‐Esteem Scales (CASES), was developed to measure the self‐concepts of the young people in Hong Kong in seven aspects: social, academic, appearance, moral, family, physical/sport, and general self‐esteem. LISREL procedures were utilized to test the extent of factorial invariance for age and gender based on the responses to CASES of 551 Hong Kong adolescents. It was found that CASES possesses the necessary invariance properties for between‐group measurement in terms of the number and pattern of the underlying factors, item factor loadings, and inter‐factor relations, but not in terms of item uniqueness. Implications of these findings are discussed in terms of the use of CASES and of empirical support for the equivalence of self‐concept factor structure for age and gender groups for both Western and non‐Western adolescents.

14.
15.
The original data of McGurk's classic study of black-white differences on cultural and noncultural tests are re-analyzed at the item level to investigate the role of possible item biases that would cause the noncultural items to be relatively more difficult than the cultural items for blacks than for whites. The evidence indicates that McGurk's results cannot be explained in terms of item biases, but appear to be the result of the noncultural items requiring more sheer reasoning ability than the cultural items, which depend more on acquired information.

16.
This article (a) describes how McDonald's nonlinear factor analytic approach to the normal ogive curve can be used to factor analyse total test scores, (b) discusses the conditions in which this model is more appropriate than the widely used linear model, and (c) illustrates the applicability of both models using an empirical example. The rationale for the described procedure is that the test scores are simple sums of binary item responses whose item characteristic curves are adequately represented by normal ogives. The results obtained in the empirical example are meaningful and informative, and agree with the results obtained at the item level.

17.
Two experiments are reported in which postevent source of misinformation was manipulated within weapon-present and weapon-absent scenarios. Participants viewed slides depicting either a weapon or a newspaper event and then received either incomplete questioning or a narrative. Both postevent sources contained misleading information about a central and peripheral detail concerning either the weapon or the newspaper scenario. With a modified test in Experiment 1, questioning was found to increase misinformation effects concerning the central item, as compared with a narrative, and more misinformation effects were found for the weapon-peripheral than for the newspaper-peripheral item. In Experiment 2, the participants were more likely to claim to have seen contradictory and additive misinformation about the central item in the slides following questioning, and more contradictory and additive misinformation effects occurred for the weapon-peripheral than for the newspaper-peripheral item. The findings are considered in terms of the effects of both postevent and encoding factors on memory.

18.
We have developed a set of naming and recognition tests for evaluating the retrieval of lexical and conceptual knowledge for actions. As a first step, normative information about 280 items was collected for the following variables: (1) the naming responses elicited by each item, (2) the degree to which the image of each item agreed with a target name, (3) the familiarity of each depicted action, and (4) the visual complexity of each item. This information was used to develop administration and scoring procedures for a standardized test of action naming. The effectiveness and reliability of these procedures were evaluated in a second experiment. In a third experiment, five tests were developed to probe the retrieval of conceptual knowledge: (1) independently of the production of a naming response, (2) in response to pictorial and nonpictorial stimuli, (3) in terms of the attributes associated with specific actions, and (4) in terms of similarities and differences between various actions.

19.
20.
Remembering that an item occurred in several different lists is formulated here in terms of retrieval of corresponding list tags associated to the item. Therefore, associative interference should operate upon remembering the several list contexts in which an item appeared. Experimental Ss studied four (or five) overlapping lists of 16 words, sampled from a master set of 32 words, with a given word exemplifying one of the 2^4 (or 2^5) possible sequences of appearances and nonappearances over the four (or five) lists. Later Ss rated from memory for each word and for each list whether that word had occurred in that list. After correcting for interlist generalization effects, indices of discriminative memory revealed strong proactive interference and weaker retroactive interference. Discriminative memory that an item occurred in a given list was poorer the more prior or more subsequent lists in which that item had also occurred. Thus, list differentiation appears explicable in terms of item-specific associative interference.
