期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

蔡艳涂冬波丁树良《心理学探新》2014,34(5):426-430

多维项目反应理论因其模型本身的天然优势及其兼具因素分析与项目反应理论于一身的优点,而被广大研究者及应用者所重视.本研究在前人研究基础上,重点讨论MIRT多维能力及能力间相关矩阵的参数估计问题.研究采用Monte Carlo模拟方法进行,在三因素完全随机设计（4 ×3×3）下,使用MCMC算法,探讨测验维度数、维度间的相关大小和测验项目数三个因素对MIRT能力及其相关矩阵估计的影响. 相似文献

2.

IRT中最小化χ2/EM参数估计方法

朱玮丁树良陈小攀《心理学报》2006,38(3):453-460

对IRT的双参数Logistic模型（2PLM）中未知参数估计问题,给出了一个新的估计方法――最小化χ2/EM估计。新方法在充分考虑项目反应理论(IRT)与经典测量理论(CTT)之间的差异的前提下,从统计计算的角度改进了Berkson的最小化χ2估计,取消了Berkson实施最小化χ2估计时需要已知能力参数的不合实际的前提,扩大了应用范围。实验结果表明新方法能力参数的估计结果与BILOG相比,精确度要高,且当样本容量超过2000时,项目参数的估计结果也优于BILOG。实验还表明新方法稳健性好相似文献

3.

A Person Fit Test For Irt Models For Polytomous Items

C. A. W. Glas Anna Villa T. Dagohoy 《Psychometrika》2007,72(2):159-180

A person fit test based on the Lagrange multiplier test is presented for three item response theory models for polytomous items: the generalized partial credit model, the sequential model, and the graded response model. The test can also be used in the framework of multidimensional ability parameters. It is shown that the Lagrange multiplier statistic can take both the effects of estimation of the item parameters and the estimation of the person parameters into account. The Lagrange multiplier statistic has an asymptotic χ²-distribution. The Type I error rate and power are investigated using simulation studies. Results show that test statistics that ignore the effects of estimation of the persons’ ability parameters have decreased Type I error rates and power. Incorporating a correction to account for the effects of the estimation of the persons’ ability parameters results in acceptable Type I error rates and power characteristics; incorporating a correction for the estimation of the item parameters has very little additional effect. It is investigated to what extent the three models give comparable results, both in the simulation studies and in an example using data from the NEO Personality Inventory-Revised. 相似文献

4.

A Newton procedure for conditionally linear mixed-effects models

Blozis SA 《Behavior research methods》2007,39(4):695-708

This article reviews Newton procedures for the analysis of mean and covariance structures that may be functions of parameters that enter a model nonlinearly. The kind of model considered is a mixed-effects model that is conditionally linear with regard to its parameters. This means parameters entering the model nonlinearly must be fixed, whereas those entering linearly may vary across individuals. This framework encompasses several models, including hierarchical linear models, linear and nonlinear factor analysis models, and nonlinear latent curve models. A full maximum-likelihood estimation procedure is described. Mx, a statistical software package often used to estimate structural equation models, is considered for estimation of these models. An example with Mx syntax is provided. 相似文献

5.

GPCM模型项目参数估计程序的开发与研究

周明元甘登文丁树良《心理学探新》2005,25(2):57-60

该文对GPCM的项目参数估计的方法进行较为深入的探讨,特别对容易混淆的参数初值计算以及项目参数分两步进行估计的原因进行了阐述。并且基于MMLE／EM算法,开发了相应的项目参数估计程序。通过大量的蒙特卡洛模拟,与国外专业软件PARSCALE比较,本程序的步骤参数估计值更好,区分度参数估计值相当,具有较好的应用价值。相似文献

6.

多维项目反应理论补偿性模型参数估计:基于广义回归神经网络集合

下载免费PDF全文

王鹏孟维璇朱干成张登浩张利会董一萱司英栋《心理学探新》2019,(3):244-249

运用广义回归神经网络(GRNN)方法对小样本多维项目反应理论(MIRT)补偿性模型的项目参数进行估计,尝试解决传统参数估计方法样本数量要求较大的问题。MIRT双参数Logistic补偿模型被设置为二级计分的二维模型。首先,模拟二维能力参数、项目参数值与考生作答矩阵。其次,把通过主成分分析得到的前两个因子在每个题目上的载荷作为区分度的初始值以及题目通过率作为难度的初始值,这两个指标的初始值作为神经网络的输入。集成100个神经网络,其输出值的均值作为MIRT的项目参数估计值。最后,设置2×2种(能力相关水平:0.3和0.7; 两种估计方法:GRNN和MCMC方法)实验处理,对GRNN和MCMC估计方法的返真性进行比较。结果表明,小样本的情况下,基于GRNN集成方法的参数估计结果优于MCMC方法。相似文献

7.

Random effects and extended generalized partial credit models

David J. Hessen 《The British journal of mathematical and statistical psychology》2021,74(2):232-256

In this paper it is shown that under the random effects generalized partial credit model for the measurement of a single latent variable by a set of polytomously scored items, the joint marginal probability distribution of the item scores has a closed-form expression in terms of item category location parameters, parameters that characterize the distribution of the latent variable in the subpopulation of examinees with a zero score on all items, and item-scaling parameters. Due to this closed-form expression, all parameters of the random effects generalized partial credit model can be estimated using marginal maximum likelihood estimation without assuming a particular distribution of the latent variable in the population of examinees and without using numerical integration. Also due to this closed-form expression, new special cases of the random effects generalized partial credit model can be identified. In addition to these new special cases, a slightly more general model than the random effects generalized partial credit model is presented. This slightly more general model is called the extended generalized partial credit model. Attention is paid to maximum likelihood estimation of the parameters of the extended generalized partial credit model and to assessing the goodness of fit of the model using generalized likelihood ratio tests. Attention is also paid to person parameter estimation under the random effects generalized partial credit model. It is shown that expected a posteriori estimates can be obtained for all possible score patterns. A simulation study is carried out to show the usefulness of the proposed models compared to the standard models that assume normality of the latent variable in the population of examinees. In an empirical example, some of the procedures proposed are demonstrated. 相似文献

8.

Goodness-of-fit patterns in a computer cross-validation procedure comparing a linear and a threshold model

Chaeles E. Collyer 《Behavior research methods》1986,18(6):618-622

Cross-validation is the process of comparing a model’s predictions to data that were not used in the estimation of model parameters. Cross-validation may have some value in identifying source models, especially in cases where the corresponding fitted models require the estimation of different numbers of parameters. Some of the information available from cross-validation is illustrated using a linear and a threshold model, and goodness-of-fit patterns are contrasted with those of conventional model-fitting. 相似文献

9.

认知诊断中多分属性与二分属性的对比研究

昌维詹沛达王立君《心理科学》2018,(4):982-988

摘要：Karelitz（2004）和詹沛达等（2016）认为1个多分属性内部(Lk+1)个水平的关系相当于Lk个部分满足线型层级关系的二分属性。本研究的目的是通过比较多分属性模型和二分属性模型的判准率,从而验证多分属性和二分属性间是否存在以上关系。结果表明：当属性个数较少时,两个模型的模式判准率相当,随着属性个数增加,多分属性模型的模式判准率高于二分属性模型的模式判准率。结论：在一定程度上,多分属性和二分属性之间确实存在以上关系,但两者并非完全等价,二者间的差异随着属性个数增加更加明显。相似文献

10.

R-RUM的参数估计及性能评价研究

赵顶位戴海琦《心理学探新》2017,(3):231-236

认知诊断作为21世纪一种新的测量范式,在国内外越来越受到重视。该文运用MCMC算法实现了R-RUM的参数估计,并采用Monte Carlo模拟方法探讨其性能。研究结果表明:(1)R-RUM参数估计方法可行,估计精度较高;(2)Q矩阵复杂性和模型参数水平对模型参数估计精度有较大影响,随着r_(jk)*值的增大和Q矩阵复杂性的增加,项目参数和被试参数估计精度逐渐下降;(3)在特定情形下,R-RUM具有一定的稳健性。相似文献

11.

Representations for biased numerical judgments

Robert F. Fagot Manard R. Stewart Richard E. Kleinknecht 《Attention, perception & psychophysics》1975,17(3):309-319

Models are presented that introduce bias parameters into the numerical representations for ratio and interval judgments. An interval bias model is considered that has the capability of accounting for both interval and ratio judgments. A contrasting ratio bias model—a simple generalization of the classical model that accepts ratio judgments at face value—is also considered, and both models are tested with brightness data. The ratio estimation data lead to the conclusion that the constraints placed on the structure of the data by the classical model are too strong. In terms of goodness-of-fit and certain theoretical criteria, the interval bias model accounts for the ratio estimation data better than the ratio bias model. The interval estimation data are supportive of recent efforts directed toward the development of direct interval scaling theory. 相似文献

12.

Paired comparisons with individual differences

Dr. Erling B. Andersen 《Psychometrika》1976,41(2):141-157

A model of Bradley-Terry type for paired comparisons is considered. In addition to the usual parameters, the model allows for individual parameters and corresponding parameters for the choice-alternatives. The model is applied to a set of data from a Danish investigation of the attitude of blue collar workers towards alternative social gains. The proposed model is finally compared to a model recently suggested by Schönemann and Wang. 相似文献

13.

A marginal maximum likelihood method for the vector threshold model to analyze dichotomous choice data

Hiroshi Hojo 《The Japanese psychological research》2003,45(3):188-201

Abstract: At least two types of models, the vector model and the unfolding model can be used for the analysis of dichotomous choice data taken from, for example, the pick any/ n method. The previous vector threshold models have a difficulty with estimation of the nuisance parameters such as the individual vectors and thresholds. This paper proposes a new probabilistic vector threshold model, where, unlike the former vector models, the angle that defines an individual vector is a random variable, and where the marginal maximum likelihood estimation method using the expectation-maximization algorithm is adopted to avoid incidental parameters. The paper also attempts to discuss which of the two models is more appropriate to account for dichotomous choice data. Two sets of dichotomous choice data are analyzed by the model. 相似文献

14.

Refining the two‐parameter testlet response model by introducing testlet discrimination parameters

Jian Tao Bao Xu Ning‐Zhong Shi Hong Jiao 《The Japanese psychological research》2013,55(3):284-291

For testlet response data, traditional item response theory (IRT) models are often not appropriate due to local dependence presented among items within a common testlet. Several testlet‐based IRT models have been developed to model examinees' responses. In this paper, a new two‐parameter normal ogive testlet response theory (2PNOTRT) model for dichotomous items is proposed by introducing testlet discrimination parameters. A Bayesian model parameter estimation approach via a data augmentation scheme is developed. Simulations are conducted to evaluate the performance of the proposed 2PNOTRT model. The results indicated that the estimation of item parameters is satisfactory overall from the viewpoint of convergence. Finally, the proposed 2PNOTRT model is applied to a set of real testlet data. 相似文献

15.

Empirical underidentification in estimating random utility models: The role of choice sets and standardizations

Sebastian Olschewski Pavel Sirotkin Jörg Rieskamp 《The British journal of mathematical and statistical psychology》2022,75(2):252-292

A standard approach to distinguishing people’s risk preferences is to estimate a random utility model using a power utility function to characterize the preferences and a logit function to capture choice consistency. We demonstrate that with often-used choice situations, this model suffers from empirical underidentification, meaning that parameters cannot be estimated precisely. With simulations of estimation accuracy and Kullback–Leibler divergence measures we examined factors that potentially mitigate this problem. First, using a choice set that guarantees a switch in the utility order between two risky gambles in the range of plausible values leads to higher estimation accuracy than randomly created choice sets or the purpose-built choice sets common in the literature. Second, parameter estimates are regularly correlated, which contributes to empirical underidentification. Examining standardizations of the utility scale, we show that they mitigate this correlation and additionally improve the estimation accuracy for choice consistency. Yet, they can have detrimental effects on the estimation accuracy of risk preference. Finally, we also show how repeated versus distinct choice sets and an increase in observations affect estimation accuracy. Together, these results should help researchers make informed design choices to estimate parameters in the random utility model more precisely. 相似文献

16.

Marginalized maximum a posteriori estimation for the four-parameter logistic model under a mixture modelling framework

Xiangbin Meng Gongjun Xu Jiwei Zhang Jian Tao 《The British journal of mathematical and statistical psychology》2020,73(Z1):51-82

The four-parameter logistic model (4PLM) has recently attracted much interest in various applications. Motivated by recent studies that re-express the four-parameter model as a mixture model with two levels of latent variables, this paper develops a new expectation–maximization (EM) algorithm for marginalized maximum a posteriori estimation of the 4PLM parameters. The mixture modelling framework of the 4PLM not only makes the proposed EM algorithm easier to implement in practice, but also provides a natural connection with popular cognitive diagnosis models. Simulation studies were conducted to show the good performance of the proposed estimation method and to investigate the impact of the additional upper asymptote parameter on the estimation of other parameters. Moreover, a real data set was analysed using the 4PLM to show its improved performance over the three-parameter logistic model. 相似文献

17.

项目反应理论新进展：基于3PLM和GRM的混合模型

下载免费PDF全文

涂冬波蔡艳戴海琦丁树良《心理科学》2011,34(5):1189-1194

IRT中的计量模型较多,不同计量模型适合不同特点的数据资料,实际工作者应根据实际情况选择适当的IRT模型来分析数据。我国是个考试、测评大国,测评的题型丰富多样,在实际应用IRT时,一个模型往往很难反应所有数据资料本身的特点,这时可考虑应用多个IRT模型（即“混合模型”）来分析,以达到对数据的最佳拟合。本文对混合模型的思想方法及原理、参数估计的实现、以及模型性能进行了研究,发现：（1）本文自主开发的混合模型参数估计程序Mix_Tu具有较高的返真性,且与国际知名测量软件Parscale相当。（2）在“项目异常”情况下,Mix_Tu程序对参数b和c的估计受数据异常程度的影响要大于Parscale程序,而对参数a的估计受数据异常程度的影响要小于Parscale程序,而在参数theta上两个程序相当。（3）在“被试异常”情况下,Mix_Tu程序对所有参数的估计受数据异常程度的影响均要小于Parscale程序,Mix_Tu程序表现的更为稳健。相似文献

18.

DifScal: A tool for analyzing difference ratings on an ordinal category scale

Martin C. Boschman 《Behavior research methods》2001,33(1):10-20

An algorithm for analyzing difference scaling results is described. Frequency data on ordered categories that represent perceived differences for a unidimensional psychological attribute are modeled according to Thurstone’s judgment scaling model. The algorithm applies the gradient method for the maximum likelihood estimation of the model parameters. Two ways to calculate the start configuration for the model parameters are elaborated. The algorithm also provides asymptotic values for the standard errors of the estimates and three measures for the goodness of the model fit. An additional feature of DifScal is that it is suited to analyze incomplete data. 相似文献

19.

A Bayesian approach to nonlinear latent variable models using the Gibbs sampler and the metropolis-hastings algorithm 总被引：1，自引：0，他引：1

Gerhard Arminger Bengt O. Muthén 《Psychometrika》1998,63(3):271-300

Nonlinear latent variable models are specified that include quadratic forms and interactions of latent regressor variables as special cases. To estimate the parameters, the models are put in a Bayesian framework with conjugate priors for the parameters. The posterior distributions of the parameters and the latent variables are estimated using Markov chain Monte Carlo methods such as the Gibbs sampler and the Metropolis-Hastings algorithm. The proposed estimation methods are illustrated by two simulation studies and by the estimation of a non-linear model for the dependence of performance on task complexity and goal specificity using empirical data. 相似文献

20.

Specifying optimum examinees for item parameter estimation in item response theory

Martha L. Stocking 《Psychometrika》1990,55(3):461-475

Information functions are used to find the optimum ability levels and maximum contributions to information for estimating item parameters in three commonly used logistic item response models. For the three and two parameter logistic models, examinees who contribute maximally to the estimation of item difficulty contribute little to the estimation of item discrimination. This suggests that in applications that depend heavily upon the veracity of individual item parameter estimates (e.g. adaptive testing or text construction), better item calibration results may be obtained (for fixed sample sizes) from examinee calibration samples in which ability is widely dispersed.This work was supported by Contract No. N00014-83-C-0457, project designation NR 150-520, from Cognitive Science Program, Cognitive and Neural Sciences Division, Office of Naval Research and Educational Testing Service through the Program Research Planning Council. Reproduction in whole or in part is permitted for any purpose of the United States Government. The author wishes to acknowledge the invaluable assistance of Maxine B. Kingston in carrying out this study, and to thank Charles Lewis for his many insightful comments on earlier drafts of this paper. 相似文献