首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 593 毫秒
1.
应征公民心理选拔的人格评估   总被引:2,自引:1,他引:1  
应征公民人格评估的目的是对精神分裂症病前人格特征进行预测性评价。采取定性与定量两种研究方法编制征兵专用人格问卷,并对其进行指标验证。结果发现:①量表应答得分区间可较好地区分正常新兵与精神分裂症被试;②二阶因子分析将人格分量表部分划分为3个维度8个因子;③精神分裂症患者人格分量表分数各指标均显著高于正常新兵;④总预测符合率和预测合格符合率均在98%以上,预测淘汰符合率最低为70.13%。上述结果表明,本研究编制的中国士兵人格问卷(CSPQ)具有良好的信度和效度,适用于我国应征公民心理检测以及我军士兵的人格测试  相似文献   

2.
测验垂直等值是指将测试同一心理特质的不同水平的测验转换到同一个分数量尺上的过程。IRT与MIRT是实现垂直等值的主要方法。IRT无需假设被试的能力分布, 参数估计不依赖于样本, 是构建垂直量表的有效方法, 但测验不满足单维假设时其应用受到限制。MIRT结合IRT和因素分析的特点对IRT进行了拓展, 可更有效估计多维测验的项目参数和被试能力参数, 在垂直等值中有重要应用。已有研究主要探讨IRT和MIRT在垂直等值应用中的适用性、标定方法和参数估计方法, 比较研究两种方法的特性。未来研究应纳入更多变量条件进行比较研究, 拓展方法的应用。  相似文献   

3.
由美国印第安纳州立大学波特(R·porter)同卡特尔(R·B·Cattell)一起编制而成的儿童十四种人格因素问卷是公认比较好的儿童人格测验量表。我们在全国六大区十二个省市抽取了2302名被试,修订了8-11岁儿童(男、女)和12-14岁少年(男、女)的四个常模。同时,进行了效度和信度检验。此量表可作对中小学生进行人格特点的诊断与研究用。  相似文献   

4.
IRT展开模型及对非累积反应机制的检测   总被引:1,自引:1,他引:0  
郭庆科  苗金凤  王昭 《心理学探新》2006,26(1):66-69,78
被试回答人格测验题目时并不是特质水平越高其得分率越高,这称为非累积反应机制。广义等级展开模型GGUM就是针对这一机制提出来的。使用EPQ和五因素人格问卷发现GGUM比累积IRT模型有更好的模型拟合度和测量精度。研究结果表明GGUM有其合理性,且有助于反应心理过程机制的深入探讨。  相似文献   

5.
初中学生自尊特点的初步研究   总被引:45,自引:5,他引:40  
张文新 《心理科学》1997,20(6):504-508
对991名城乡在校初中生施以Coopersmith自尊问卷(25条目版)测验。对问卷的测量学分析发现:该自尊问卷的中文版具有较高的信度,其与有关测量工具之间的效标关联效度合乎逻辑;对被试自尊特点的分析发现:初中阶段学生的自尊存在极显著的年级差异,初中二年级开始自尊极显著地降低;初中生的自尊不存在性别差异;城市被试的自尊在总体上高于农村被试,但城乡因素与被试的性别有交互作用;独生子女的自尊高于非独生子女,但这一差异仅存在于初中一年级。  相似文献   

6.
人格测验中作假的控制方法   总被引:2,自引:0,他引:2  
被试很容易对人格测验作假,这严重影响了人格测验的有效性。目前测评专家已经提出了一些应对作假的方法,它们可被分为事前控制技术和事后识别技术两大类。前者包括迫选式量表,警告及假渠道技术等,后者包括作假识别量表,IRT及反应时识别技术等。目前,在人格测验中嵌套使用作假识别量表,以及在测验指导语中加入警告是比较有效的两种方法,迫选式量表的发展也值得期待。由于研究者对作假的内部发生机制了解较少,这制约了IRT与反应时识别技术的发展。  相似文献   

7.
违法犯罪者人格多种方法研究   总被引:5,自引:0,他引:5  
孔克勤  朱晨海 《心理科学》1997,20(4):307-310
本研究运用问卷法“YG人格测验”、作业法“内田-克雷佩林心理测验”和投射法“色塔人格测验”对126名违法犯罪者进行了测试。结果表明,作业法和投射法人格测验二者相互补充和印证,揭示了违法犯罪者人格的某些特点,问卷法人格测验的结果与前者不一致,应该运用多种方法对违法犯罪者的人格进行研究。  相似文献   

8.
Klaus D. Kubinger 《心理学报》2009,41(10):1024-1036
目前多数人格测验(特别是在中国使用的人格测验)基本上都是人格问卷, 基于实验的行为评估类客观化人格测验应用很少; 而后者近来在德语圈国家中则有复苏的迹象。因此, 本文综述了此类客观测验相对于人格问卷来说所具有的特点和优势, 如, 被试很难在这类客观化人格测验中作伪。本文介绍了维也纳研究小组所做的几个测验, 并讨论了这些测验的心理测量学性质和缺点。最后, 还列举了这些测验的实际应用。  相似文献   

9.
心理与教育测验中存在着被试作答异常现象(能力测验中的猜测现象和睡眠现象, 人格测验中的非0下渐近线现象和非1上渐近线现象), 会导致被试能力或人格特征的测量偏差。在能力测验中, 研究者已提出了多种方法来纠正猜测现象和睡眠现象, 这些方法往往需要调整或删除被试作答信息, 而四参数模型不需要改变被试作答信息而能有效纠正被试能力高估或低估现象。在人格测验中存在着非0下渐近线和非1上渐近线现象, 四参数模型能增强测验项目拟合性能, 提高人格测验的准确性。  相似文献   

10.
余嘉元 《心理学报》2002,34(5):80-86
运用联结主义中的级连相关模型对于小样本条件下的连续记分项目反应理论 (IRT)模型的项目参数和被试能力进行了估计。一组被试对于一组项目的反应矩阵作为级连相关模型的输入 ,这组被试的能力θ或该组项目的参数a、b和c作为该模型的输出 ,对神经网络进行训练使之具备了估计θ,a ,b或c的能力。计算机模拟的实验表明 ,如果测验中有少量项目取自于题库 ,就可以运用联结主义方法对IRT参数和被试能力进行较好的估计  相似文献   

11.
ABSTRACT Correlational and factor‐analytic methods indicate that abnormal and normal personality constructs may be tapping the same underlying latent trait. However, they do not systematically demonstrate that measures of abnormal personality capture more extreme ranges of the latent trait than measures of normal range personality. Item Response Theory (IRT) methods, in contrast, do provide this information. In the present study, we use IRT methods to evaluate the range of the latent trait assessed with a normal personality measure and a measure of psychopathy as one example of an abnormal personality construct. Contrary to the expectation that the measure of psychopathy would be more extreme than the measure of normal personality traits, the measures overlapped substantially in terms of the regions of the latent trait for which they provide information. Moreover, both types of inventories were limited in terms of measurement bandwidth, such that they did not provide information across the entire latent trait continuum. Implications and future directions are discussed.  相似文献   

12.
In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native English-speaking children between the ages of 9 and 15 years. We also tested for measurement invariance for these items across age and gender groups using item response theory (IRT). Results of the exploratory factor analysis indicated that a two-factor model that distinguished between verbal and perceptual items provided the best fit to the data. Although the items demonstrated measurement invariance across age groups, measurement invariance was violated for gender groups, with two items demonstrating differential item functioning for males and females. Multigroup analysis using all 16 items indicated that the items were more effective for individuals whose IRT scale scores were relatively high. A single-group explanatory IRT model using 14 non-differential item functioning items showed that for perceptual ability, females scored higher than males and that scores increased with age for both males and females; for verbal ability, the observed increase in scores across age differed for males and females. The implications of these findings are discussed.  相似文献   

13.
Abstract

Differential item functioning (DIF) is a pernicious statistical issue that can mask true group differences on a target latent construct. A considerable amount of research has focused on evaluating methods for testing DIF, such as using likelihood ratio tests in item response theory (IRT). Most of this research has focused on the asymptotic properties of DIF testing, in part because many latent variable methods require large samples to obtain stable parameter estimates. Much less research has evaluated these methods in small sample sizes despite the fact that many social and behavioral scientists frequently encounter small samples in practice. In this article, we examine the extent to which model complexity—the number of model parameters estimated simultaneously—affects the recovery of DIF in small samples. We compare three models that vary in complexity: logistic regression with sum scores, the 1-parameter logistic IRT model, and the 2-parameter logistic IRT model. We expected that logistic regression with sum scores and the 1-parameter logistic IRT model would more accurately estimate DIF because these models yielded more stable estimates despite being misspecified. Indeed, a simulation study and empirical example of adolescent substance use show that, even when data are generated from / assumed to be a 2-parameter logistic IRT, using parsimonious models in small samples leads to more powerful tests of DIF while adequately controlling for Type I error. We also provide evidence for minimum sample sizes needed to detect DIF, and we evaluate whether applying corrections for multiple testing is advisable. Finally, we provide recommendations for applied researchers who conduct DIF analyses in small samples.  相似文献   

14.
刘红云  骆方 《心理学报》2008,40(1):92-100
作者简要介绍了多水平项目反应模型,对多水平项目反应理论与通常项目反应理论之间的关系进行了探讨,得到了多水平项目反应模型参数与通常项目反应模型参数之间的关系,并讨论了多水平项目反应模型的推广模型。通过一个实际例子,用多水平项目反应模型对测验中项目的特征进行分析;检验个体水平和组水平预测变量对能力参数的影响;对项目功能差异进行分析。最后文章就多水平项目反应理论模型的优势与不足进行了讨论  相似文献   

15.
Item response theory (IRT) provides valuable methods for the analysis of the psychometric properties of a psychological measure. To date, however, these methods have not been used frequently by personality assessment researchers, in part because many researchers have not been introduced to the methods and in part because most of the development of IRT has taken place in applied education assessment settings, resulting in terminology that is ability focused rather than trait focused. The purpose of this article is twofold. First, an overview of IRT is presented, highlighting the concepts of the three-parameter IRT model, item and test information, and conditional standard error of measurement. Second, the psychometric properties of the (MMPI-2) PSY-5 scales are examined to demonstrate IRT's value.  相似文献   

16.
杨向东 《心理科学进展》2010,18(8):1349-1358
从测验项目解决的认知过程的视角分析了在不同测验理论框架下的测量模型中的基本假设, 指出测量模型是测验开发者有关测验项目反应机制的理论假设的具体表征, 是系统检验测量假设和过程的统计框架。然而, 不管是经典测验理论、概化理论, 还是早期的项目反应理论模型, 相关假设都过于简化, 缺少相应实质理论的支持。与之相比, 认知测量模型强调与个体在测验项目反应过程中的认知过程、认知策略和知识结构的对应性, 提供了在实质理论基础上界定测量建构、设计测验项目、进行建模分析和解释的可能性, 为日益边缘化的心理测量学和主流心理学研究的融合奠定了基础。  相似文献   

17.
Signal Detection Theory (SDT; MacMillan & Creelman, 1991) is a method of data collection that has been used for several years, which describes the decision-making strategies of individuals. However, its use has been largely restricted to experiments involving sensation and perception. The Overclaiming Questionnaire (OCQ; Paulhus & Bruce, 1990) is a scale that has been developed to measure intellectual ability and personality, using SDT as a guideline. Although the scale has been successful in measuring human characteristics such as narcissism and intelligence, it is still unclear how to measure the characteristics of the various stimuli used (e.g., item difficulty, item discrimination, etc.). In some ways, this is a direct consequence of the general lack of research involved in item parameter estimation in the field of SDT. Using the OCQ, this article presents a graphical and nonparametric form of item response modeling to address this issue. In many ways, the approach is influenced by and structured around item response theory (IRT; Hambleton, Swaminathan, & Rogers, 1991). The general features of both SDT and IRT are described. Results suggest that this method is indeed a reasonable approach to describing item functioning, and there are several advantages to using this method over traditional IRT methods. Furthermore, SDT appears to be a fruitful approach to assessing intelligence, ability, and other psychological constructs, with advantages over traditional approaches. Overall, the results provide interesting implications for item selection and test development in several scientific and academic fields.  相似文献   

18.
项目反应理论是测量被试潜在特质的现代测量理论, 潜在类别分析是基于模型的潜在特质分类技术。混合项目反应理论将项目反应理论与潜在类别分析相结合, 能够同时对被试分类并量化其潜在特质。在阐述混合项目反应理论概念、原理的基础上, 介绍了MRM、mNRM和mPCM等几种常见混合模型及其参数估计方法, 并从心理与行为特征分类、项目功能差异检测、测验效度评价等方面评述了其在心理测验中的应用发展轨迹。  相似文献   

19.
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The objective was to provide bounds of the likely DIF effects on these measurement consequences. Five factors were manipulated: test length, percentage of DIF items per form, item type, sample size, and level of group ability difference. Results indicate that the greatest DIF effect was less than 2 points on the 0 to 60 total score scale and about 0.15 on the IRT ability scale. DIF had a limited effect on the ratio of true-score variance to observed-score variance, but its influence on the standard error of estimation for the IRT ability parameter was evident for certain ability values.  相似文献   

20.
等级反应模型下项目特征曲线等值法在大型考试中的应用   总被引:2,自引:1,他引:1  
在中国最大的资格考试之一的经济专业资格考试中,为保证不同年度间考试的可比性、进行题库建设和为计算机自适应考试做准备,应用项目反应理论中等级反应模型下的项目特征曲线等值法,采用铆测验等值设计,实现了4个年度考试资料的项目参数和能力参数的等值,并成功地组建了经济专业题库。在此基础上,利用等值技术对不同年份试卷的划界分数进行了比较,为经济考试的合格标准制定、确保考试的公平性提供了实证依据。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号