Similar Literature
Found 20 similar documents; search took 437 ms
1.
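Building on a review of recent research on situational judgment tests (SJTs), this article summarizes their criterion-related validity, construct validity, and incremental validity, as well as the factors that influence SJT validity. The research shows that SJTs have relatively high criterion-related validity and are a good tool for personnel selection; that the SJT is a measurement method that can be used to measure specified constructs; that SJTs have incremental validity over variables such as cognitive ability, personality, and job knowledge; and that item characteristics, test development approach, research design, and scoring method all affect SJT validity.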

2.
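Current research on situational judgment tests shows two main trends: validity research and cross-cultural comparative research. Validity research mainly comprises studies of criterion-related validity and construct validity, while cross-cultural comparative research mainly examines the cultural fairness of the tests and how well they predict the job performance of employees from different ethnic groups. The article also introduces an emerging trend: research on the nature of situational judgment tests, that is, how the format characteristics of the test and test takers' information processing affect test results.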

3.
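Development Procedures, Construct Validity, and Research Trends of Situational Judgment Tests   Total citations: 9, self-citations: 0, citations by others: 9
The article describes in detail the general procedure for developing situational judgment tests and summarizes and compares their various formats and scoring methods. It also analyzes the construct validity of situational judgment tests in terms of the relationships between test results and cognitive ability, personality, and work experience, and argues that these tests measure a multidimensional construct. Finally, the article argues that further research is needed in four areas: relationships with other constructs, the measurement of specified constructs, factors affecting validity, and cross-cultural comparison.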

4.
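Objective: To revise the Meier Art Judgment Test and examine its reliability and validity. Methods: The Meier Art Judgment Test was administered to 2,270 students from six universities and secondary specialized schools. Items were screened using CTT discrimination indices together with IRT model-fit tests and discrimination parameters. Criterion-related validity was examined using the Holland artistic subtest, students' self-rated level of artistic creation, and a subscale on past art experience as criteria, and also with the criterion-group method (art vs. non-art majors). Results: All 61 retained items fit the IRT two-parameter logistic model; scale scores correlated significantly with each criterion score, and art and non-art majors differed significantly in their scores. However, test information analysis indicated that measurement error was relatively large for high-ability examinees. Conclusion: The revised scale can measure individuals' art judgment ability; future improvement should focus on adding more difficult items.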

5.
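Validity Analysis of the MBTI Personality Type Inventory   Total citations: 25, self-citations: 0, citations by others: 25
Objective: To examine the content validity, criterion-related validity, and construct validity of the Chinese version of the MBTI personality type inventory, and to provide operational techniques for its application in China. Methods: Participants were 2,123 undergraduate and junior-college students and 276 junior army officers. The instrument was the Chinese revision of the MBTI-G; criterion measures included the EPQ, 16PF, MMPI-2, A-Type, and PM tests. Results: (1) Expert judgment, correlation analysis between the Chinese and English versions, self- and other-ratings, and reliability analysis indicated that the Chinese MBTI has good content validity. (2) The criterion-related validity study found that the EI dimension clearly reflects introverted and extraverted personality characteristics; sensing individuals are gentle, realistic, and cautious, whereas intuitive individuals are dominant, venturesome, and decisive and show moderately strong Type A behavior characteristics; task-oriented (对事型) individuals are steady, calm, dominant, and self-disciplined; judging individuals are sociable and highly socialized, show a strong sense of responsibility, planning, and perseverance, adapt well to new environments, and have a strong sense of achievement. These findings agree with the original MBTI design and with research from other countries. (3) In the factor analysis of the 97 items, the highest loading fell on the primary factor for an average of 82.81%, secondary loadings accounted for 11.02%, and only 6 items had unsatisfactory factor-analytic results. (4) The revised MBTI personality type test showed some correlation with the PM leadership behavior type test; junior commanders in the Chinese military were predominantly of the ESFJ and ISTJ personality types. Conclusion: The Chinese version of the MBTI revised in this study has good content validity, criterion-related validity, and construct validity.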

6.
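Validity Generalization: A Review of 30 Years of Findings from Meta-Analyses of Predictive Validity   Total citations: 3, self-citations: 0, citations by others: 3
Validity generalization is the estimation of generalized predictive validity by means of meta-analytic techniques. Meta-analysis is the technique used to study validity generalization; it is a method for quantitatively synthesizing correlational data of the "predictor-criterion" type. Validity generalization has advanced both theoretical and applied research on the relationship between predictors and criteria, and is one of the most important developments in applied psychology over the past 30 years (1977-2007). Thirty years of validity generalization research show that the predictive validity of cognitive ability tests, knowledge and skills tests, personality tests, structured interviews, and assessment center techniques is characterized by robustness, correspondence, and joint incremental value.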

7.
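Li Jinbo (李金波), Wang Quan (王权). 《心理科学》 (Psychological Science), 2003, 26(5): 885-886
1 Introduction. Test reliability and validity are the two main parameters for gauging the quality of test construction. Both are constrained by many factors, including item difficulty, item discrimination, and the ability distribution of examinees. Using the concept of the information function, IRT offers a concrete method for adjusting test reliability through item parameters, which is a major contribution of IRT to psychological and educational measurement. As for how to improve test validity, however, test items are still selected by experience, and an objective, effective method is lacking. In addition, item difficulty and item discrimination are closely related and jointly affect test validity. Therefore, before studying the relationship between item parameters and test validity, one should first study the relationship between item difficulty and item discrimination. 2 Simulation study of the regression of discrimination on difficulty. 2.1 …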

8.
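A Trial Report on the Self-Directed Search (SDS)   Total citations: 12, self-citations: 0, citations by others: 12
This study revised the 1985 edition of the Self-Directed Search (SDS) and verified its applicability among middle school students in Wuhan. Based on the Chinese translation of the original test, standardization work was carried out, including item revision, item analysis, and reliability and validity testing. The results showed that: (1) the test has good item properties; (2) its homogeneity reliability and split-half reliability both meet the standards generally required of psychological tests; (3) its construct validity and criterion-related validity are also fairly satisfactory; (4) a few items still need further revision, and sampling should be extended nationwide to support wider use. The trial among Wuhan middle school students showed that: (1) the test can serve as an instrument for the vocational guidance of middle school students; (2) using standard scores instead of raw scores in this test is more scientifically sound.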

9.
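The paper sets out the reasons for choosing to develop a children's intelligence test that resembles the world-famous, individually administered Wechsler Intelligence Scale for Children but is administered in groups; discusses the five guidelines that directed the new test and the item selection process; and reports the results of several successive factor analyses of the trial version of the new test, together with other reliability and validity checks.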

10.
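Lu Xiefeng (卢谢峰), Tang Yuanhong (唐源鸿), Wang Mengcheng (王孟成). 《心理科学》 (Psychological Science), 2012, 35(6): 1453-1458
The frame-of-reference effect in personality testing refers to the phenomenon whereby adding a specific reference context to a general personality test improves the test's criterion-related validity. Over the past decade or so, the focus of research on the frame-of-reference effect has gradually shifted from the early collection of validity evidence to the exploration of underlying mechanisms. Researchers have tried to explain the psychometric principles behind the phenomenon through the logical link between the frame of reference and the criterion, and through between-person and within-person variability in the frame of reference. At the construct level, a "hierarchical model of personality and role identity" has been proposed to account for the personality mechanisms of the effect. However, exploration of this topic is still at an early stage; future research could seek breakthroughs in areas such as the operational paradigms for frames of reference and the moderating mechanisms of the frame-of-reference effect.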

11.
Evaluating the ecological validity of neuropsychological tests has become an increasingly important topic over the past decade. In this paper, we provide a comprehensive review of the research on the ecological validity of neuropsychological tests, as it pertains to everyday cognitive skills. This review is presented in the context of several theoretical issues facing ecological validity research. Overall, the research suggests that many neuropsychological tests have a moderate level of ecological validity when predicting everyday cognitive functioning. The strongest relationships were noted when the outcome measure corresponded to the cognitive domain assessed by the neuropsychological tests. Several other factors that may moderate the degree of ecological validity established for neuropsychological tests are in need of further exploration. These factors include the effects of the population being tested, the approach utilized (verisimilitude vs. veridicality), the person completing the outcome measure (significant other vs. clinician), illness severity, and time from injury until evaluation. In addition, a standard measurement of outcome for each cognitive domain is greatly needed to allow for comparison across studies.

12.
Claims of changes in the validity coefficients associated with general mental ability (GMA) tests due to the passage of time (i.e., temporal validity degradation) have been the focus of an on-going debate in applied psychology. To evaluate whether and, if so, under what conditions this degradation may occur, we integrate evidence from multiple sub-disciplines of psychology. The temporal stability of construct validity is considered in light of the evidence regarding the differential stability of g and the invariance of measurement properties of GMA tests over the adult life-span. The temporal stability of criterion-related validity is considered in light of evidence from long-term predictive validity studies in educational and occupational realms. The evidence gained from this broad-ranging review suggests that temporal degradation of the construct- and criterion-related validity of ability test scores may not be as ubiquitous as some have previously concluded. Rather, it appears that both construct and criterion-related validity coefficients are reasonably robust over time and that any apparent degradation of criterion-related validity coefficients has more to do with changes in the determinants of task performance and changes in the nature of the criterion domain than with temporal degradation per se (i.e., the age of the test scores). A key exception to the conclusion that temporal validity degradation is more myth than reality concerns decision validity. Although the evidence is sparse, it is likely that the utility of a given GMA test score for making diagnostic decisions about an individual deteriorates over time. Importantly, we also note several areas in need of additional and more rigorous research before strong conclusions can be supported.

13.
A web-based survey of validity test use by North American neuropsychologists was conducted, with 282 participants meeting inclusion criteria. Respondents indicated that they use a median of one stand-alone performance validity test (PVT), one embedded PVT, and one symptom validity test (SVT) per pediatric assessment. The vast majority of respondents indicated they give at least one PVT (92%) and at least one SVT (88%) during each pediatric assessment. A meaningful difference in validity use (i.e., at least a medium effect size) was only found for those who engage in forensic work, with those clinicians giving more stand-alone PVTs than those who do not conduct forensic work. The most frequently used validity measures in pediatric assessments are presented, as are reasons participants reported for both using and not using validity tests. Limitations and qualitative comparisons to other surveys on validity test use with adults are discussed.

14.
To evaluate the construct validity (convergent and divergent) of Sivik Psycho Somaticism test (SPS) and test of Operationality (OPER), Pearson correlation coefficients between SPS scales and subscales and Karolinska Scheme of Personality (KSP) were calculated. Seventy-eight healthy individuals and 196 psychosomatic patients completed the SPS and OPER tests and KSP. The results show that the SPS and OPER subscales are significantly correlated to most KSP subscales. The correlations were higher for the psychosomatic group than for the normal population. The results confirm the validity of the SPS and OPER constructs.

15.
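Validity Evaluation of a Diagnostic Scale of Language Learning Ability for Preschool Children   Total citations: 1, self-citations: 0, citations by others: 1
Using the scale we developed as the instrument, we carried out a validity analysis of the collected data. The results show that each subtest correlates well with the full scale, indicating that the scale's content validity is relatively high. Factor analysis was used to classify all variables systematically and to study the structure of the scale. The great majority of subtests had communalities greater than 0.70 on the four factors obtained; the retained subtests correlated between 0.53 and 0.84 with the factors to which they belong and had high loadings on those factors, indicating that the scale has good construct validity. Judging from the validity analysis, the scale's measurement results should be accurate. In addition, the subtests were adjusted in the direction indicated by the factor analysis; the adjusted scale structure not only matches the hypothesized structure closely but is also better organized.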

16.
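Research on the Construct Validity of Assessment Centers   Total citations: 8, self-citations: 0, citations by others: 8
Although assessment centers have high predictive validity, their construct validity indicators are less satisfactory; for example, studies commonly find low convergent and discriminant validity. Many factors affect the construct validity of assessment centers, such as rating dimensions (number and type), assessors (training method and type of personnel), assessment methods (situation-oriented characteristics, trait activation potential, and the format of assessment exercises), and systematic observation and evaluation procedures. Starting from these factors, this article reviews research on the construct validity of assessment centers, summarizes measures for improving it, and points out directions for future research.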

17.
Divergent thinking (DT) tests are among the most popular techniques for measuring creativity. However, the validity evidence for DT tests, as applied in educational settings, is inconsistent partly due to different scoring methods. This study explored the reliability and validity issues of various techniques for administering and scoring two DT tests. Results show distinct differences among several methods for scoring these DT tests and suggest that the percentage scoring method (i.e., dividing originality scores by fluency scores) may be the most appropriate scoring strategy. The potential impact on educational research and practice is discussed in detail.

18.
Single-response situational judgment tests (SRSJTs) differ from multiple-response SJTs (MRSJTs) in that they present test takers with edited critical incidents and simply ask test takers to read over the action described and evaluate it according to its effectiveness. Research comparing the reliability and validity of SRSJTs and MRSJTs is thus far extremely limited. The study reported here directly compares forms of a SRSJT and MRSJT and explores the reliability, convergent validity, and predictive validity of each format. Results from this investigation present preliminary evidence to suggest SRSJTs may produce internal consistency reliability, convergent validity, and predictive validity estimates that are comparable to those achieved with many traditional MRSJTs. We conclude by discussing practical implications for personnel selection and assessment, and future research in psychological science more broadly.

19.
To evaluate the construct validity (convergent and divergent) of the Sivik Psycho Somaticism test (SPS) and test of Operationality (OPER), Pearson correlation coefficients between SPS scales and subscales, OPER and Minnesota Multiphasic Personality Inventory (MMPI) subscales Hypochondria (Hs), Depression (D), Hysteria (Hy) and Alexithymia (Al) were calculated. Eighty-eight healthy individuals and 285 psychosomatic patients completed the SPS and OPER tests and MMPI; Hs, D, Hy and Al. The results show that most of the SPS subscales and OPER are significantly correlated to several MMPI subscales in both a normal and a psychosomatic population. The results are in concordance with the theoretical hypotheses and confirm the validity of the SPS and OPER constructs.

20.
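The Comprehensive System has been very successful in standardizing and objectifying the Rorschach inkblot technique (RIM), but some critics have still questioned its reliability, validity, norms, and standardization. The authors systematically review the focal issues in these debates. Summarizing the relevant meta-analyses and experimental studies, they find that most core RIM variables have good interrater reliability and stability over time; only the variables reflecting psychological states have low reliability. Previous global and focused meta-analyses have both found that the RIM is not inferior to other tests. These results indicate that the RIM is valuable both for diagnosing clinical disorders and for predicting overt behavior. Like other tests, the RIM has its strengths and weaknesses, and in practice it should be used to complement other tests.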

