首页 | 本学科首页   官方微博 | 高级检索  
     

认知诊断计算机自适应测验中平衡属性收敛的新方法
引用本文:孙小坚 王钰彤 张世夷 辛涛. 认知诊断计算机自适应测验中平衡属性收敛的新方法[J]. 心理科学, 2005, 0(5): 1236-1244
作者姓名:孙小坚 王钰彤 张世夷 辛涛
作者单位:1. 北京师范大学;2. 西南大学;
摘    要:提出两种认知诊断计算机自适应测验下平衡属性收敛的新方法(MABI、RTA),模拟研究系统探讨和比较了此二者与已有方法(ABI、IABI和RABI)的表现。结果发现:(1)新方法较不考虑属性收敛的方法有更高的准确率以及更均衡的题目使用率;(2)新方法较ABI和RABI有稍低的准确性,但有更平衡的题目使用率;(3)新方法与IABI的准确性和题目使用率在不同选题策略下各有合优势。总之,两种新方法较好地兼顾测量准确性、题目使用率以及题库曝光情况。

关 键 词:认知诊断计算机化自适应测验  属性收敛  分类准确性  题目使用率  
收稿时间:2018-08-04

New Methods to Balance Attribute Coverage for Cognitive Diagnostic Computerized Adaptive Testing
Abstract:The focus on cognitive diagnosis assessment (CDA) has become particularly intense in test theory research in recent years, which can provide detailed information about the strengths and weaknesses of examinees for specific content domains. Meanwhile, computerized adaptive testing (CAT) can provide equivalent or even higher accuracy in the measurement of an examinee’s latent skills, but with reductions in test length of up to 50%, compare to traditional paper-and-pencil testing. Recently, aiming to maximize the benefits of both CDA and CAT, researchers have attempted to combine the two, and naming cognitive diagnostic computerized adaptive testing (CD-CAT).During CD-CAT, many factors can affect the reliability and validity of the test, one of which is the balance of attribute coverage. It is very important to make sure that each attribute is measured adequately, or the reliability of the test will be reduced. Therefore, researchers have developed some attribute balance indices (ABIs) to satisfy the attribute coverage. While a shortcoming of both the ABI and revised ABI (RABI) is that they tend to select items that measuring single attribute, therefore, items measuring single attribute will be overused, as a consequence, an uneven distribution of item exposures will be raised. The improved ABI (IABI), on the contrary, inclines to select items that measuring all attributes even when the minimum number of items that measuring some specific attribute are satisfied. To overcome the shortcomings of these ABIs in some degree, two new attribute coverage control methods?modified ABI (MABI) and ratio of test length to the number of attributes (RTA)? are proposed in current study.To examine the performance of MABI and RTA, a Monte Carlo simulation was conducted. Five factors are manipulated: Number of attributes (4 and 6), test length (20 and 30 items), attribute coverage control method (without attribute coverage control [Non], ABI, IABI, RABI, MABI, and RTA), and item selection method (KL, PWKL, MI, and MPWKL methods), and model type (DINA, RRUM). There are 2 × 2 × 4 × 6 × 2 = 192 conditions in current study, of these, attribute coverage control method and item selection methods are within-group variable, and the rest are between-group variables. In addition, the covariates include number of items in the item bank (500 items), number of individuals, and distribution of item parameters. Furthermore, the minimum items measuring each attribute (Bk) are fixed as 4. The evaluate criteria are pattern correct classification rate (PCCR), attribute correct classification rate (ACCR), and the usage of k-attribute items. The results show that: (a) Methods with attribute coverage control (ABI, IABI, RABI, MABI, and RTA), in general, perform better than the method that without attribute coverage control (Non). (b) The ABI and RABI method produces higher PCCR and ACCR than MABI and RTA method in most conditions, while produces more uneven item usage than MABI and RTA. (c) The performance among IABI, MABI, and RTA are twisted each other under different conditions. (d) The IABI, MABI and RTA methods can deal with the trade-off between correct classification rate and item usage quite well.
Keywords:cognitive diagnostic computerized adaptive testing   attribute coverage   correct classification rate   item usage  
点击此处可从《心理科学》浏览原始摘要信息
点击此处可从《心理科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号