The use of empirical prior information about participants has been shown to substantially improve the efficiency of computerized adaptive tests (CATs) in educational settings. However, it is unclear how these results translate to clinical settings, where small item banks with highly informative polytomous items often lead to very short CATs. We explored the risks and rewards of using prior information in CAT in two simulation studies rooted in applied clinical examples. In the first simulation, prior precision and bias in the prior location were manipulated independently. Our results show that a precise personalized prior can meaningfully increase CAT efficiency. However, this reward comes with the risk of overconfidence in wrong empirical information (i.e., using a precise but severely biased prior), which can lead to unnecessarily long tests or severely biased estimates. The latter risk can be mitigated by setting a minimum number of items to be administered during the CAT, or by setting a less precise prior, albeit at the expense of canceling out any efficiency gains. The second simulation, with more realistic combinations of bias and precision in the empirical prior, places the prevalence of these risks in context. With similar estimation bias, an empirical prior reduced CAT length relative to a standard normal prior in 68% of cases, by a median of 20%, while test length increased in only 3% of cases. The use of prior information in CAT seems to be a feasible and simple method to reduce test burden for patients and clinical practitioners alike.
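To make the role of the prior concrete, the following is a minimal sketch of an expected a posteriori (EAP) ability estimate under a normal prior. It is not the authors' simulation code. For brevity it assumes dichotomous two-parameter logistic (2PL) items rather than the polytomous items the abstract describes, and the function name `eap`, the grid range, and all parameter values are hypothetical.

```python
import math

def eap(responses, items, prior_mean=0.0, prior_sd=1.0, n_grid=121):
    """EAP estimate and posterior SD on a fixed grid (illustrative sketch).

    responses: list of 0/1 item scores.
    items: list of (a, b) pairs, assumed 2PL discrimination and difficulty.
    prior_mean, prior_sd: location and spread of the normal prior; a
    personalized empirical prior would replace the defaults (0, 1).
    """
    grid = [-6.0 + 12.0 * i / (n_grid - 1) for i in range(n_grid)]
    weights = []
    for theta in grid:
        # Log of the unnormalized normal prior density.
        log_w = -0.5 * ((theta - prior_mean) / prior_sd) ** 2
        # Add the log-likelihood of each observed response.
        for u, (a, b) in zip(responses, items):
            p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
            log_w += math.log(p if u == 1 else 1.0 - p)
        weights.append(math.exp(log_w))
    total = sum(weights)
    mean = sum(t * w for t, w in zip(grid, weights)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(grid, weights)) / total
    return mean, math.sqrt(var)

# Same three responses, two priors: a standard normal and a more precise one.
items = [(1.5, 0.0)] * 3
responses = [1, 0, 1]
print(eap(responses, items, prior_mean=0.0, prior_sd=1.0))
print(eap(responses, items, prior_mean=0.0, prior_sd=0.5))
```

With the more precise prior the posterior standard deviation is smaller after the same responses, so a CAT that stops at a fixed standard-error threshold would stop sooner. The same mechanism pulls the estimate toward the prior mean, which is the source of the bias risk when the prior location is wrong.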
Test equating is a statistical procedure to ensure that scores from different test forms can be used interchangeably. Several equating methodologies are available, some based on the Classical Test Theory (CTT) framework and others on the Item Response Theory (IRT) framework. This article compares equating transformations originating from three different frameworks, namely IRT Observed-Score Equating (IRTOSE), Kernel Equating (KE), and IRT Kernel Equating (IRTKE). The comparisons were made under different data-generating scenarios, including a novel data-generation procedure that allows the simulation of test data without relying on IRT parameters while still providing control over some test score properties, such as distribution skewness and item difficulty. Our results suggest that IRT methods tend to provide better results than KE even when the data are not generated from IRT processes. KE might provide satisfactory results if a proper pre-smoothing solution can be found, while also being much faster than IRT methods. For daily applications, we recommend checking the sensitivity of the results to the equating method, keeping in mind the importance of good model fit and of meeting the assumptions of the framework.
Drawing tests in children diagnosed with dyslexia and dysgraphia were quantitatively compared. Fourteen children with dysgraphia, 19 with dyslexia, and 13 normally developing children were asked to copy three figures: a circle, a square, and a cross. An optoelectronic system allowed the acquisition of the drawing track in three dimensions. The participants’ head position and upper limb movements were measured as well. A set of parameters, including movement duration, velocity, length of the trace, and range of motion of the upper limb, was computed and compared among the three groups. Children with dyslexia traced the circle faster than the other groups. In the cross test, participants with dyslexia showed a reduced execution time and increased velocity while drawing the horizontal line. Children with dyslexia were also faster than the other groups in drawing certain sides of the square.
Computerized classification testing (CCT) aims to classify persons into one of two or more possible categories to make decisions such as mastery/non-mastery or meet most/meet all/exceed. A defining feature of CCT is its stopping criterion: the test terminates when there is enough confidence to make a decision. There is abundant research on CCT with a single cut-off, for which two common stopping criteria are the sequential probability ratio test (SPRT) statistic and the generalized likelihood ratio (GLR) statistic. However, there is a relative scarcity of research extending the SPRT to the multi-hypothesis case, in which there is more than one cut-off. In this paper, we propose a new multi-category GLR (mGLR) statistic as well as a stochastically curtailed version for CCT with three or more categories. A simulation study showed that the mGLR statistic outperformed the existing stopping rules by generating a shorter average test length without sacrificing classification accuracy. Results also revealed that the stochastically curtailed mGLR successfully increased test efficiency in certain testing conditions.
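As background for the single cut-off case mentioned above, the following is a minimal sketch of an SPRT stopping rule. It does not implement the proposed mGLR statistic or its stochastically curtailed version. It assumes dichotomous 2PL items and an indifference region of half-width `delta` around the cut-off; the function names and all default values are hypothetical.

```python
import math

def p_correct(theta, a, b):
    """Assumed 2PL probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def sprt_decision(responses, items, cut, delta=0.3, alpha=0.05, beta=0.05):
    """Classify against a single cut-off with Wald's SPRT (illustrative).

    Tests H0: theta = cut - delta against H1: theta = cut + delta.
    Returns "above", "below", or "continue" (administer another item).
    """
    theta_lo, theta_hi = cut - delta, cut + delta
    llr = 0.0  # running log-likelihood ratio, log L(theta_hi) / L(theta_lo)
    for u, (a, b) in zip(responses, items):
        p_hi = p_correct(theta_hi, a, b)
        p_lo = p_correct(theta_lo, a, b)
        if u == 1:
            llr += math.log(p_hi / p_lo)
        else:
            llr += math.log((1.0 - p_hi) / (1.0 - p_lo))
    upper = math.log((1.0 - beta) / alpha)  # cross it: decide "above"
    lower = math.log(beta / (1.0 - alpha))  # cross it: decide "below"
    if llr >= upper:
        return "above"
    if llr <= lower:
        return "below"
    return "continue"

items = [(1.5, 0.0)] * 20
print(sprt_decision([1] * 20, items, cut=0.0))
print(sprt_decision([1, 0], items[:2], cut=0.0))
```

The test stops as soon as the log-likelihood ratio leaves the interval between the two thresholds, which is what allows a CCT to end early for examinees far from the cut-off.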
Functional fixedness involves difficulty with conceptualizing creative object uses. When it obstructs problem-solving, individuals must reframe their approach. We examined how two training techniques, chunk decomposition (i.e., considering an object’s basic parts and physical properties) and constraint relaxation (i.e., considering an object’s different functions), might rely upon different routes to creative reframing. Additionally, we investigated how different forms of cognitive load interact with these dual routes. Participants learned one of three techniques. Chunk decomposition participants created object breakdown diagrams; constraint relaxation participants created lists of object functions; and free association (control) participants wrote a word that they associated with each of several concrete nouns. After training, participants attempted to solve five functional fixedness problems. Experiment 1 investigated how increasing germane cognitive load via either direct or indirect prompting affected training transfer. Experiment 2 investigated how reducing extraneous cognitive load, by providing no transfer instructions and using an eye-closure strategy, affected training transfer. Across both experiments, results supported differences in accuracy and response latency by training. However, chunk decomposition and constraint relaxation did not follow the same pattern, suggesting that the effect arises through different mechanisms. We discuss possible applications to increase innovation in real-world domains such as education, business, and engineering.
Although research has identified effective evidence-based depression prevention interventions for diverse youth, little is known about how the intervention process unfolds with immigrant family youth. This study utilized a qualitative approach to explore cultural and clinical differences in the implementation of Interpersonal Psychotherapy-Adolescent Skills Training (IPT-AST) in two schools, one serving youth from primarily immigrant, Asian American families and the other serving youth from mostly nonimmigrant, non-Hispanic White families. A total of 131 IPT-AST sessions were audio recorded, transcribed, and coded for the presence and patterns of cultural and clinical constructs. Results revealed that sessions with immigrant family youth were more likely to contain discussions of interpersonal problems characterized by estrangement, goals of spending time together with important others, mentions of emotion suppression and academic achievement expectations, conversations about acculturation, differences in value orientation, and discomfort with implementing new intervention skills. Dialogue from interventionist and youth exchanges is presented to illustrate how these themes emerged and were addressed by interventionists in a culturally responsive manner. The study highlights how IPT-AST with immigrant family and Asian American youth may unfold differently compared to youth from nonimmigrant families. Implications of findings for providers are discussed.