首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In Experiment I a group of pupils from a secondary school was given a test of general intelligence, a test of the ability to categorize objects in a flexible way, and five different problem-solving tasks. Subjects who were successful on the problems had higher scores on the intelligence test than the rest. The Categorizing Test was not, however, a good indicator of success. In Experiment II a comparison was made between scores on different parts of the so-called Family Test. With high school pupils and university students as subjects, correlation coefficients in the neighborhood of 0.40 were found between scores when suggesting possible classifications of objects, and scores when choosing a classification to fit different sets of objects. The triangular form of the scatterplots was taken as an indication that an ability to present different interpretations of one's experience is a necessary, but not sufficient condition for success in problem-solving tasks.  相似文献   

2.
The assessment of multiliterate handwriting performance is rarely reported despite increased globalization. The present study describes the psychometric properties of a handwriting speed test developed for children who are biliterate in English and Chinese. This included interrater reliability, test-retest reliability, interitem correlation, construct validity, and concurrent validity. The test's reliabilities between two raters and over a 1-wk. interval were high with ICCs ranging from .89 to .99. Interitem correlation between the English and Chinese items was .87. The presence of age trends but not sex differences was a positive indicator of the test's validity. Correlations of .91 and 1.00 between the Chinese and the English items of the Handwriting Assessment Tool with the Chinese Handwriting Speed Test and Handwriting Speed Test, respectively, provided evidence of concurrent validity. These preliminary results showed the Handwriting Assessment Tool is reliable and is a potentially useful handwriting test for children biliterate in English and Chinese. The feasibility of assessing biliterate handwriting speed performance with the same set of scoring criteria for different writing systems was supported.  相似文献   

3.
Studied the reliability of the Washington University Sentence Completion Test by giving 51 9th graders and 26 college students the test twice, a week apart. For 9th graders the design included a test-retest group and two groups given half of the test at each session. Although test-retest correlations were high for the 9th graders, retest scores dropped significantly. With college students (a) test-retest correlations through positive and significant were lower, (b) retest scores did not change systematically, and (c) percentage agreement between test and retest scores was high. Discrepant results were related to motivational set and variance in test scores. Split-half correlations and internal consistency coefficients were high. Likelihood of lower retest scores makes problematic the use of this test for short term pretest-posttest studies seeking to stimulate ego development.  相似文献   

4.
Sarason's Test Anxiety Scale, translated into an Ethiopian language, was administered to 391 students in Grade 8 and to 422 students in preparatory school (Grades 11 and 12). In the first sample, 32 items loaded above the 0.3 criterion of acceptable item-remainder correlations and Cronbach alpha of .84. In the second sample, Cronbach alpha was .84 for the 34 items, but only 19 items had acceptable item-remainder correlations. The internal consistency reliabilities were comparable with those reported in the literature. However, the results of confirmatory factor analyses with extraction of four factors did not confirm the item loadings on factors as reported in the literature. Younger students (Grade 8) were found to have higher mean Test Anxiety than Grades 11 and 12 students. The Amharik version of the Test Anxiety Scale as a whole could be considered reliable and useful for Ethiopian students.  相似文献   

5.
应征公民计算机自适应化拼图测验的编制   总被引:1,自引:0,他引:1  
在文献回顾和参考外军有关资料的基础上,根据项目反应理论和空间能力测验的有关理论编制试题库。首先采用纸笔测验的形式进行预实验,探讨采用IRT理论编制CAT拼图测验的可行性。然后,在预实验的基础上对试题进行修订并扩充试题数量,编制计算机辅助测验。选择三参数Logistic模型,采用铆题等值设计,分7份不同的试卷在全国征兵心理检测的过程中对55777名应征公民进行施测。根据测试结果,对题目进行分析,选择高质量的题目构成CAT试题库,采用a系数分层抽样的方法控制曝光率,并采用不同的测验终止策略编制CAT拼图测验。最后用WAIS智力测验积木分测验和三门功课的考试成绩为效标,通过72名被试对CAT拼图测验进行效度验证。结果显示该测验符合项目反应理论三参数Logistic模型的假设,各题目参数比较理想,所编制的测验具有较好的信度和效度,可用于应征公民心理选拔的实践  相似文献   

6.
《创造力研究杂志》2013,25(1):89-95
An alternative method for testing creativity was investigated. A multiple-choice paper-and-pencil test called the Abedi–Schumacher Creativity Test (CT) was developed in an attempt to shorten the amount of time required for the administration and scoring of creativity tests. This instrument was translated into Spanish and was administered with the Torrance Tests of Creative Thinking (TTCT) and the Villa and Auzmendi Creativity Test (VAT) to 2,270 students in Spain. Teacher ratings of student creativity were also examined. Significant but low correlations were found between the 4 CT subscale scores and the students' academic achievement measures and TTCTsubscale scores. The reliability coefficient of the CT subscale scores was at an acceptable level; however, the correlation coefficients between the CT and the TTCT and VAT were moderate or higher. The results of this study have influenced further modification of the CT test items.  相似文献   

7.
The purpose of this study was to examine whether the Test of Self-Conscious Affect (TOSCA; Tangney, J. P., Wagner, P. E., & Gramzow, R. (1989). The Test of Self-Concious Affect. Fairfax, VA: George Mason University) measures maladaptive forms or aspects of guilt and adaptive aspects of shame that have been described in the literature. First, a judgmental and logical analysis showed that the TOSCA primarily measures mild and adaptive forms and aspects of guilt and maladaptive aspects of shame. Next, principal components analyses (PCAs) in a student (N=328) and adult (N=542) sample showed that items that had a high loading on the guilt factor primarily were items that referred to reparative behavior, while items that had high loadings on the shame factor consisted primarily of items that referred to low self-esteem. To investigate to which extent these items were responsible for correlations found with the TOSCA, we constructed a revised guilt scale containing only items that referred to reparative behavior and a revised shame scale consisting of items that only referred to negative self-esteem, and related these to indices of interpersonal and intrapersonal functioning. The revised TOSCA scales reproduced both the pattern and magnitude of correlations obtained with the original TOSCA scales. Thus, taken together, the results of this study support the interpretation of the TOSCA guilt scale as a measure of mild and adaptive forms of guilt and the TOSCA shame scale as a measure of maladaptive aspects associated with shame. Implications of these findings for further research on the nature of guilt and shame are discussed.  相似文献   

8.
This study describes the construction and validation of a Japanese adaptation of Spielberger's (1980) Test Anxiety Inventory (TAI) and presents evidence of the reliability and validity of this new instrument. The items for the Japanese TAI (TAI-J) were selected on the basis of content validity and itemremainder correlations, which were .40 or higher for both sexes. Alpha reliability coefficients for the TAI-J Total scores were .90 or higher for both high school and college students; test-retest stability over a three-week interval was .89. Mean TAI-J Total scores for Japanese high school females were significantly higher than those of Japanese college students, whose scores were slightly lower than those reported for American undergraduates. TAI-J Total scores correlated .72 with a Japanese trait anxiety measure. Significant negative correlations were found between TAI-J Total scores and measures of academic achievement. Implications of these results for the measurement of test anxiety in Japanese students are discussed from a cross-cultural perspective.  相似文献   

9.
自编235个图形推理测验题目。采用铆测验等值设计,以72个联合型瑞文测验题目为铆题,对初中到大学各能力层次的1733名男性进行了测验。使用BILOG MG3.0(边际极大似然估计)对实测数据进行了分析,采用Logsitic 3参数模型。剔除数据与模型拟合不好的题目以及信息函数最大值小于0.3的题目,最终建立一个包含181道题目的题库。该题库可以用于淘汰智力较低的应征青年  相似文献   

10.
This investigation extends stereotype threat research by examining its effect on sex and ethnicity in employment testing. Study 1 used archival data from a company‐specific cognitive ability test, demonstrating score increases (greater for some minorities) when applicants completed demographic items after a cognitive skills test, rather than before the test. The first experiment (Study 2) used college students taking the same company‐specific test with inconclusive results. Possible reasons dictated the design of a second experiment with a college population (Study 3). This study used the Wonderlic Personnel Test (WPT) and investigated priming through positioning of demographic items, as well as high/low threat. Positioning of the demographic items after the WPT was associated with an increase in test scores.  相似文献   

11.
Most objectively scored tests use items with easily identifiable correct answers. When such veridical scoring keys cannot be constructed, researchers sometimes use proportion consensus scoring (PCS) to identify the best answers. To determine if PCS identifies the best answers, we scored a test using both PCS and veridical scoring. Among 353 undergraduates, regular PCS, two-stage PCS, and expert PCS all had high correlations for easy items, but no PCS methods had high correlations for difficult items. Thus, PCS cannot reliability identify the best answers to individual items. However, PCS worked well for total scores. For easy items, total scores had correlations above .99 for all PCS methods. For difficult items, expert and two-stage PCS had correlations of .92 and .82 for the 60-item test. Thus, expert and two-stage PCS can be justified (even for difficult items) if the scoring key is based upon people who truly possess some degree of expertise and if scores are summed over many items.  相似文献   

12.
Responses to the Minnesota Multiphasic Personality Inventory (MMPI) were assessed with respect to their relevance to schema theory. The relation between scores on self-reported personality dimensions and the speed of processing test items associated with each dimension was examined. With previously derived factor analytic content scales, negative correlations were obtained between scale scores and mean latencies for endorsing relevant items, and positive correlations were found between scale scores and mean latencies for rejecting relevant items. A similar analysis completed on the traditional clinical scales revealed no such pattern. Results were interpreted as supporting the conceptualization of item responding as a content-based, schema-relevant process.  相似文献   

13.
Responses to the Minnesota Multiphasic Personality Inventory (MMPI) were assessed with respect to their relevance to schema theory. The relation between scores on self-reported personality dimensions and the speed of processing test items associated with each dimension was examined. With previously derived factor analytic content scales, negative correlations were obtained between scale scores and mean latencies for endorsing relevant items, and positive correlations were found between scale scores and mean latencies for rejecting relevant items. A similar analysis completed on the traditional clinical scales revealed no such pattern. Results were interpreted as supporting the conceptualization of item responding as a content-based, schema-relevant process.  相似文献   

14.
To help clarify the relationship between the Matching Familiar Figures (MFF) Test and measures of school achievement and adjustment, 450 children from kindergarten, second, and fifth grades were tested and achievement scores and teacher's ratings of classroom and personal adjustment were obtained. Regardless of grades, MFF latency in most instances did not predict achievement or adjustment. At the kindergarten level the correlations between MFF errors and both achievement and adjustment were significant for most subscales but at second grade correlations were lower and generally nonsignificant except for a few adjustment items. At the fifth grade level the correlations were moderately high but when IQ was partialled out the correlations with achievement and school adjustment dropped to near zero; however, partialling out IQ did not affect the moderate correlations between errors and personal adjustment. The errors score evidencedmore important relationships than the latency scores. The relationship between the MFF and ratings of social and emotional adjustment was consistently higher than the correlations between the MFF and achievement. These results suggest that MFF errors are personalogically relevant and that MFF performance is generally more related to adjustment than achievement.  相似文献   

15.
(1) The study concerns Rorschach, Children Apperception Test, Bender Gestalt Test and Draw-a-Man. Data have been obtained on a number of personality variables from 12 pairs of identical twins at the ages of 9;6–10;6. (2) For the Ro 17 out of 21 and for the CAT 15 out of 23 intraclass coefficients are positive. The coefficients for the Gestalt test and Draw-a-Man are high and positive. It is argued that intraclass coefficients must be considered as underestimating both the true reliability of the ratings and the correlation between true scores for identical twins. (3) The intertest correlations give support to the construct validity of the ratings on the variables studied. Especially favourable are the intercorrelations for 'General adjustment'.  相似文献   

16.
以青少年学业情绪问卷为基础,结合物理学科特点对问卷项目进行修改,修订中学生物理学业情绪问卷.以314名初二到高二学生为被试进行预测,通过验证性因素分析对问卷项目进行调整,最终问卷包括67个项目.经检验问卷的内部一致性信度、分半信度和效标效度符合测量学的要求.在此基础上,选用728名初二到高三学生为被试.结果发现:(1)总体来看,男生,尤其是来自重点中学的男生,其积极物理学业情绪要高于女生,而女生的消极物理学业情绪要高于男生;(2)重点中学的初中生的积极低唤醒物理学业情绪得分显著高于高中生,且重点中学的初中生的消极低唤醒物理学业情绪得分显著低于高中生,而普通中学则正好相反;(3)在物理学业情绪中,积极高唤醒和积极低唤醒的所有因子与物理学业成就皆呈显著正相关,而消极高唤醒(仅羞愧)和消极低唤醒(厌倦、无助、心烦-疲乏)与物理学业成就呈显著负相关.  相似文献   

17.
Maximizing the discriminating power of a multiple-score test involves maximizing the homogeneity of each subtest and minimizing the correlations between subtests. A method is presented for constructing such tests from items whose intercorrelations are not too high. Under certain restrictions the saturation, defined as the proportion of inter-item covariance to total variance, is maximized for each subtest. The nucleus of each subtest is three items with high covariancesinter se. All items which will lower the saturation are discarded; the one item is added which will maximize the saturation of the resultant test. This process is repeated until all the items are included or discarded for that subtest. If the correlation between any such subtests approaches the geometric mean of their saturations, their items form a new pool for one or more subtests. Formulas are presented for deciding which items to eliminate in order to reduce further the correlations between subtests.This research was supported in part by the United States Air Force under Contract AF 33(038)-10588 with Human Resources Research Center, Lackland Air Force Base, San Antonio, Texas. Permission is granted for reproduction, translation, publication use and disposal in whole and in part by or for the United States Government.  相似文献   

18.
A test theory using only ordinal assumptions is presented. It is based on the idea that the test items are a sample from a universe of items. The sum across items of the ordinal relations for a pair of persons on the universe items is analogous to a true score. Using concepts from ordinal multiple regression, it is possible to estimate the tau correlations of test items with the universe order from the taus among the test items. These in turn permit the estimation of the tau of total score with the universe. It is also possible to estimate the odds that the direction of a given observed score difference is the same as that of the true score difference. The estimates of the correlations between items and universe and between total score and universe are found to agree well with the actual values in both real and artificial data.Part of this paper was presented at the June, 1989, Meeting of the Psychometric Society. The authors wish to thank several reviewers for their suggestions. This research was mainly done while the second author was a University Fellow at the University of Southern California.  相似文献   

19.
The factor structures of two recently developed measures of emotional intelligence, the Situational Test of Emotional Understanding and Situational Test of Emotion Management (STEU, STEM; MacCann & Roberts, 2008) were examined. The results did not support a factor structure of either measure’s subscales indicated by the approach used in developing the test items, and examination of the factors obtained using parallel analysis to determine the number of factors to extract did not yield interpretable factors. These findings suggest that only total scale scores should be used for these tests, although the general factor extracted from the items was not strong for either test; further development work on these tests is indicated.  相似文献   

20.
Visual perceptual skills of school-age children are often assessed using the Supplemental Developmental Test of Visual Perception of the Developmental Test of Visual-Motor Integration. The study purpose was to consider the construct validity of this test by evaluating its scalability (interval level measurement), unidimensionality, differential item functioning, and hierarchical ordering of its items. Visual perceptual performance scores from a sample of 356 typically developing children (171 boys and 185 girls ages 5 to 11 years) were used to complete a Rasch analysis of the test. Seven items were discarded for poor fit, while none of the items exhibited differential item functioning by sex. The construct validity, scalability, hierarchical ordering, and lack of differential item functioning requirements were met by the final test version. Since 7 test items did not fit the Rasch analysis specifications, the clinical value of the test is questionable and limited.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号