首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In a computer simulation study, random samples from a uniform density were substituted for each of two independent samples from normal and various nonnormal densities. This procedure was compared with conventional ranking and with Bell and Doksum's (1965) procedure, which substituted random normal deviates for initial sample values. After performing the Student t test, the program transformed the initial scores and performed additional t tests on ranks, random uniform scores, and random normal scores. For several distributions, the test on random normal scores was more powerful than the others, consistent with known asymptotic results. The probabilities of Type I and Type II errors of the test on random uniform scores were nearly the same as those of the Mann-Whitney-Wilcoxon test, for all distributions examined.  相似文献   

2.
A sample of 120 children (69 boys and 51 girls), ranging in age from 9 to 14 yr., was administered the Harter Self-perception Profile for Children before and after a 5-wk. program of swimming instruction. It is predicted that children who improve most in swimming will also have the largest gains in athletic self-concept. The data supported the prediction for an association between gains in swimming skill and self-concept for certain skill groups.  相似文献   

3.
Eating knowledge, nutritional knowledge, and psychological changes among female collegiate dancers were examined before and after a 4-wk. team-centered program on sport nutrition, exercise, and disordered eating consequences. Collegiate female dancers from two NCAA Division I institutions participated in a control (n = 19; M age = 19.1 yr., SD = 1.0) or intervention (n = 21; M age = 19.2 yr., SD = 1.2) group. Measures were administered to both groups before and after intervention to assess eating disorders, depression, and nutritional and disordered eating knowledge. There was a statistically significant increase in scores on nutritional and overall eating disorder knowledge in the intervention group compared to the control group. Mean scores on depression, drive for thinness, body dissatisfaction, and maturity fears decreased in the intervention group.  相似文献   

4.
The Type I error probability and the power of the independent samples t test, performed directly on the ranks of scores in combined samples in place of the original scores, are known to be the same as those of the non‐parametric Wilcoxon–Mann–Whitney (WMW) test. In the present study, simulations revealed that these probabilities remain essentially unchanged when the number of ranks is reduced by assigning the same rank to multiple ordered scores. For example, if 200 ranks are reduced to as few as 20, or 10, or 5 ranks by replacing sequences of consecutive ranks by a single number, the Type I error probability and power stay about the same. Significance tests performed on these modular ranks consistently reproduce familiar findings about the comparative power of the t test and the WMW tests for normal and various non‐normal distributions. Similar results are obtained for modular ranks used in comparing the one‐sample t test and the Wilcoxon signed ranks test.  相似文献   

5.
The purpose of the research was to conduct a validation study of a transferred group intelligence test. The Lorge-Thorndike Intelligence Test, Level A, Form 1 was translated to Hebrew and administered to an Israeli sample comparable to the original norming group. The results obtained were concerned with variability, reliability, validity, item analysis, intercorrelations among subtests, and factor analysis. While few comparisons of the Israeli sample with the original data were made, the validation of the transferred test was examined in the light of other criteria for evaluating a new test. The suitability of the transferred test for Israeli high achievers from high-status families was supported by the results. The major conclusions were: (1) the total mean IQ scores of Israeli students were approximately the same as the American mean; (2) mean raw scores increased with age; (3) relative difficulty of individual items remained stable through translation and administration in Israeli culture, in three out of five subtests; (4) it appears reasonable to suggest that the factor structure of the translated test is consistent with the intent of the original.  相似文献   

6.
Reliability generalization (RG) is a meta-analytic technique that allows for the systematic examination of variation in score reliability for different samples of test takers; this procedure is based on the recognition that reliability is not a stable property of a test but is sample dependent. As a demonstration of an RG analysis, I obtained 63 reliability coefficients for each of the MMPI-2 (Butcher et al., 2001) Personality Psychopathology 5 (Harkness, McNulty, & Ben-Porath, 1995) scales. The overall variability of alpha coefficients supports the argument that reliability is sample dependent and underscores the need for researchers to calculate reliability estimates based on their research samples rather than simply citing published alpha coefficients as evidence of score reliability. I observed statistically significant mean reliability differences for scores across the 5 scales, with the highest level of reliability observed for scores on the measure of Negative Emotionality and the lowest levels of reliability observed for scores on the measures of Aggression and Disconstraint. There was no evidence that the sex-composition of a sample was systematically related to score reliability, and there were no statistically significant differences in reliability between scores obtained with the English version of the test and those obtained with translated forms. However, reliability was consistently lower for scores on some scales when the data were obtained in nonclinical settings as opposed to clinical ones. Sample size was not significantly correlated with reliability estimates. RG methods have the potential for deepening the level of understanding about the role of reliability in the evaluation and use of personality tests.  相似文献   

7.
A 12-wk. study was conducted to contrast the effects of a longer and more frequent aerobic exercise protocol with a shorter and less frequent circuit strength-training protocol for improving U.S. Air Force physical fitness test scores of subjects who previously failed to achieve a passing point total. 83 men and women of the U.S. Air Force (M age = 32.7 yr.) participated in either the unsupervised standard conditioning program, which recommended approximately 60 min. of aerobic activity 4 to 5 days per week (n=26), or the supervised circuit strength-training program, which required approximately 25 min. of alternating strength and endurance exercises 3 days per week (n=57). Subjects were assessed on a 2400-m (1.5-mile) run, abdominal circumference, push-ups completed in 1 min., and abdominal crunches completed in 1 min. Dependent t tests with Bonferroni adjustment indicated that significant improvements were attained by the circuit strength-training group only on each of the aforementioned measures. Significantly more participants in the circuit strength-training group (26%) achieved a passing point total than in the standard conditioning group (19%) at Wk. 12 (chi1(2) =3.96, p = .05). Implications for enhancing physical fitness in poorly conditioned adults were discussed.  相似文献   

8.
WAIS test-retest reliabilities were calculated for a clinical out-patient sample with testing intervals varying from 1 to 10 yr. There was no relationship between test-retest interval and the stability of test scores and the correlations between IQs were quite satisfactory (Full Scale IQ = .897, Verbal IQ = .906, Performance IQ = .876). Reliabilities remained high even when the sample was divided by diagnosis into organic, neurotic, personality disorder, and schizophrenic subgroups.  相似文献   

9.
HAMILTON CH 《Psychometrika》1950,15(2):151-168
A formula for estimating real scores on a multiple-choice test from a knowledge of raw scores is derived. This formula does not involve the assumption of a binomial distribution of real scores as does the Calandra formula. Other important formulas derived show: the variance of real scores in terms of the variance of raw scores and the correlation between real scores and raw scores. If the variance of real scores (or of raw scores also) is binomial, the regression of real scores on raw scores is linear; but, otherwise the regression is curvilinear. Yet the linear estimating formula is a close approximation to the curvilinear relationship. Factors affecting the regression of real scores on raw scores and the correlation coefficient are: (1) the number of choices per question; (2) the number of questions answered; (3) the ratio of the average group raw score to the variance of raw scores.  相似文献   

10.
Fifty items from Goldberg's International Personality Item Pool were compiled to form a public‐domain measure of personality, the Australian Personality Inventory (API). Data from a random community sample (N = 7615) and a university‐based sample (N = 271) were used to explore psychometric properties of this 50‐item measure of the five‐factor model of personality (FFM). In both samples, internal reliabilities were adequate. In the university‐based sample an appropriate pattern of convergent and divergent relationship was found between scale scores and domain scores from the NEO Five‐Factor Inventory. After adjusting for an apparent response set (mean response across items), exploratory factor analyses clearly retrieved the FFM in both samples. It is provisionally concluded that raw scale scores from the API provide reliable estimates of the FFM, but adjustment for mean response across the 50 items might clarify the five‐factor structure, especially in less educated samples.  相似文献   

11.
This study assessed the intercorrelations of scores on the Developmental Test of Visual-Motor Integration, the locally standardized Copying Test, and teachers' ratings of scholastic skills in a South African multi-ethnic preschool sample. The study also investigated whether cultural and socioeconomic factors might influence test data. Participants were 71 Black, 101 Coloured, and 66 White children attending preschools in a semirural district. Participants' ages ranged from 4 yr., 9 mo. to 7yr., 0 mo. (M=5.8 yr., SD= 0.3 yr.). Analysis yielded a correlation of .75 between the test scores and supports the suitability of the widely used Developmental Test of Visual-Motor Integration in a multi-ethnic sample. Scores on the Copying Test correlated higher with teachers' ratings. However, significant differences in test performance among groups by race and socioeconomic status suggest the rate of perceptual-motor development may be related to cultural factors. Normative data are reported for groups by race and socioeconomic status.  相似文献   

12.
This investigation concerned the relationship between the Luria-Nebraska Neuropsychological Battery--Children's Revision and the WISC--R for a sample of 32 children identified as learning disabled. The children's mean age was 9 yr., 11 mo.; they were identified as learning disabled on the basis of ability (WISC--R)/achievement discrepancy test scores. The sample was of low average intellectual ability according to the WISC--R and the Luria-Nebraska T-scores. Intercorrelations between scores on the WISC--R and Luria-Nebraska lists were generally nonsignificant, with the exception of language and arithmetic measures on each test. Also, 84% or 27 of the present sample of 32 were correctly identified as learning disabled using a criterion of three or more Luria-Nebraska subscale scores greater than one SD above the mean.  相似文献   

13.
The effects of training overhand ball throws to enhance underhand ball-throwing accuracy were studied with 921 Italian high school pupils (575 girls, M age = 16.5 yr., SD = 1.3; 346 boys, M = 16.5 yr., SD = 1.2) ages 14 to 18 years. A standardized test (10 underhand tennis-ball throws into the floor area delimited by an 80-cm radius wooden hoop lying on the floor 9 m from the standing pupil) was performed and scored. An Experimental group (435 girls, M age = 16.5 yr., SD = 1.3; 204 boys, M age = 16.5 yr., SD = 1.2) was randomly selected to participate in 10 weekly training periods. The set included 10 overhand throws of a rubber bouncing ball (14.5 cm diameter, weight 240 g) into a basketball hoop from 5 standardized positions (in front of the basketball hoop; at 45 degrees, 90 degrees on the right and on the left of the basketball hoop) at a distance of 4.40 m. The underhand test was repeated for both groups. On the average, boys obtained higher test scores than girls of the same age. Older adolescents had higher mean scores than younger adolescents. After 10 weeks, boys and girls of all ages improved. Mean differences between sessions were significant for the Experimental group (Student t test, p < .01). In Session 2, consistent differences between Experimental and Control groups were also found (Student t test, p < .05).  相似文献   

14.
The test-retest stability of the California Verbal Learning Test was examined for a normal elderly sample of 28 men and 74 women along with the correlation for total recall with scores on selected variables. Mean age was 72 yr. The test was re-administered after one year. Mean total recall did not change; the stability coefficient was .64. Coefficients for the other scores ranged considerably (.27 for recognition hits to .62 for perseverations) and were compared with prior research findings. Recall was moderately correlated with scores on the North American Reading Test, Trails B, visual reproductions, and subjective memory ability.  相似文献   

15.
The present study examined age, sex and personality correlates of self-actualization among older adults. The Personal Orientation Inventory and Personality Research Form were administered individually to 80 community men and women aged 56 yr.-84 yr. No main effect for sex was observed on inventory scores, but younger subjects (56 yr.-67 yr.) obtained higher scores than older subjects (68 yr.-84 yr.) on four subscales: Existentiality, Feeling Reactivity, Acceptance of Aggression and Capacity for Intimate Contact. Age was negatively correlated with Feeling Reactivity in both sexes; Inner-directedness decreased with age in men and Capacity for Intimate Contact diminished with age in women. For men, inventory scores were positively associated with the traits of Change and Endurance and negatively correlated with Abasement, Order and Succorance. For women, Impulsivity correlated positively and Order was associated negatively with scores on the inventory. These results suggest that the components of self-actualization are differentially influenced by age for men and women and that the traits accompanying self-actualization are sex-linked.  相似文献   

16.
The purpose of this study was to provide data on the validity of using the WPPSI--R with 16 preschoolers who had been referred for psychological assessment. A comparison of the preschoolers' performance on the WPPSI--R and the Stanford-Binet L-M was made. Children ranged in age from 3-0 to 6-0 years, with a mean age of 4-5 yr. The correlations between WPPSI--R and Stanford-Binet L-M scores were strong; WPPSI--R Verbal IQ and Stanford-Binet L-M IQ, r = .85; WPPSI--R Performance IQ and Stanford-Binet L-M IQ, r = .75, and WPPSI--R Full Scale IQ and Stanford-Binet L-M IQ, r = .82. For this very small sample of children whose mean IQ was significantly below average, the WPPSI--R appeared to assess validly their current intellectual functioning. Attention should be given concurrent and predictive validity as well as the efficiency of using the WPPSI--R with other and larger samples of referred children.  相似文献   

17.
The Halstead-Reitan Neuropsychological Test Battery offers a multifaceted, empirically supported interpretive framework for neuropsychological assessment founded upon raw scores referenced to a brain-damaged sample. Complementary approaches have used T scores from normal samples often composed of community volunteers or medical patients, augmented by regression. By pooling normal controls of all types from published studies, a sample large enough for stratification of core Halstead-Reitan measures by age and education was obtained (206 studies, N1 = 17,980). Comparisons based on T scores with two published normative systems in an outpatient sample produced a continuum: liberal norms for medical patients, intermediate pooled values from many types of subjects, and stringent norms for community volunteers. Features recommending clinical application of pooled norms include enhanced accuracy in clinical situations precluding a full battery, few empty cells or inversions for nonelderly subjects, and a screening application based upon a very large Trail Making Test sample (Trails B: n > 14,000).  相似文献   

18.
Memory impairment following closed-head injury (CHI) in children is well documented. Characterization of the memory deficits of children with CHI could contribute to the prediction of academic performance and rehabilitation of these children. Twenty-five children who sustained closed-head injury and 25 matched controls were administered the Rey Auditory Verbal Learning Test (AVLT). The advantage of this memory test is that a number of memory components are measured simultaneously, thus enabling us to study the relations between different aspects of memory within the same patient sample. The findings indicate that the Rey AVLT is a good test for characterization of impaired verbal memory in children following CHI. Transformation of scores derived from the Rey AVLT to Z-scores enables us to determine the relative effect of CHI in children on different memory scores. Raw scores were more vulnerable than relational ones, derived as the difference between two raw scores (e.g., learning, Trial 5?1), to closed-head injury in children, and scores reflecting word span were the least vulnerable. The results are discussed in terms of the possible contribution of the frontal lobes, which are frequently affected in closed-head injuries, to memory performance.  相似文献   

19.
An analysis of test scores of a longitudinal sample of normal children, who placed in the lowest quartile of the class on the California Achievement Test and children in the highest quartile yielded significant differences on three perceptual-motor tests and on Piaget's preoperational and precausal tests at Kindergarten level. Although matched for IQ in Kindergarten, significant IQ differences appeared in Grade 1 and continued to increase over 5 yr., as those on other tests. Cattell's Early School Personality Questionnaire indicated that low achievers were significantly less mature and more tense and anxious than high achievers from Grades 1 through 5. Maturational lag is hypothesized for retardation of learning.  相似文献   

20.
International comparisons of IQ test norms show differences between nations. In the present study, nonverbal reasoning, processing speed and working memory subtest scores of the US, German, French, Finnish, and Scandinavian (combined Swedish‐Norwegian‐Danish sample) WAIS IV standardization samples were compared. The European samples had higher scores on the reasoning subtests compared to the American sample, corroborating earlier studies. The Finnish and Scandinavian samples had lower processing speed and working memory scores than the American, German, and French samples. Mechanisms that may underlie the observed national IQ profiles include: (1) test‐taking attitudes—in tests that require balancing speed and accuracy of performance Americans may prioritize fast performance while Europeans avoid mistakes; (2) differences between languages in digit articulation times; and (3) educational factors—the European advantage on reasoning subtests may be based on there being better educational systems in Europe as compared to the US.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号