期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Validation tests

LARCEBEAU S 《L' Année psychologique》1953,53(2):553-562

相似文献

2.

Psychological tests

Roy A. Burkhart John A. Whitesel Lawrence K. Frank 《Pastoral Psychology》1951,2(6):49-51

相似文献

3.

One-and-two-tailed tests

MARKS RM 《Psychological review》1953,60(3):207-208

相似文献

4.

Diagnosing tests: using and misusing diagnostic and screening tests

Streiner DL 《Journal of personality assessment》2003,81(3):209-219

Tests can be used either diagnostically (i.e., to confirm or rule out the presence of a condition in people suspected of having it) or as a screening instrument (determining who in a large group of people has the condition and often when those people are unaware of it or unwilling to admit to it). Tests that may be useful and accurate for diagnosis may actually do more harm than good when used as a screening instrument. The reason is that the proportion of false negatives may be high when the prevalence is high, and the proportion of false positives tends to be high when the prevalence of the condition is low (the usual situation with screening tests). My first aim of this article is to discuss the effects of the base rate, or prevalence, of a disorder on the accuracy of test results. My second aim is to review some of the many diagnostic efficiency statistics that can be derived from a 2 x 2 table, including the overall correct classification rate, kappa, phi, the odds ratio, positive and negative predictive power and some variants of them, and likelihood ratios. In the last part of this article, I review the recent Standards for Reporting of Diagnostic Accuracy guidelines (Bossuyt et al., 2003) for reporting the results of diagnostic tests and extend them to cover the types of tests used by psychologists. 相似文献

5.

Grading distractor-identification tests

Joe Dan Austin 《Psychometrika》1981,46(2):129-137

On distractor-identification tests students mark as many distractors as possible on each test item. A grading scale is developed for this type testing. The scale is optimal in that it is the unique scale giving an unbiased estimate of the student's true score, i.e., the score that would result if no guessing occurred. If the test is administered as a usual multiple choice test and graded using the usual correction for guessing scale, the expected item score is the same as for the distractor-identification testing using the optimal grading scale. However, the variance of the item score is shown to be less for distractor-identification testing than for usual multiple choice testing under certain conditions. 相似文献

6.

Conditioned taste aversions: Two-stimulus tests are more sensitive than one-stimulus tests

Frederick W. Grote Robert T. Brown 《Behavior research methods》1971,3(6):311-312

Weanling and mature rats were presented with saccharin or saline solutions for 1 h on alternate days. Following exposure to saccharin, rats were injected with 0, 21, or 37 mg/kg of cyclophosphamide. Injections had no significant effect on saccharin preference in one-stimulus tests, but had a highly significant effect in two-stimulus tests. 相似文献

7.

Generalizability of stratified-parallel tests 总被引：6，自引：0，他引：6

Nageswari Rajaratnam Lee J. Cronbach Goldine C. Gleser 《Psychometrika》1965,30(1):39-56

相似文献

8.

Mental tests and fossils

Littman RA 《Journal of the history of the behavioral sciences》2004,40(4):423-431

This article investigates the origins of the intelligence test item known as the Ball and Field in Lewis M. Terman's Stanford Revision of the Binet-Simon Intelligence Scale. The question was initially raised by the resemblance of paleontological ocean bed floor tracings left by ancient creatures to the responses produced by children given the Ball and Field Test. A version of the Ball and Field Test was invented by Clifton F. Hodge, one of Terman's graduate school instructors who devised it as a result of his observations about how birds and other animals navigated and found their way. He then tested how humans and children located hidden objects and found that, in many ways, animals and humans used similar strategies for getting home or finding objects. 相似文献

9.

Loglinear Rasch model tests 总被引：1，自引：0，他引：1

Hendrikus Kelderman 《Psychometrika》1984,49(2):223-245

Existing statistical tests for the fit of the Rasch model have been criticized, because they are only sensitive to specific violations of its assumptions. Contingency table methods using loglinear models have been used to test various psychometric models. In this paper, the assumptions of the Rasch model are discussed and the Rasch model is reformulated as a quasi-independence model. The model is a quasi-loglinear model for the incomplete subgroup × score × item 1 × item 2 × ... × itemk contingency table. Using ordinary contingency table methods the Rasch model can be tested generally or against less restrictive quasi-loglinear models to investigate specific violations of its assumptions. 相似文献

10.

On the Wilson tests

N. Donald Ylvisaker 《Psychometrika》1960,25(3):297-302

A general critical analysis of the median tests proposed by Wilson for certain analysis of variance hypotheses is presented. Specifically, discrepancies between the purported and actual approximate distributions of some of the test statistics are noted. Validity and power of the resulting tests are discussed.This work was sponsored in part by the Office of Naval Research while the author was at Stanford University. Reproduction in whole or in part is permitted for any purpose of the United States Government. The author wishes to thank Professors Fred C. Andrews, Lincoln E. Moses, and David L. Wallace for their helpful criticisms and suggestions in the writing of this paper. 相似文献

11.

Statistical evaluation of Rorschach tests

HECTOR H 《Psychiatrie, Neurologie, und medizinische Psychologie》1950,2(7):214-218

相似文献

12.

Subtlety in structured personality tests

SEEMAN W 《Journal of consulting psychology》1952,16(4):278-283

相似文献

13.

Cross-validation of clerical aptitude tests

HAY EN 《The Journal of applied psychology》1950,34(3):153-158

相似文献

14.

One-tailed tests and unexpected results

GOLDFRIED MR 《Psychological review》1959,66(1):79-80

相似文献

15.

Verbal tests of spatial conceptualization

L C Hartlage 《Journal of experimental psychology. General》1969,80(1):180-182

相似文献

16.

Successive tests of pair recognition

Sikström SP 《Memory (Hove, England)》1998,6(5):531-554

A large number of experiments have found a moderate degree of dependence between subsequent tests of recognition and cued recall as described by the TW-function. This paper investigates the dependence in word pair recognition. Tests of word pair recognition are conducted with the subsequent test being free recall, cued recall, recognition, and cued recognition. The dependence is compared to subsequent tests of cued recognition (i.e. recognition of a target with the presence of a cue). The results are related to a general theory of memory called TECO (Target, Event, Cue, & Object, see Sikstr?m 1996b). This theory makes different quantitative predictions depending on the number of shared connections in the subsequent tests. Using a function suggested by TECO, different degrees of dependencies are predicted for pair and cued recognition. The predictions of the TECO-function show a non-significant deviation from observed data, whereas those of the TW-function deviate significantly in all conditions. 相似文献

17.

Projective tests are valid.

B P Karon 《The American psychologist》1978,33(8):764-765

相似文献

18.

A rejoinder on one-tailed tests

JONES LV 《Psychological bulletin》1954,5(6):585-586

相似文献

19.

Ability estimation for conventional tests

Jwa K. Kim W. Alan Nicewander 《Psychometrika》1993,58(4):587-599

Five different ability estimators—maximum likelihood [MLE ()], weighted likelihood [WLE ()], Bayesian modal [BME ()], expected a posteriori [EAP ()] and the standardized number-right score [Z ()]—were used as scores for conventional, multiple-choice tests. The bias, standard error and reliability of the five ability estimators were evaluated using Monte Carlo estimates of the unknown conditional means and variances of the estimators. The results indicated that ability estimates based on BME (), EAP () or WLE () were reasonably unbiased for the range of abilities corresponding to the difficulty of a test, and that their standard errors were relatively small. Also, they were as reliable as the old standby—the number-right score. 相似文献

20.

Bayesian tests of measurement invariance

A. J. Verhagen J. P. Fox 《The British journal of mathematical and statistical psychology》2013,66(3):383-401

Random item effects models provide a natural framework for the exploration of violations of measurement invariance without the need for anchor items. Within the random item effects modelling framework, Bayesian tests (Bayes factor, deviance information criterion) are proposed which enable multiple marginal invariance hypotheses to be tested simultaneously. The performance of the tests is evaluated with a simulation study which shows that the tests have high power and low Type I error rate. Data from the European Social Survey are used to test for measurement invariance of attitude towards immigrant items and to show that background information can be used to explain cross‐national variation in item functioning. 相似文献