期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Testing for adverse impact when sample size is small

Collins MW Morris SB 《The Journal of applied psychology》2008,93(2):463-471

Adverse impact evaluations often call for evidence that the disparity between groups in selection rates is statistically significant, and practitioners must choose which test statistic to apply in this situation. To identify the most effective testing procedure, the authors compared several alternate test statistics in terms of Type I error rates and power, focusing on situations with small samples. Significance testing was found to be of limited value because of low power for all tests. Among the alternate test statistics, the widely-used Z-test on the difference between two proportions performed reasonably well, except when sample size was extremely small. A test suggested by G. J. G. Upton (1982) provided slightly better control of Type I error under some conditions but generally produced results similar to the Z-test. Use of the Fisher Exact Test and Yates's continuity-corrected chi-square test are not recommended because of overly conservative Type I error rates and substantially lower power than the Z-test. 相似文献

2.

Power of pairwise comparisons in the equal variance and unequal sample size case

Philip H. Ramsey Patricia P. Ramsey 《The British journal of mathematical and statistical psychology》2008,61(1):115-131

A Monte Carlo simulation was conducted to compare five, pairwise multiple comparison procedures. The number of means varied from 4 to 6 and the sample size ratio varied from 1 to 60. Procedures were evaluated on the basis of Type I errors, any‐pair power and all‐pairs power. Four procedures were shown to be conservative, while the fifth provided adequate control of Type I errors only for restricted values of sample size ratios. No procedure was found to be uniformly most powerful. The Tukey‐Kramer procedure was found to provide the best any‐pair power provided it is applied without requiring a significant overall F test. In most cases, the Hayter‐Fisher modification of the Tukey‐Kramer was found to provide very good any‐pair power and to be uniformly more powerful than the Tukey‐Kramer when a significant overall F test is required. A partition‐based version of Peritz's method usually provided the greatest all‐pairs power. A modification of the Shaffer‐Welsch was found to be useful in certain conditions. 相似文献

3.

Optimization of sample size in controlled experiments: The CLAST rule

Botella J Ximénez C Revuelta J Suero M 《Behavior research methods》2006,38(1):65-76

Sequential rules are explored in the context of null hypothesis significance testing. Several studies have demonstrated that the fixed-sample stopping rule, in which the sample size used by researchers is determined in advance, is less practical and less efficient than sequential stopping rules. It is proposed that a sequential stopping rule called CLAST (composite limited adaptive sequential test) is a superior variant of COAST (composite open adaptive sequential test), a sequential rule proposed by Frick (1998). Simulation studies are conducted to test the efficiency of the proposed rule in terms of sample size and power. Two statistical tests are used: the one-tailed t test of mean differences with two matched samples, and the chi-square independence test for twofold contingency tables. The results show that the CLAST rule is more efficient than the COAST rule and reflects more realistically the practice of experimental psychology researchers. 相似文献

4.

Multilevel factorial experiments for developing behavioral interventions: power, sample size, and resource considerations

Dziak JJ Nahum-Shani I Collins LM 《心理学方法》2012,17(2):153-175

Factorial experimental designs have many potential advantages for behavioral scientists. For example, such designs may be useful in building more potent interventions by helping investigators to screen several candidate intervention components simultaneously and to decide which are likely to offer greater benefit before evaluating the intervention as a whole. However, sample size and power considerations may challenge investigators attempting to apply such designs, especially when the population of interest is multilevel (e.g., when students are nested within schools, or when employees are nested within organizations). In this article, we examine the feasibility of factorial experimental designs with multiple factors in a multilevel, clustered setting (i.e., of multilevel, multifactor experiments). We conduct Monte Carlo simulations to demonstrate how design elements-such as the number of clusters, the number of lower-level units, and the intraclass correlation-affect power. Our results suggest that multilevel, multifactor experiments are feasible for factor-screening purposes because of the economical properties of complete and fractional factorial experimental designs. We also discuss resources for sample size planning and power estimation for multilevel factorial experiments. These results are discussed from a resource management perspective, in which the goal is to choose a design that maximizes the scientific benefit using the resources available for an investigation. 相似文献

5.

Accuracy in parameter estimation for ANCOVA and ANOVA contrasts: sample size planning via narrow confidence intervals

Lai K Kelley K 《The British journal of mathematical and statistical psychology》2012,65(2):350-370

Contrasts of means are often of interest because they describe the effect size among multiple treatments. High-quality inference of population effect sizes can be achieved through narrow confidence intervals (CIs). Given the close relation between CI width and sample size, we propose two methods to plan the sample size for an ANCOVA or ANOVA study, so that a sufficiently narrow CI for the population (standardized or unstandardized) contrast of interest will be obtained. The standard method plans the sample size so that the expected CI width is sufficiently small. Since CI width is a random variable, the expected width being sufficiently small does not guarantee that the width obtained in a particular study will be sufficiently small. An extended procedure ensures with some specified, high degree of assurance (e.g., 90% of the time) that the CI observed in a particular study will be sufficiently narrow. We also discuss the rationale and usefulness of two different ways to standardize an ANCOVA contrast, and compare three types of standardized contrast in the ANCOVA/ANOVA context. All of the methods we propose have been implemented in the freely available MBESS package in R so that they can be easily applied by researchers. 相似文献

6.

Accuracy in parameter estimation for targeted effects in structural equation modeling: sample size planning for narrow confidence intervals

Lai K Kelley K 《心理学方法》2011,16(2):127-148

In addition to evaluating a structural equation model (SEM) as a whole, often the model parameters are of interest and confidence intervals for those parameters are formed. Given a model with a good overall fit, it is entirely possible for the targeted effects of interest to have very wide confidence intervals, thus giving little information about the magnitude of the population targeted effects. With the goal of obtaining sufficiently narrow confidence intervals for the model parameters of interest, sample size planning methods for SEM are developed from the accuracy in parameter estimation approach. One method plans for the sample size so that the expected confidence interval width is sufficiently narrow. An extended procedure ensures that the obtained confidence interval will be no wider than desired, with some specified degree of assurance. A Monte Carlo simulation study was conducted that verified the effectiveness of the procedures in realistic situations. The methods developed have been implemented in the MBESS package in R so that they can be easily applied by researchers. 相似文献

7.

On sample size calculation for 2×2 fixed‐effect ANOVA when variances are unknown and possibly unequal

Jiin‐Huarng Guo Dr Wei‐Ming Luh 《The British journal of mathematical and statistical psychology》2009,62(2):417-425

The factorial 2 × 2 fixed‐effect ANOVA is a procedure used frequently in scientific research to test mean differences between‐subjects in all of the groups. But if the assumption of homogeneity is violated, the test for the row, column, and the interaction effect might be invalid or less powerful. Therefore, for planning research in the case of unknown and possibly unequal variances, it is worth developing a sample size formula to obtain the desired power. This article suggests a simple formula to determine the sample size for 2 × 2 fixed‐effect ANOVA for heterogeneous variances across groups. We use the approximate Welch t test and consider the variance ratio to derive the formula. The sample size determination requires two‐step iterations but the approximate sample sizes needed for the main effect and the interaction effect can be determined separately with the specified power. The present study also provides an example and a SAS program to facilitate the calculation process. 相似文献

8.

Familiarity and personal experience as mediators of recall when planning for future contingencies

Klein SB Robertson TE Delton AW Lax ML 《Journal of experimental psychology. Learning, memory, and cognition》2012,38(1):240-245

In this article, we demonstrate that planning tasks enhance recall when the context of planning (a) is self-referential and (b) draws on familiar scenarios represented in episodic memory. Specifically, we show that when planning tasks are sorted according to the degree to which they evoke memories of personally familiar scenarios (e.g., planning a picnic), recall is reliably superior to tasks that fail to do so (e.g., planning an Arctic trek). We discuss the implications of these findings for planning tasks and their relation to episodic memory. 相似文献

9.

It's in the sample: the effects of sample size and sample diversity on the breadth of inductive generalization

Lawson CA Fisher AV 《Journal of experimental child psychology》2011,110(4):499-519

Developmental studies have provided mixed evidence with regard to the question of whether children consider sample size and sample diversity in their inductive generalizations. Results from four experiments with 105 undergraduates, 105 school-age children (M = 7.2 years), and 105 preschoolers (M = 4.9 years) showed that preschoolers made a higher rate of projections from large samples than from small samples when samples were diverse (Experiments 1 and 3) but not when samples were homogeneous (Experiment 4) and not when the task required a choice between two samples (Experiment 2). Furthermore, when a property occurred in large and diverse samples, preschoolers exhibited a broad pattern of projection, generalizing the property to items from categories not represented in the evidence. In contrast, adults followed a normative pattern of induction and never attributed properties to items from categories not represented in the evidence. School-age children showed a mixed pattern of results. 相似文献

10.

Tests for equality of several alpha coefficients when their sample estimates are dependent

David J. Woodruff Leonard S. Feldt 《Psychometrika》1986,51(3):393-413

In a variety of measurement situations, the researcher may wish to compare the reliabilities of several instruments administered to the same sample of subjects. This paper presents eleven statistical procedures which test the equality ofm coefficient alphas when the sample alpha coefficients are dependent. Several of the procedures are derived in detail, and numerical examples are given for two. Since all of the procedures depend on approximate asymptotic results, Monte Carlo methods are used to assess the accuracy of the procedures for sample sizes of 50, 100, and 200. Both control of Type I error and power are evaluated by computer simulation. Two of the procedures are unable to control Type I errors satisfactorily. The remaining nine procedures perform properly, but three are somewhat superior in power and Type I error control.A more detailed version of this paper is also available. 相似文献

11.

Power function charts for specification of sample size in analysis of variance

Leonard S. Feldt Moharram W. Mahmoud 《Psychometrika》1958,23(3):201-210

The specification of sample size is an important aspect of the planning of every experiment. When the investigator intends to use the techniques of analysis of variance in the study of treatments effects, he should, in specifying sample size, take into consideration the power of theF tests which will be made. The charts presented in this paper make possible a simple and direct estimate of the sample size required forF tests of specified power. 相似文献

12.

Remarks on the method of paired comparisons: III. A test of significance for paired comparisons when equal standard deviations and equal correlations are assumed

MOSTELLER F 《Psychometrika》1951,16(2):207-218

A test of goodness of fit is developed for Thurstone's method of paired comparisons, Case V. The test involves the computation of , wheren is the number of observations per pair, and and are the angles obtained by applying the inverse sine transformation to the fitted and the observed proportions respectively. The number of degrees of freedom is (k–1) (k–2)/2.This research was performed in the Laboratory of Social Relations under a grant made available to Harvard University by the RAND Corporation under the Department of the Air Force, Project RAND. 相似文献

13.

Power and sample size calculations for multivariate linear models with random explanatory variables

Gwowen?Shieh Email author 《Psychometrika》2005,70(2):347-358

This article considers the problem of power and sample size calculations for normal outcomes within the framework of multivariate linear models. The emphasis is placed on the practical situation that not only the values of response variables for each subject are just available after the observations are made, but also the levels of explanatory variables cannot be predetermined before data collection. Using analytic justification, it is shown that the proposed methods extend the existing approaches to accommodate the extra variability and arbitrary configurations of the explanatory variables. The major modification involves the noncentrality parameters associated with the F approximations to the transformations of Wilks likelihood ratio, Pillai trace and Hotelling-Lawley trace statistics. A treatment of multivariate analysis of covariance models is employed to demonstrate the distinct features of the proposed extension. Monte Carlo simulation studies are conducted to assess the accuracy using a child’s intellectual development model. The results update and expand upon current work in the literature.The author wishes to thank the associate editor and the referees for comments which improve the paper considerably. This research was partially supported by a grant from the Natural Science Council of Taiwan. 相似文献

14.

Cognitive abstraction, shifting, and control: clinical sample comparisons of psychopaths and nonpsychopaths

P B Sutker A N Allain 《Journal of abnormal psychology》1987,96(1):73-75

相似文献

15.

Confidence intervals and sample size calculations for the standardized mean difference effect size between two normal populations under heteroscedasticity

G. Shieh 《Behavior research methods》2013,45(4):955-967

The use of effect sizes and associated confidence intervals in all empirical research has been strongly emphasized by journal publication guidelines. To help advance theory and practice in the social sciences, this article describes an improved procedure for constructing confidence intervals of the standardized mean difference effect size between two independent normal populations with unknown and possibly unequal variances. The presented approach has advantages over the existing formula in both theoretical justification and computational simplicity. In addition, simulation results show that the suggested one- and two-sided confidence intervals are more accurate in achieving the nominal coverage probability. The proposed estimation method provides a feasible alternative to the most commonly used measure of Cohen’s d and the corresponding interval procedure when the assumption of homogeneous variances is not tenable. To further improve the potential applicability of the suggested methodology, the sample size procedures for precise interval estimation of the standardized mean difference are also delineated. The desired precision of a confidence interval is assessed with respect to the control of expected width and to the assurance probability of interval width within a designated value. Supplementary computer programs are developed to aid in the usefulness and implementation of the introduced techniques. 相似文献

16.

Realizing complex delayed intentions in young and old adults: The role of planning aids 总被引：1，自引：0，他引：1

Kliegel M Martin M McDaniel MA Einstein GO Moor C 《Memory & cognition》2007,35(7):1735-1746

Although it has been suggested that the delayed realization of intended actions should benefit from appropriate intention planning, empirical evidence on this issue is scarce. In three experiments, we examined whether and which planning aids provided in the intention formation phase affect delayed intention realization in young and old adults. One finding was that intention planning directly affected delayed intention realization: instructing participants to include the cue for appropriate intention initiation in their plans benefited delayed performance. Another finding was that older adults' performance was improved when they were guided in structuring their plan in combination with guidance in implementing this plan after a delay. In sum, the results point to the importance of plan-related factors for understanding the delayed realization of intended actions. 相似文献

17.

Where to look for information when planning scientific research in Psychology: Sources and channels

《International Journal of Clinical and Health Psychology》2014,14(1):76-82

It is imperative that researchers invest time in the planning of their research, and it is certainly essential to stop and seek information before making any kind of decision. The present work sets out to guide psychologists in this crucial task. To this end we begin by suggesting a visit to the APA website, where a great deal of relevant information on most topics can be found, whether it pertains to new and controversial issues or to those on which there is greater consensus. In this regard we shall consider at length the meanings of the expressions “evidence-based practice” and “scientific evidence” and their inherent methodological aspects, from “scientific evidence” contributed by systematic reviews to the way it can be obtained using handbooks and guidelines of inestimable value for the successful completion of our research. All such resources will help researchers to set out their hypotheses correctly, to test them adequately and to analyze the data in the most appropriate and rigorous fashion. In this way, the quality of the research will undoubtedly improve. 相似文献

18.

Perception of slant when perspective and stereopsis conflict: experiments with aniseikonic lenses 总被引：1，自引：0，他引：1

B J Gillam 《Journal of experimental psychology. General》1968,78(2):299-305

相似文献

19.

Visual field asymmetries in numerical size comparisons of digits, words, and signs 总被引：1，自引：0，他引：1

J Vaid D Corina 《Brain and language》1989,36(1):117-126

Visual field asymmetries were examined in American Sign Language-English bilinguals for speeded numerical size judgments of pairs of digits, number words, and number signs. Physical size of the number pairs was either congruent or incongruent with their numerical size. The results revealed a greater left visual field (LVF) interference for numbers represented as digits and a greater right visual field (RVF) interference for numbers represented as words or signs. Subjects' performance on number words and signs was also influenced by their skill in English and ASL: interference was greater in the RVF in the subjects' better language but was greater in the LVF for the less skilled language. These findings suggest that lateralization of numerical size judgments is moderated by the mode of number presentation and by prior language experience. 相似文献

20.

Determining the sample size for a replication attempt: A short and simple microcomputer program

Raphael Gillett 《Current Psychology》1990,9(3):304-307

Replication studies frequently fail to detect genuine effects because too few subjects are employed to yield an acceptable level of power. To remedy this situation, a method of sample size determination in replication attempts is described that uses information supplied by the original experiment to establish a distribution of probable effect sizes. The sample size to be employed is that which supplies an expected power of the desired amount over the distribution of probable effect sizes. The method may be used in replication attempts involving the comparison of means, the comparison of correlation coefficients, and the comparison of proportions. The widely available equation-solving program EUREKA provides a rapid means of executing the method on a microcomputer. Only ten lines are required to represent the method as a set of equations in EUREKA’s language. Such an equation file is readily modified, so that even inexperienced users find it a straightforward means of obtaining the sample size for a variety of designs. 相似文献