全文获取类型
收费全文 | 407篇 |
免费 | 46篇 |
国内免费 | 107篇 |
出版年
2024年 | 1篇 |
2023年 | 10篇 |
2022年 | 18篇 |
2021年 | 31篇 |
2020年 | 25篇 |
2019年 | 35篇 |
2018年 | 25篇 |
2017年 | 28篇 |
2016年 | 32篇 |
2015年 | 14篇 |
2014年 | 26篇 |
2013年 | 32篇 |
2012年 | 18篇 |
2011年 | 14篇 |
2010年 | 9篇 |
2009年 | 7篇 |
2008年 | 16篇 |
2007年 | 18篇 |
2006年 | 20篇 |
2005年 | 19篇 |
2004年 | 5篇 |
2003年 | 9篇 |
2002年 | 9篇 |
2001年 | 13篇 |
2000年 | 9篇 |
1999年 | 11篇 |
1998年 | 11篇 |
1997年 | 6篇 |
1996年 | 6篇 |
1995年 | 5篇 |
1994年 | 8篇 |
1993年 | 7篇 |
1992年 | 8篇 |
1991年 | 7篇 |
1990年 | 5篇 |
1989年 | 7篇 |
1988年 | 5篇 |
1987年 | 6篇 |
1986年 | 5篇 |
1985年 | 4篇 |
1984年 | 5篇 |
1983年 | 2篇 |
1982年 | 1篇 |
1981年 | 3篇 |
1979年 | 1篇 |
1977年 | 2篇 |
1976年 | 1篇 |
1975年 | 1篇 |
排序方式: 共有560条查询结果,搜索用时 31 毫秒
551.
We evaluated rates of automatically reinforced stereotypy and item engagement for 2 children with autism under multiple and chained schedules in a multielement design. Each schedule included components during which stereotypy was blocked (S–) or allowed (S+), and we used colored cards as schedule‐correlated stimuli. We report rates of stereotypy and item engagement during S– and S+ components, as well as the percentage of component time that elapsed before the first instances of stereotypy and item engagement. We observed less stereotypy and more consistent item engagement during chained‐schedule sessions, and stimulus control of stereotypy and item engagement was established with the chained schedule. A subsequent concurrent‐chains analysis revealed participant preference for the chained schedule. These results highlight the importance of contingent access to stereotypy when therapists attempt to gain stimulus control of stereotypy and increase functional item engagement. 相似文献
552.
Kerry A. Chalmers 《The British journal of developmental psychology》2014,32(4):480-491
Children aged 3½ to 6½ years viewed pictures of common objects presented either once or three times on one of two consecutive days. A different hand puppet was used to present the pictures on each day, providing both perceptual and temporal cues to source. At test, old (studied) and new (non‐studied) pictures were presented for item recognition and source identification. Results showed that both item and source accuracy were higher for older (M = 5; 9 years) than younger children (M = 4; 6 years). Significant interactions between Age and Day of study were found for both item and source accuracy. For younger children, accuracy was higher for pictures studied on Day 1 than Day 2 (significant for source identification but not item recognition), whereas older children showed the opposite pattern: Higher accuracy for Day 2 than Day 1 (significant for item recognition but not source identification). Results are interpreted with respect to proactive interference and response bias. The utility of signal detection theory measures in determining the basis of age differences in performance of source identification is discussed. 相似文献
553.
Michalis P. Michaelides Militsa Ivanova Christiana Nicolaou 《International Journal of Testing》2020,20(3):187-205
The study examined the relationship between examinees’ test-taking effort and their accuracy rate on items from the PISA 2015 assessment. The 10% normative threshold method was applied on Science multiple-choice items in the Cyprus sample to detect rapid guessing behavior. Results showed that the extent of rapid guessing across simple and complex multiple-choice items was on average less than 6% per item. Rapid guessers were identified, and for most items their accuracy was lower than the accuracy for students engaging in solution-based behavior. A number of plausible explanations were graphically evaluated for items for which accuracy was higher for the rapid guessing subgroup. Overall, this empirical investigation presents original evidence on test-taking effort as measured by response time in PISA items and tests propositions of Wise’s (2017) Test-Taking Theory. 相似文献
554.
Yanyan Fu Tyler Strachan Edward H. Ip John T. Willse Shyh-Huei Chen Terry Ackerman 《International Journal of Testing》2020,20(2):169-186
This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and conditions. When a test measured weakly discriminated dimensions, it became harder to recover the latent correlation. Results also showed that increasing the sample size, test length, or using simpler models (i.e., two-parameter logistic rather than three-parameter logistic, compensatory rather than noncompensatory) could improve the recovery of latent correlation. 相似文献
555.
William C. M. Belzak 《Multivariate behavioral research》2020,55(5):722-747
AbstractDifferential item functioning (DIF) is a pernicious statistical issue that can mask true group differences on a target latent construct. A considerable amount of research has focused on evaluating methods for testing DIF, such as using likelihood ratio tests in item response theory (IRT). Most of this research has focused on the asymptotic properties of DIF testing, in part because many latent variable methods require large samples to obtain stable parameter estimates. Much less research has evaluated these methods in small sample sizes despite the fact that many social and behavioral scientists frequently encounter small samples in practice. In this article, we examine the extent to which model complexity—the number of model parameters estimated simultaneously—affects the recovery of DIF in small samples. We compare three models that vary in complexity: logistic regression with sum scores, the 1-parameter logistic IRT model, and the 2-parameter logistic IRT model. We expected that logistic regression with sum scores and the 1-parameter logistic IRT model would more accurately estimate DIF because these models yielded more stable estimates despite being misspecified. Indeed, a simulation study and empirical example of adolescent substance use show that, even when data are generated from / assumed to be a 2-parameter logistic IRT, using parsimonious models in small samples leads to more powerful tests of DIF while adequately controlling for Type I error. We also provide evidence for minimum sample sizes needed to detect DIF, and we evaluate whether applying corrections for multiple testing is advisable. Finally, we provide recommendations for applied researchers who conduct DIF analyses in small samples. 相似文献
556.
Scott B. Morris Michael Bass Elizabeth Howard Richard E. Neapolitan 《International Journal of Testing》2020,20(2):146-168
The standard error (SE) stopping rule, which terminates a computer adaptive test (CAT) when the SE is less than a threshold, is effective when there are informative questions for all trait levels. However, in domains such as patient-reported outcomes, the items in a bank might all target one end of the trait continuum (e.g., negative symptoms), and the bank may lack depth for many individuals. In such cases, the predicted standard error reduction (PSER) stopping rule will stop the CAT even if the SE threshold has not been reached and can avoid administering excessive questions that provide little additional information. By tuning the parameters of the PSER algorithm, a practitioner can specify a desired tradeoff between accuracy and efficiency. Using simulated data for the Patient-Reported Outcomes Measurement Information System Anxiety and Physical Function banks, we demonstrate that these parameters can substantially impact CAT performance. When the parameters were optimally tuned, the PSER stopping rule was found to outperform the SE stopping rule overall, particularly for individuals not targeted by the bank, and presented roughly the same number of items across the trait continuum. Therefore, the PSER stopping rule provides an effective method for balancing the precision and efficiency of a CAT. 相似文献
557.
Personality development research heavily relies on the comparison of scale means across age. This approach implicitly assumes that the scales are strictly measurement invariant across age. We questioned this assumption by examining whether appropriate personality indicators change over the lifespan. Moreover, we identified which types of items (e.g. dispositions, behaviours, and interests) are particularly prone to age effects. We reanalyzed the German Revised NEO Personality Inventory normative sample (N = 11,724) and applied a genetic algorithm to select short scales that yield acceptable model fit and reliability across locally weighted samples ranging from 16 to 66 years of age. We then examined how the item selection changes across age points and item types. Emotion‐type items seemed to be interchangeable and generally applicable to people of all ages. Specific interests, attitudes, and social effect items—most prevalent within the domains of Extraversion, Agreeableness, and Openness—seemed to be more prone to measurement variations over age. A large proportion of items were systematically discarded by the item‐selection procedure, indicating that, independent of age, many items are problematic measures of the underlying traits. The implications for personality assessment and personality development research are discussed. © 2019 European Association of Personality Psychology 相似文献
558.
Jung Aa Moon Sandip Sinharay Madeleine Keehner Irvin R. Katz 《International Journal of Testing》2020,20(2):122-145
The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants’ tendency to respond to an item was affected by the presence of a grid and variations of answer options. The results of an item response theory analysis were consistent with the hypothesized cognitive processes in alternative item formats. The findings suggest that seemingly subtle variations of item design could substantially affect test-taker cognition and psychometric outcomes, emphasizing the need for investigating item format effects at a fine-grained level. 相似文献
559.
560.
Constructed-response items have been shown to be appropriate for cognitively diagnostic assessments because students’ problem-solving procedures can be observed, providing direct evidence for making inferences about their proficiency. However, multiple strategies used by students make item scoring and psychometric analyses challenging. This study introduces the so-called two-digit scoring scheme into diagnostic assessments to record both students’ partial credits and their strategies. This study also proposes a diagnostic tree model (DTM) by integrating the cognitive diagnosis models with the tree model to analyse the items scored using the two-digit rubrics. Both convergent and divergent tree structures are considered to accommodate various scoring rules. The MMLE/EM algorithm is used for item parameter estimation of the DTM, and has been shown to provide good parameter recovery under varied conditions in a simulation study. A set of data from TIMSS 2007 mathematics assessment is analysed to illustrate the use of the two-digit scoring scheme and the DTM. 相似文献