560 results found; search took 31 ms.
551.
We evaluated rates of automatically reinforced stereotypy and item engagement for 2 children with autism under multiple and chained schedules in a multielement design. Each schedule included components during which stereotypy was blocked (S–) or allowed (S+), and we used colored cards as schedule‐correlated stimuli. We report rates of stereotypy and item engagement during S– and S+ components, as well as the percentage of component time that elapsed before the first instances of stereotypy and item engagement. We observed less stereotypy and more consistent item engagement during chained‐schedule sessions, and stimulus control of stereotypy and item engagement was established with the chained schedule. A subsequent concurrent‐chains analysis revealed participant preference for the chained schedule. These results highlight the importance of contingent access to stereotypy when therapists attempt to gain stimulus control of stereotypy and increase functional item engagement.
552.
Children aged 3½ to 6½ years viewed pictures of common objects presented either once or three times on one of two consecutive days. A different hand puppet was used to present the pictures on each day, providing both perceptual and temporal cues to source. At test, old (studied) and new (non‐studied) pictures were presented for item recognition and source identification. Results showed that both item and source accuracy were higher for older (M = 5;9 years) than younger children (M = 4;6 years). Significant interactions between Age and Day of study were found for both item and source accuracy. For younger children, accuracy was higher for pictures studied on Day 1 than Day 2 (significant for source identification but not item recognition), whereas older children showed the opposite pattern: higher accuracy for Day 2 than Day 1 (significant for item recognition but not source identification). Results are interpreted with respect to proactive interference and response bias. The utility of signal detection theory measures in determining the basis of age differences in performance of source identification is discussed.
553.
The study examined the relationship between examinees’ test-taking effort and their accuracy rate on items from the PISA 2015 assessment. The 10% normative threshold method was applied to Science multiple-choice items in the Cyprus sample to detect rapid guessing behavior. Results showed that the extent of rapid guessing across simple and complex multiple-choice items was on average less than 6% per item. Rapid guessers were identified, and for most items their accuracy was lower than the accuracy for students engaging in solution-based behavior. A number of plausible explanations were graphically evaluated for items for which accuracy was higher for the rapid guessing subgroup. Overall, this empirical investigation presents original evidence on test-taking effort as measured by response time in PISA items and tests propositions of Wise’s (2017) Test-Taking Theory.
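To make the threshold idea in this abstract concrete, here is a minimal sketch of a normative-threshold flagging rule. It assumes the commonly described form of the method, in which an item's threshold is 10% of the mean response time on that item, capped at 10 seconds. The function names, the cap value, and the sample data are illustrative assumptions and are not taken from the study.

```python
from statistics import mean


def normative_threshold(item_times, pct=0.10, cap=10.0):
    """Return the rapid-guessing threshold (seconds) for one item.

    Assumption: threshold = pct * mean response time, capped at `cap` seconds.
    """
    return min(pct * mean(item_times), cap)


def flag_rapid_guesses(item_times, pct=0.10, cap=10.0):
    """Flag each response whose time falls below the item's threshold."""
    threshold = normative_threshold(item_times, pct, cap)
    return [t < threshold for t in item_times]


# Hypothetical response times (seconds) of five examinees on one item.
times = [2.0, 40.0, 55.0, 63.0, 40.0]
flags = flag_rapid_guesses(times)
rapid_share = sum(flags) / len(flags)  # proportion of flagged responses
```

With these made-up times the mean is 40 seconds, so the threshold is 4 seconds and only the first response is flagged.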
554.
This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and conditions. When a test measured weakly discriminated dimensions, it became harder to recover the latent correlation. Results also showed that increasing the sample size, test length, or using simpler models (i.e., two-parameter logistic rather than three-parameter logistic, compensatory rather than noncompensatory) could improve the recovery of latent correlation.
555.
Differential item functioning (DIF) is a pernicious statistical issue that can mask true group differences on a target latent construct. A considerable amount of research has focused on evaluating methods for testing DIF, such as using likelihood ratio tests in item response theory (IRT). Most of this research has focused on the asymptotic properties of DIF testing, in part because many latent variable methods require large samples to obtain stable parameter estimates. Much less research has evaluated these methods in small sample sizes despite the fact that many social and behavioral scientists frequently encounter small samples in practice. In this article, we examine the extent to which model complexity—the number of model parameters estimated simultaneously—affects the recovery of DIF in small samples. We compare three models that vary in complexity: logistic regression with sum scores, the 1-parameter logistic IRT model, and the 2-parameter logistic IRT model. We expected that logistic regression with sum scores and the 1-parameter logistic IRT model would more accurately estimate DIF because these models yielded more stable estimates despite being misspecified. Indeed, a simulation study and empirical example of adolescent substance use show that, even when data are generated from / assumed to be a 2-parameter logistic IRT, using parsimonious models in small samples leads to more powerful tests of DIF while adequately controlling for Type I error. We also provide evidence for minimum sample sizes needed to detect DIF, and we evaluate whether applying corrections for multiple testing is advisable. Finally, we provide recommendations for applied researchers who conduct DIF analyses in small samples.
556.
The standard error (SE) stopping rule, which terminates a computer adaptive test (CAT) when the SE is less than a threshold, is effective when there are informative questions for all trait levels. However, in domains such as patient-reported outcomes, the items in a bank might all target one end of the trait continuum (e.g., negative symptoms), and the bank may lack depth for many individuals. In such cases, the predicted standard error reduction (PSER) stopping rule will stop the CAT even if the SE threshold has not been reached and can avoid administering excessive questions that provide little additional information. By tuning the parameters of the PSER algorithm, a practitioner can specify a desired tradeoff between accuracy and efficiency. Using simulated data for the Patient-Reported Outcomes Measurement Information System Anxiety and Physical Function banks, we demonstrate that these parameters can substantially impact CAT performance. When the parameters were optimally tuned, the PSER stopping rule was found to outperform the SE stopping rule overall, particularly for individuals not targeted by the bank, and presented roughly the same number of items across the trait continuum. Therefore, the PSER stopping rule provides an effective method for balancing the precision and efficiency of a CAT.
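The contrast between the two stopping rules can be sketched roughly as follows. This is a simplified illustration, not the published PSER algorithm. It assumes dichotomous 2PL items, approximates the SE as one over the square root of the test information at the current trait estimate, and predicts the SE reduction from the single most informative remaining item. All names and default parameter values are hypothetical.

```python
import math


def item_information(theta, a, b):
    """Fisher information of a 2PL item (discrimination a, difficulty b) at theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def should_stop(theta, administered, remaining,
                se_threshold=0.3, min_reduction=0.01):
    """Decide whether to end the CAT at the current trait estimate theta.

    administered, remaining: lists of (a, b) item parameter pairs.
    Stops when the SE is already below se_threshold (SE rule), or when the
    best remaining item is predicted to lower the SE by less than
    min_reduction (simplified PSER-style rule).
    """
    info = sum(item_information(theta, a, b) for a, b in administered)
    if info <= 0.0:
        return False  # nothing administered yet, keep testing
    se = 1.0 / math.sqrt(info)
    if se < se_threshold:
        return True  # SE stopping rule satisfied
    if not remaining:
        return True  # item bank exhausted
    best = max(item_information(theta, a, b) for a, b in remaining)
    predicted_se = 1.0 / math.sqrt(info + best)
    return (se - predicted_se) < min_reduction


# Four well-targeted items give SE = 0.5 at theta = 0, above the threshold.
given = [(2.0, 0.0)] * 4
keep_going = should_stop(0.0, given, [(2.0, 0.0)])   # informative item left
give_up = should_stop(0.0, given, [(1.0, 6.0)])      # only a poorly targeted item left
```

In the second call the only remaining item targets a trait level far from the estimate, so the predicted SE reduction is negligible and the sketch stops even though the SE threshold has not been reached. This is the situation the abstract describes for banks that lack depth.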
557.
Personality development research heavily relies on the comparison of scale means across age. This approach implicitly assumes that the scales are strictly measurement invariant across age. We questioned this assumption by examining whether appropriate personality indicators change over the lifespan. Moreover, we identified which types of items (e.g. dispositions, behaviours, and interests) are particularly prone to age effects. We reanalyzed the German Revised NEO Personality Inventory normative sample (N = 11,724) and applied a genetic algorithm to select short scales that yield acceptable model fit and reliability across locally weighted samples ranging from 16 to 66 years of age. We then examined how the item selection changes across age points and item types. Emotion‐type items seemed to be interchangeable and generally applicable to people of all ages. Specific interests, attitudes, and social effect items—most prevalent within the domains of Extraversion, Agreeableness, and Openness—seemed to be more prone to measurement variations over age. A large proportion of items were systematically discarded by the item‐selection procedure, indicating that, independent of age, many items are problematic measures of the underlying traits. The implications for personality assessment and personality development research are discussed. © 2019 European Association of Personality Psychology
558.
The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants’ tendency to respond to an item was affected by the presence of a grid and variations of answer options. The results of an item response theory analysis were consistent with the hypothesized cognitive processes in alternative item formats. The findings suggest that seemingly subtle variations of item design could substantially affect test-taker cognition and psychometric outcomes, emphasizing the need for investigating item format effects at a fine-grained level.
559.
560.
Constructed-response items have been shown to be appropriate for cognitively diagnostic assessments because students’ problem-solving procedures can be observed, providing direct evidence for making inferences about their proficiency. However, multiple strategies used by students make item scoring and psychometric analyses challenging. This study introduces the so-called two-digit scoring scheme into diagnostic assessments to record both students’ partial credits and their strategies. This study also proposes a diagnostic tree model (DTM) by integrating the cognitive diagnosis models with the tree model to analyse the items scored using the two-digit rubrics. Both convergent and divergent tree structures are considered to accommodate various scoring rules. The MMLE/EM algorithm is used for item parameter estimation of the DTM, and has been shown to provide good parameter recovery under varied conditions in a simulation study. A set of data from the TIMSS 2007 mathematics assessment is analysed to illustrate the use of the two-digit scoring scheme and the DTM.