Similar articles
20 similar articles found (search time: 31 ms)
1.
Two types of global testing procedures for item fit to the Rasch model were evaluated using simulation studies. The first type incorporates three tests based on first-order statistics: van den Wollenberg's Q1 test, Glas's R1 test, and Andersen's LR test. The second type incorporates three tests based on second-order statistics: van den Wollenberg's Q2 test, Glas's R2 test, and a non-parametric test proposed by Ponocny. The Type I error rates and the power against violations of parallel item response curves, unidimensionality, and local independence were analysed in relation to sample size and test length. In general, the outcomes indicate a satisfactory performance of all tests, except the Q2 test, which exhibits an inflated Type I error rate. Further, it was found that both types of tests have power against all three types of model violation. A possible explanation is the interdependencies among the assumptions underlying the model.

2.
Ali N, Chater N, Oaksford M. Cognition, 2011, 119(3): 403-418
In this paper, two experiments are reported investigating the nature of the cognitive representations underlying causal conditional reasoning performance. The predictions of causal and logical interpretations of the conditional diverge sharply when inferences involving pairs of conditionals (such as "if P1 then Q" and "if P2 then Q") are considered. From a causal perspective, the causal direction of these conditionals is critical: are the Pi causes of Q, or symptoms caused by Q? The rich variety of inference patterns can naturally be modelled by Bayesian networks. A pair of causal conditionals where Q is an effect corresponds to a "collider" structure where the two causes (Pi) converge on a common effect. In contrast, a pair of causal conditionals where Q is a cause corresponds to a network where two effects (Pi) diverge from a common cause. Very different predictions are made by fully explicit or initial mental models interpretations. These predictions were tested in two experiments, each of which yielded data most consistent with causal model theory, rather than with mental models.

3.
This simulation study investigates the performance of three test statistics, T1, T2, and T3, used to evaluate structural equation model fit under nonnormal data conditions. T1 is the well-known mean-adjusted statistic of Satorra and Bentler. T2 is the mean-and-variance adjusted statistic of Satterthwaite type, in which the degrees of freedom are manipulated. T3 is a recently proposed version of T2 that does not manipulate the degrees of freedom. Discrepancies between these statistics and their nominal chi-square distribution, in terms of Type I and Type II errors, are investigated. All statistics are shown to be sensitive to increasing kurtosis in the data, with Type I error rates often far off the nominal level. Under excess kurtosis, true models are generally over-rejected by T1 and under-rejected by T2 and T3, which have similar performance in all conditions. Under misspecification there is a loss of power with increasing kurtosis, especially for T2 and T3. The coefficient of variation of the nonzero eigenvalues of a certain matrix is shown to be a reliable indicator of the adequacy of these statistics.

4.
Monotone regression: Continuity and differentiability properties
Least-squares monotone regression has received considerable discussion and use. Consider the residual sum of squares Q obtained from the least-squares monotone regression of y_i on x_i. Treating Q as a function of the y_i, we prove that the gradient ∇Q exists and is continuous everywhere, and is given by a simple formula. (We also discuss the gradient of d = Q^(1/2).) These facts, which can be questioned (Louis Guttman, private communication), are important for the iterative numerical solution of models, such as some kinds of multidimensional scaling, in which monotone regression occurs as a subsidiary element, so that the y_i and hence indirectly Q are functions of other variables.
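The abstract does not state the "simple formula", so the following is only a sketch under an assumption: since Q is the squared distance from y to the closed convex cone of non-decreasing sequences, the usual result for such distances gives ∂Q/∂y_i = 2(y_i - ŷ_i), where ŷ is the monotone fit. The function names are hypothetical, the y values are assumed to be already sorted by x, and the fit uses the standard pool-adjacent-violators procedure rather than anything taken from the paper.

```python
def monotone_regression(y):
    """Least-squares non-decreasing fit to y (assumed already ordered by x),
    computed with the pool-adjacent-violators procedure."""
    blocks = []  # each block is [sum of values, number of values]
    for value in y:
        blocks.append([float(value), 1])
        # Pool the last two blocks while their means violate monotonicity.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    fitted = []
    for total, count in blocks:
        fitted.extend([total / count] * count)
    return fitted


def residual_ss(y):
    """Residual sum of squares Q of the monotone fit."""
    fitted = monotone_regression(y)
    return sum((a - b) ** 2 for a, b in zip(y, fitted))


def gradient_q(y):
    """Assumed closed form for the gradient of Q: 2 * (y - fitted)."""
    fitted = monotone_regression(y)
    return [2.0 * (a - b) for a, b in zip(y, fitted)]


y = [1.0, 3.0, 2.0, 4.0]
print(monotone_regression(y))  # the violating pair (3, 2) is pooled to its mean
print(residual_ss(y), gradient_q(y))
```

A finite-difference check (perturb one y_i slightly and recompute Q) is a quick way to see whether the assumed closed form agrees with the numerical slope.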

5.
A family of scaling corrections aimed at improving the chi-square approximation of goodness-of-fit test statistics in small samples, large models, and nonnormal data was proposed in Satorra and Bentler (1994). For structural equation models, Satorra-Bentler's (SB) scaling corrections are available in standard computer software. Often, however, the interest is not in the overall fit of a model, but in a test of the restrictions that a null model, say M0, implies on a less restricted one, M1. If T0 and T1 denote the goodness-of-fit test statistics associated with M0 and M1, respectively, then typically the difference Td = T0 - T1 is used as a chi-square test statistic with degrees of freedom equal to the difference in the number of independent parameters estimated under the models M0 and M1. As in the case of the goodness-of-fit test, it is of interest to scale the statistic Td in order to improve its chi-square approximation in realistic, that is, nonasymptotic and nonnormal, applications. In a recent paper, Satorra (2000) shows that the difference between two SB scaled test statistics for overall model fit does not yield the correct SB scaled difference test statistic. Satorra developed an expression that permits scaling the difference test statistic, but his formula has some practical limitations, since it requires heavy computations that are not available in standard computer software. The purpose of the present paper is to provide an easy way to compute the scaled difference chi-square statistic from the scaled goodness-of-fit test statistics of models M0 and M1. A Monte Carlo study is provided to illustrate the performance of the competing statistics. This research was supported by the Spanish grants PB96-0300 and BEC2000-0983, and USPHS grants DA00017 and DA01070.
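The abstract does not spell out the computation, so the sketch below uses the form in which the scaled difference test is commonly reported: with unscaled statistics T0 and T1, SB scaled statistics T0s and T1s, and degrees of freedom d0 and d1, take c0 = T0/T0s, c1 = T1/T1s, cd = (d0*c0 - d1*c1)/(d0 - d1), and the scaled difference (T0 - T1)/cd. The function name and the input numbers are hypothetical and serve only as an illustration.

```python
def sb_scaled_difference(t0, t0_scaled, df0, t1, t1_scaled, df1):
    """Scaled chi-square difference for nested models M0 (restricted) and M1.

    t0, t1               : unscaled goodness-of-fit statistics
    t0_scaled, t1_scaled : the corresponding SB scaled statistics
    df0, df1             : degrees of freedom, with df0 > df1
    Returns (scaled difference statistic, difference in degrees of freedom).
    """
    if df0 <= df1:
        raise ValueError("M0 must have more degrees of freedom than M1")
    c0 = t0 / t0_scaled  # scaling correction of M0
    c1 = t1 / t1_scaled  # scaling correction of M1
    df_diff = df0 - df1
    cd = (df0 * c0 - df1 * c1) / df_diff  # scaling correction of the difference
    if cd <= 0:
        # Can occur in small samples; the statistic is then not usable as is.
        raise ValueError("non-positive scaling correction for the difference")
    return (t0 - t1) / cd, df_diff


# Made-up numbers, for illustration only.
stat, df = sb_scaled_difference(60.0, 50.0, 12, 40.0, 32.0, 10)
print(round(stat, 4), df)
```

Note that simply subtracting the two scaled statistics (50 - 32 = 18 here) gives a different number from the scaled difference, which is the point the abstract makes.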

6.
Prior's three-valued modal logic Q was developed as a philosophically interesting modal logic. Thus, we should be able to modify Q as a temporal logic. Although a temporal version of Q was suggested by Prior, the subject has not been fully explored in the literature. In this paper, we develop a three-valued temporal logic Q_t and give its axiomatization and semantics. We also argue that Q_t provides a smooth solution to the problem of future contingents. Presented by Daniele Mundici.

7.
A formal theory of quantity T_Q is presented which is realist, Platonist, and syntactically second-order (while logically elementary), in contrast with the existing formal theories of quantity developed within the theory of measurement, which are empiricist, nominalist, and syntactically first-order (while logically non-elementary). T_Q is shown to be formally and empirically adequate as a theory of quantity, and is argued to be scientifically superior to the existing first-order theories of quantity in that it does not depend upon empirically unsupported assumptions concerning the existence of physical objects (e.g. that any two actual objects have an actual sum). The theory T_Q supports and illustrates a form of naturalistic Platonism, for which claims concerning the existence and properties of universals form part of natural science, and the distinction between accidental generalizations and laws of nature has a basis in the second-order structure of the world.

8.
This paper is a study of certain aspects of restricted ranking, a method intended for use by a panel of m judges evaluating the relative merits of N subjects, candidates for scholarships, awards, etc. Each judge divides the N subjects into R classes so that n_i individuals receive a grade i (i = 1, 2, ..., R; Σ n_i = N), where the R numbers n_i are close to N/R (n_i = N/R when N is divisible by R) and are preassigned and the same for all judges. This method is superior in several respects to other likely alternatives. Under the null hypothesis that all nR = N subjects are of equal merit, four tests of significance are developed. The effectiveness of the method is investigated both theoretically, by means of the asymptotic relative efficiency, and more generally by simulation studies. When the numbers n_i are not restricted to values close to or equal to N/R but instead are given values conforming to a normally distributed pattern, the resulting method is known as the Q-sort, so designated by certain investigators in psychotherapy. The simulation studies reveal that restricted ranking is only slightly inferior to complete ranking and generally superior, in the cases considered, to the Q-sort, although there are likely to be other situations in which the latter is superior. The authors are indebted to Dr. Clyde Kramer of Virginia Polytechnic Institute for bringing the Q-sort ranking technique of psychotherapy to their attention.

9.
A first-order theory $\mathcal{T}$ has the Independence Property provided $\mathcal{T} \vdash (Q)(\Phi \Rightarrow \Phi_1 \vee \dots \vee \Phi_n)$ implies $\mathcal{T} \vdash (Q)(\Phi \Rightarrow \Phi_i)$ for some $i$ whenever $\Phi, \Phi_1, \dots, \Phi_n$ are formulae of a suitable type and $(Q)$ is any quantifier sequence. Variants of this property have been noticed for some time in logic programming and in linear programming.

10.
Item pools or item banks used in most testing situations are inherently multidimensional. This is especially a problem in computerized adaptive testing (CAT), which is driven by item response theory; item response theory requires that the item pool be unidimensional. This series of computer simulations demonstrates how alternative item-presentation controls (content-balancing and “mini-CATs”) may be employed in CAT to estimate ability accurately in spite of the violation of unidimensionality. Averaged, shorter mini-CATs provide the most accurate estimation of ability and ameliorate problems intrinsic to violating the unidimensionality assumption of item response theory.

11.
A first-order uncountably valued logic L_{Q(0,1)} for management of uncertainty is considered. It is obtained from approximation logics L_T of any poset type (T, ≤) (see Rasiowa [17], [18], [19]) by assuming (T, ≤) = (Q(0, 1), ≤), where Q(0, 1) is the set of all rational numbers q such that 0 < q < 1 and ≤ is the arithmetic ordering, by eliminating modal connectives, and by adopting a semantics based on LT-fuzzy sets (see Rasiowa and Cat Ho [20], [21]). Logic L_{Q(0,1)} can be treated as an important case of LT-fuzzy logics (introduced in Rasiowa and Cat Ho [21]) for (T, ≤) = (Q(0, 1), ≤), i.e. as the LQ(0, 1)-fuzzy logic announced in [21] but first examined in this paper. L_{Q(0,1)} deals with vague concepts represented by predicate formulas and applies approximate truth-values, which are certain subsets of Q(0, 1). The set of all approximate truth-values consists of the empty set ∅ and all non-empty subsets s of Q(0, 1) such that if q ∈ s and q′ ≤ q, then q′ ∈ s. The set LQ(0, 1) of all approximate truth-values is uncountable and covers, up to monomorphism, the closed interval [0, 1] of the real line. LQ(0, 1) is a complete set lattice and therefore a pseudo-Boolean (Heyting) algebra. Equipped with some additional operations, it is a basic plain semi-Post algebra of type Q(0, 1) (see Rasiowa and Cat Ho [20]) and is taken as a truth-table for the logic L_{Q(0,1)}. L_{Q(0,1)} can be considered as a modification of Zadeh's fuzzy logic (see Bellman and Zadeh [2] and Zadeh and Kacprzyk, eds. [29]). The aim of this paper is an axiomatization of the logic L_{Q(0,1)} and proofs of the completeness theorem and of the theorem on the existence of LQ(0, 1)-models (i.e. models under the semantics introduced) for consistent theories based on any denumerable set of specific axioms. Proofs apply the theory of plain semi-Post algebras investigated in Cat Ho and Rasiowa [4]. Presented by Cecylia Rauszer.

12.
Subgroup analyses allow us to examine the influence of a categorical moderator on the effect size in meta-analysis. We conducted a simulation study using a dichotomous moderator, and compared the impact of pooled versus separate estimates of the residual between-studies variance on the statistical performance of the Q_B(P) and Q_B(S) tests for subgroup analyses assuming a mixed-effects model. Our results suggested that similar performance can be expected as long as there are at least 20 studies and these are approximately balanced across categories. Conversely, when subgroups were unbalanced, the practical consequences of having heterogeneous residual between-studies variances were more evident, with both tests leading to the wrong statistical conclusion more often than in the conditions with balanced subgroups. A pooled estimate should be preferred for most scenarios, unless the residual between-studies variances are clearly different and there are enough studies in each category to obtain precise separate estimates.

13.
A table is developed and presented to facilitate the computation of the Pearson Q3 (cosine method) estimate of the tetrachoric correlation coefficient. Data are presented concerning the accuracy of Q3 as an estimate of the tetrachoric correlation coefficient, and it is compared with the results obtainable from the Chesire, Saffir, and Thurstone tables for the same four-fold frequency tables. The authors are indebted to Mr. John Scott, Chief of the Test Development Section of the U.S. Civil Service Commission, for his encouragement and to Miss Elaine Ambrifi and Mrs. Elaine Nixon for the large amount of computational work involved in this paper.
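For readers without such a table, the cosine method can be evaluated directly. The sketch below assumes the commonly cited "cosine-pi" form, Q3 = cos(π / (1 + sqrt(ad / bc))), for a four-fold table with concordant cell counts a and d and discordant cell counts b and c. The abstract itself does not print the formula, and the function name and the handling of empty cells are illustrative choices only.

```python
import math


def pearson_q3(a, b, c, d):
    """Cosine-method approximation to the tetrachoric correlation.

    a, d : counts in the concordant cells of the four-fold table
    b, c : counts in the discordant cells
    """
    ad, bc = a * d, b * c
    if ad == 0 and bc == 0:
        raise ValueError("estimate is undefined when both products are zero")
    if bc == 0:
        return 1.0  # limiting value as ad / bc grows without bound
    return math.cos(math.pi / (1.0 + math.sqrt(ad / bc)))


print(pearson_q3(40, 10, 10, 40))  # positive association
print(pearson_q3(10, 40, 40, 10))  # same table with the association reversed
```

Swapping the concordant and discordant cells flips the sign of the estimate, and equal products ad = bc give an estimate of zero up to floating-point error.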

14.
The issue of unidimensionality is dealt with in various research areas in the field of psychology (e.g. conceptual spaces, semantic modeling, psychometrics) and always involves spatial modeling. However, the dimensionality of opposite spatial scales, even basic ones, has not yet been investigated. In this paper we look at whether opposite judgments of height, size, width and length (high/low, large/small, wide/narrow, and long/short) imply underlying unidimensional continua. In three experiments, independent ratings for the eight above-mentioned properties were elicited with participants looking at photographic representations of various objects (Study 1), real-life objects (Study 2) and spatial extensions in object-independent conditions (Study 3). Exploratory and confirmatory factor analysis and Andrich Extended Rating Scale Models were applied in order to determine whether the ratings referred to opposite scales on the same linear continuum. Results from the three studies consistently revealed that this is not the case. A joint analysis of the data showed interesting interactions between the spatial properties analyzed, suggesting a possible explanation for the lack of unidimensionality.

15.

This paper introduces the logic QLETF, a quantified extension of the logic of evidence and truth LETF, together with a corresponding sound and complete first-order non-deterministic valuation semantics. LETF is a paraconsistent and paracomplete sentential logic that extends the logic of first-degree entailment (FDE) with a classicality operator ∘ and a non-classicality operator ∙, dual to each other: while ∘A entails that A behaves classically, ∙A follows from A’s violating some classically valid inferences. The semantics of QLETF combines structures that interpret negated predicates in terms of anti-extensions with first-order non-deterministic valuations, and completeness is obtained through a generalization of Henkin’s method. By providing sound and complete semantics for first-order extensions of FDE, K3, and LP, we show how these tools, which we call here the method of anti-extensions + valuations, can be naturally applied to a number of non-classical logics.


16.
Survey data often contain many variables. Structural equation modeling (SEM) is commonly used in analyzing such data. With the nonnormally distributed data typical in practice, a rescaled statistic Trml proposed by Satorra and Bentler has been recommended in the SEM literature. However, Trml has been shown to be problematic when the sample size N is small and/or the number of variables p is large. There does not exist a reliable test statistic for SEM with small N or large p, especially with nonnormally distributed data. Following the principle of Bartlett correction, this article develops empirical corrections to Trml so that the mean of the empirically corrected statistics approximately equals the degrees of freedom of the nominal chi-square distribution. Results show that empirically corrected statistics control Type I errors reasonably well even when N is smaller than 2p, where Trml may reject the correct model 100% of the time even for normally distributed data. The application of the empirically corrected statistics is illustrated via a real data example.

17.
The standard view about counterfactuals is that a counterfactual (A > C) is true if and only if the A‐worlds most similar to the actual world @ are C‐worlds. I argue that the worlds conception of counterfactuals is wrong. I assume that counterfactuals have non‐trivial truth‐values under physical determinism. I show that the possible‐worlds approach cannot explain many embeddings of the form (P > (Q > R)), which intuitively are perfectly assertable, and which must be true if the contingent falsity of (Q > R) is to be explained. If (P > (Q > R)) has a backtracking reading then the contingent facts that (Q > R) needs to be true in the closest P‐worlds are absent. If (P > (Q > R)) has a forwardtracking reading, then the laws required by (Q > R) to be true in the closest P‐worlds will be absent, because they are violated in those worlds. Solutions like lossy laws or denial of embedding won't work. The only approach to counterfactuals that explains the embedding is a pragmatic metalinguistic approach in which the whole idea that counterfactuals are about a modal reality, be it abstract or concrete, is given up.

18.
The recent surge of interest in cognitive assessment has led to the development of cognitive diagnosis models. Central to many such models is a specification of the Q-matrix, which relates items to latent attributes that have natural interpretations. In practice, the Q-matrix is usually constructed subjectively by the test designers. This could lead to misspecification, which could result in lack of fit of the underlying statistical model. To test for possible misspecification of the Q-matrix, traditional goodness-of-fit tests, such as the chi-square test and the likelihood ratio test, may not be applied straightforwardly due to the large number of possible response patterns. To address this problem, this paper proposes a new statistical method to test the goodness of fit of the Q-matrix, by constructing test statistics that measure the consistency between a provisional Q-matrix and the observed data for a general family of cognitive diagnosis models. Limiting distributions of the test statistics are derived under the null hypothesis and can be used to obtain the test p-values. Simulation studies as well as a real data example are presented to demonstrate the usefulness of the proposed method.
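To make the role of the Q-matrix concrete, here is a small sketch of how a binary Q-matrix maps an attribute profile to noise-free "ideal" responses under a conjunctive (DINA-style) rule, where an item is answered correctly only if every attribute it requires is mastered. This illustrates what a Q-matrix encodes and is not the test proposed in the paper; the matrix, the profiles, and the function name are all hypothetical.

```python
def ideal_responses(q_matrix, attribute_profile):
    """Noise-free item responses under a conjunctive (DINA-style) rule.

    q_matrix          : list of rows, one per item; entry 1 means the item
                        requires that attribute, 0 means it does not
    attribute_profile : list of 0/1 flags, 1 meaning the attribute is mastered
    """
    return [
        int(all(mastered >= required
                for required, mastered in zip(row, attribute_profile)))
        for row in q_matrix
    ]


# Hypothetical test with three items and two attributes.
Q = [
    [1, 0],  # item 1 requires attribute 1 only
    [0, 1],  # item 2 requires attribute 2 only
    [1, 1],  # item 3 requires both attributes
]
print(ideal_responses(Q, [1, 0]))  # masters attribute 1 only
print(ideal_responses(Q, [1, 1]))  # masters both attributes
```

A misspecified row of Q changes which profiles are predicted to answer that item correctly, which is why systematic disagreement between predicted and observed responses can signal misspecification.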

19.
An attempt is made to include the axioms of Mackey for probabilities of experiments in quantum mechanics in the calculus ℵ0 of Łukasiewicz. The resulting calculus Q contains an additional modal sign Q and four modal rules of inference. The proposition Qx is read "x is confirmed". The most specific rule of inference may be read: for comparable observations, implication is equivalent to confirmation of material implication. The semantic truth of Q is established by an interpretation in terms of physical objects obeying the rules of quantum mechanics. The embedding of the usual quantum propositional logic in Q is accomplished. Received 9 June 1976.

20.
Dummett's logic LC quantified, Q-LC, is shown to be characterized by the extended frame ⟨Q+, ≤, D⟩, where Q+ is the set of non-negative rational numbers, ≤ is the numerical relation "less than or equal to", and D is the domain function such that for all v, w ∈ Q+, D_v ≠ ∅ and, if v ≤ w, then D_v ⊆ D_w. Moreover, simple completeness proofs of extensions of Q-LC are given.
