首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Conditioned reinforcement value and choice.   总被引:4,自引:4,他引:0       下载免费PDF全文
The delay-reduction hypothesis of conditioned reinforcement states that the reinforcing value of a food-associated stimulus is determined by the delay to primary reinforcement signaled by the onset of the stimulus relative to the average delay to primary reinforcement in the conditioning situation. In contrast, most contemporary models of conditioned reinforcement strength posit that the reinforcing strength of a stimulus is some simple function only of the delay to primary reinforcement in the presence of stimulus. The delay-reduction hypothesis diverges from other conditioned reinforcement models in that it predicts that a fixed-duration food-paired stimulus will have different reinforcing values depending on the frequency of its presentation. In Experiment 1, pigeons' key pecks were reinforced according to concurrent-chains schedules with variable-interval 10-second and variable-interval 20-second terminal-link schedules. The initial-link schedule preceding the shorter terminal link was always variable-interval 60 seconds, and the initial-link schedule requirement preceding the longer terminal link was varied between 1 second and 60 seconds across conditions. In Experiment 2, the initial-link schedule preceding the longer of two terminal links was varied for each of three groups of pigeons. The terminal links of the concurrent chains for the three groups were variable-interval 10 seconds and 20 seconds, variable-interval 10 seconds and 30 seconds, and variable-interval 30 seconds and 50 seconds. In both experiments, preference for the shorter terminal link was either a bitonic function or an inverse function of the initial-link schedule preceding the longer terminal-link schedule. Consistent with the predictions of the delay-reduction hypothesis, the relative values of the terminal-link stimuli changed as a function of the overall frequency of primary reinforcement. Vaughan's (1985) melioration model, which was shown to be formally similar to Squires and Fantino's (1971) delay-reduction model, can be modified so as to predict these results without changing its underlying assumptions.  相似文献   

2.
Key pecking of 4 pigeons was maintained under a multiple variable-interval 20-s variable-interval 120-s schedule of food reinforcement. When rates of key pecking were stable, a 5-s unsignaled, nonresetting delay to reinforcement separated the first peck after an interval elapsed from reinforcement in both components. Rates of pecking decreased substantially in both components. When rates were stable, the situation was changed such that the peck that began the 5-s delay also changed the color of the keylight for 0.5 s (i.e., the delay was briefly signaled). Rates increased to near-immediate reinforcement levels. In subsequent conditions, delays of 10 and 20 s, still briefly signaled, were tested. Although rates of key pecking during the component with the variable-interval 120-s schedule did not change appreciably across conditions, rates during the variable-interval 20-s component decreased greatly in 1 pigeon at the 10-s delay and decreased in all pigeons at the 20-s delay. In a control condition, the variable-interval 20-s schedule with 20-s delays was changed to a variable-interval 35-s schedule with 5-s delays, thus equating nominal rates of reinforcement. Rates of pecking increased to baseline levels. Rates of pecking, then, depended on the value of the briefly signaled delay relative to the programmed interfood times, rather than on the absolute delay value. These results are discussed in terms of similar findings in the literature on conditioned reinforcement, delayed matching to sample, and classical conditioning.  相似文献   

3.
Two experiments with pigeons examined the relation of the duration of a signal for delay ("delay signal") to rates of key pecking. The first employed a multiple schedule comprised of two components with equal variable-interval 60-s schedules of 27-s delayed food reinforcement. In one component, a short (0.5-s) delay signal, presented immediately following the key peck that began the delay, was increased in duration across phases; in the second component the delay signal initially was equal to the length of the programmed delay (27 s) and was decreased across phases. Response rates prior to delays were an increasing function of delay-signal duration. As the delay signal was decreased in duration, response rates were generally higher than those obtained under identical delay-signal durations as the signal was increased in duration. In Experiment 2 a single variable-interval 60-s schedule of 27-s delayed reinforcement was used. Delay-signal durations were again increased gradually across phases. As in Experiment 1, response rates increased as the delay-signal duration was increased. Following the phase during which the signal lasted the entire delay, shorter delay-signal-duration conditions were introduced abruptly, rather than gradually as in Experiment 1, to determine whether the gradual shortening of the delay signal accounted for the differences observed in response rates under identical delay-signal conditions in Experiment 1. Response rates obtained during the second exposures to the conditions with shorter signals were higher than those observed under identical conditions as the signal duration was increased, as in Experiment 1. In both experiments, rates and patterns of responding during delays varied greatly across subjects and were not systematically related to delay-signal durations. The effects of the delay signal may be related to the signal's role as a discriminative stimulus for adventitiously reinforced intradelay behavior, or the delay signal may have served as a conditioned reinforcer by virtue of the temporal relation between it and presentation of food.  相似文献   

4.
In Experiment 1, three pigeons' key pecking was maintained under a variable-interval 60-s schedule of food reinforcement. A 1-s unsignaled nonresetting delay to reinforcement was then added. Rates decreased and stabilized at values below those observed under immediate-reinforcement conditions. A brief stimulus change (key lit red for 0.5 s) was then arranged to follow immediately the peck that began the delay. Response rates quickly returned to baseline levels. Subsequently, rates near baseline levels were maintained with briefly signaled delays of 3 and 9 s. When a 27-s briefly signaled delay was instituted, response rates decreased to low levels. In Experiment 2, four pigeons' responding was first maintained under a multiple variable-interval 60-s (green key) variable-interval 60-s (red key) schedule. Response rates in both components fell to low levels when a 3-s unsignaled delay was added. In the first component delays were then briefly signaled in the same manner as Experiment 1, and in the second component they were signaled with a change in key color that remained until food was delivered. Response rates increased to near baseline levels in both components, and remained near baseline when the delays in both components were lengthened to 9 s. When delays were lengthened to 27 s, response rates fell to low levels in the briefly signaled delay component for three of four pigeons while remaining at or near baseline in the completely signaled delay component. In Experiment 3, low response rates under a 9-s unsignaled delay to reinforcement (tandem variable-interval 60 s fixed-time 9 s) increased when the delay was briefly signaled. The role of the brief stimulus as conditioned reinforcement may be a function of its temporal relation to food, and thus may be related to the eliciting function of the stimulus.  相似文献   

5.
Pigeons' responding was reinforced on a multiple schedule consisting of two two-link chain schedules presented in regular alternation. Responding in initial links (always variable-interval 60-s) produced a key-color change and access to a terminal link. The terminal link for one chain provided food after a fixed delay (fixed-interval or fixed-time); the terminal link for the other provided food after a variable delay (variable-interval or variable-time). The average duration of the terminal-link schedules was varied across conditions, but in every condition the arithmetic mean of the variable-delay terminal-link schedule was equal to the duration of the fixed delay. Response rates were higher in the initial links of the chains with the variable-delay terminal links. Response-decreasing operations (satiation, extinction) were used after performances reached asymptote. Response rates maintained by access to variable-delay terminal links tended to be more resistant to change than were rates maintained by access to fixed-delay terminal links. These results are consistent with the preference for variable- over fixed-interval terminal links observed with concurrent-chains schedules, suggesting (1) that immediacy of reinforcement influences the conditioned reinforcing potency of access to a terminal link and (2) that choice in concurrent chains and resistance of responding to change may be manifestations of the same effect of reinforcement.  相似文献   

6.
Rats' lever pressing produced sucrose reinforcers on a variable-interval schedule where, in different conditions, the duration of a stimulus presented immediately after reinforcement was either correlated or uncorrelated with the duration of the current interreinforcement interval. Under the baseline schedule, in which no stimulus was presented, the minimum interreinforcement interval was 8 s and the mean postreinforcement pause of each subject approximated this value. Response rates increased slowly over the first 10 to 15 s and then remained roughly constant throughout the remainder of the interval. In both the correlated and uncorrelated conditions, the added stimulus resulted in the postreinforcement pauses lengthening to values in excess of the duration of the preceding stimulus. This resulted in a poststimulus pause which was, in most cases, roughly constant irrespective of the duration of the preceding stimulus, or of the reinforcement contingencies prevailing immediately after stimulus offset. Local response-rate patterns in the uncorrelated conditions were similar to those obtained under the baseline schedule in which no stimulus was presented. However, in the correlated condition local response rates increased across the remainder of the interreinforcer interval. Further, the rate of acceleration was inversely related to the duration of the preceding stimulus. These results show that a correlation between stimulus duration and the ensuing time to reinforcement can control behavior—a type of temporal control not previously reported.  相似文献   

7.
Three experiments examined the influence of a brief stimulus (a light) on the behavior of food-deprived rats whose lever pressing on tandem schedules comprising components of different schedule types resulted in food presentation. In Experiment 1, either a tandem variable-ratio variable-interval or a tandem variable-interval variable-ratio schedule was used. The variable-interval requirement in the tandem variable-ratio variable-interval schedule was yoked to the time taken to complete the variable-ratio component in the tandem variable-interval variable-ratio schedule, and the length of the variable-interval component in the latter schedule was yoked to the variable-ratio component in the former schedule. If a brief stimulus occurred following completion of the first component, then behavior was differentiated in the two components; subjects responded more quickly in the variable-ratio than in the variable-interval component. If the stimulus was removed, then response rate was determined by the nature of the final component. Similar results were obtained in Experiments 2 and 3 with the use of a three-component tandem variable-ratio variable-interval variable-ratio schedule or tandem variable-interval variable-ratio variable-interval schedule. Thus, a brief stimulus that was not explicitly paired with reinforcement engendered behavior typical of the component schedule preceding its presentation.  相似文献   

8.
Behavior of humans in variable-interval schedules of reinforcement   总被引:9,自引:8,他引:1       下载免费PDF全文
During Phase I, human subjects pressed a button for monetary reinforcement in five variable-interval schedules, each of which specified a different frequency of reinforcement. The rate of responding was an increasing, negatively accelerated function of reinforcement frequency; the data conformed closely to Herrnstein's equation. During Phase II, the same five schedules were in operation, but in addition a concurrent variable-interval schedule (B) was introduced, responses on which were always reinforced at the same frequency. Response rate in component A increased while the response rate in B decreased, as a function of the reinforcement frequency in component A. Relative response rates in the two component schedules matched the relative frequencies of reinforcement. Comparing the absolute response rates in component A during Phase I and Phase II it was found that introduction of the concurrent schedule did not affect the value of the theoretical maximum response rate, but did increase the value of the reinforcement frequency needed to obtain any particular submaximal response rate.  相似文献   

9.
Institutionalized retardates were exposed to a multiple variable-interval: extinction schedule of reinforcement in which 5-min periods of variable-interval reinforcement and 5-min periods of extinction were presented in a random order. This schedule was found to generate sequential contrast effects: response rates during variable-interval reinforcement were higher when a variable-interval period followed an extinction period than when it followed another variable-interval period. The rate of responding within a variable-interval period also was affected by the number of extinction periods preceding a variable-interval period. As the number of successive extinction periods that preceded a variable-interval period increased, the rate of responding during that variable-interval period increased. The sequential contrast effects were transient, being most evident during the early sessions and generally disappearing by the tenth session.  相似文献   

10.
College students responded for monetary rewards in two experiments on choice between differentially segmented aperiodic schedules of reinforcement. On a microcomputer, the concurrent chains were simulated as an air-defense video game in which subjects used two radars for detecting and destroying enemy aircraft. To earn more cash-exchangeable points, subjects had to shoot down as many planes as possible within a given period of time. For both experiments, access to one of two radar systems (terminal link) was controlled by a pair of independent concurrent variable-interval 60-s schedules (initial link) with a 4-s changeover delay always in effect. In Experiment 1, the appearance of an enemy aircraft in the terminal link was determined by a variable-interval (15 s or 60 s) schedule or a two-component chained variable-interval schedule of equal duration. Experiment 2 was similar to Experiment 1 except for the segmented schedule, which had three components. Subjects preferred the unsegmented schedule over its segmented counterpart in the conditions with variable-interval 60 s, and preference tended to be more pronounced with more components in the segmented schedule. These findings are compatible with those from previous studies of periodic and aperiodic schedules with pigeons or humans as subjects.  相似文献   

11.
To learn whether prior discrimination training based on one stimulus would block learning about a subsequently added stimulus, rats were first trained to press a bar on a variable-interval schedule of food reinforcement. Occasional stimuli were presented during which no reinforcement was available. Responding became suppressed in the presence of these stimuli. Stimuli could be noise, light, or a compound of noise plus light. A group trained with noise in Phase 1, then trained with the compound in Phase 2, showed less suppression to light in a subsequent test than a group that had the same compound training in Phase 2 but only variable-interval training in Phase 1. This showed that prior training with noise blocked the development of control by light during compound training. Two further groups showed that noise training following compound training did not have the same effect on control by light.  相似文献   

12.
Eight pigeons were exposed to independent concurrent schedules. Concurrent variable-interval 60-second variable-interval 60-second schedules were presented to one group of four subjects. Following baseline training, a limited hold was added to one of the schedules and the duration of the hold was decreased in successive conditions. Concurrent variable-interval 120-second variable-interval 40-second schedules were presented to another group of four subjects. These subjects were first exposed to decreasing durations of a limited hold in the variable-interval 40-second component. After replication of the baseline, a limited hold in the variable-interval 120-second component was decreased in duration. The initial durations of the holds were determined from the subjects' responding in the baseline conditions. A duration was chosen such that approximately 25% of the scheduled reinforcers would be canceled if responding remained unchanged.

Approximate matching of time proportions and reinforcement proportions was observed when the limited hold was added to the variable-interval 60-second schedule and when the limited hold was added to the variable-interval 40-second schedule. Time proportions were less extreme than reinforcement proportions when the limited hold operated in a variable-interval 120-second schedule. Overall reinforcement rates tended to decrease with continued training in concurrent schedules with a limited hold. Absolute deviations from time matching also decreased. The results provide evidence against the principle of reinforcement maximization, and support Herrnstein and Vaughan's (1980) melioration hypothesis.

  相似文献   

13.
In Experiment 1, six naive pigeons were trained on a foraging schedule characterized by different states beginning with a search state in which completion of a fixed-interval on a white key led to a choice state. In the choice state the subject could, by appropriate responding on a fixed ratio of three, either accept or reject the schedule of reinforcement that was offered (either a variable-interval five-second or a variable-interval 20-second). If the subject accepted the schedule, it entered a “handling state” in which the appropriate variable-interval schedule was presented. Completion of the variable-interval schedule produced food. The independent variable was the fixed-interval value in the search state, and the dependent variable was the rate of acceptance of the long variable-interval in the choice state. Experiment 2 was identical except that the search state required completion of a variable-interval, instead of a fixed-interval, schedule. The rate of acceptance of the long variable-interval schedule in both experiments was a direct function of the length of the search state, in accordance with both optimality theory and the delay-reduction hypothesis.  相似文献   

14.
Two experiments with pigeons examined the effects of unsignaled, nonresetting delays of reinforcement on responding maintained by different reinforcement rates. In Experiment 1, 3-s unsignaled delays were introduced into each component of a multiple variable-interval (VI) 15-s VI 90-s VI 540-s schedule. When considered as a proportion of the preceding immediate reinforcement baseline, responding was decreased similarly for the three multiple-schedule components in both the first six and last six sessions of exposure to the delay. In addition, the relation between response rates and reinforcement rates was altered such that both parameters of the single-response version of the matching law (i.e., k and Re) were decreased. Experiment 2 examined the effects of unsignaled delays ranging from 0.5 s to 8.0 s on responding maintained by a multiple VI 20-s VI 120-s schedule of reinforcement. Response rates in both components increased with brief unsignaled delays and decreased with longer delays. As in Experiment 1, response rates as a proportion of baseline were affected similarly for the two components in both the first six and last six sessions of exposure to the delay. Unlike delays imposed between two stimulus events, the effects of delays between responses and reinforcers do not appear to be attenuated when the average time between reinforcers is longer. In addition, the disruptions produced by unsignaled delays appear to be inconsistent with the general finding that responding maintained by higher rates of reinforcement is less resistant to change.  相似文献   

15.
Four birds key pecked on concurrent variable-interval one-minute variable-interval four-minute schedules with a two-second changeover delay. Response rates to the variable-interval one-minute key were then reduced by signaling its reinforcer availability and later by providing its reinforcers independently of responding. Each manipulation increased response rates to the variable-interval four-minute key even though relative reinforcement rates were unchanged. In a final phase, eliminating the variable-interval one-minute key and its schedule produced the highest rates of all to the variable-interval four-minute key. These results show that both reinforcement and response rates to one schedule influence response rates to another schedule. These results join those of Guilkey, Shull, & Brownstein (1975) in failing to replicate Catania (1963). Moreover, they violate the predictions of the equation for simple action (de Villiers & Herrnstein, 1976). In terms of a median-rate measure (reciprocal of the median interresponse time), rates to the variable-interval four-minute key were high when responding was not reduced to the variable-interval one-minute key and were low when it was reduced. This rate difference suggests a process difference between concurrent-schedule procedures that maintain high concurrent response rates versus those that do not. This process difference jeopardizes attempts to integrate single- and concurrent-operant performances within a single formulation.  相似文献   

16.
Four experiments examined the effects of increasing the number of food pellets given to hungry rats for a lever-press response. On a simple variable-interval 60-s schedule, increased number of pellets depressed response rates (Experiment 1). In Experiment 2, the decrease in response rate as a function of increased reinforcement magnitude was demonstrated on a variable-interval 30-s schedule, but enhanced rates of response were obtained with the same increase in reinforcement magnitude on a variable-ratio 30 schedule. In Experiment 3, higher rates of responding were maintained by the component of a concurrent variable-interval 60-s variable-interval 60-s schedule associated with a higher reinforcement magnitude. In Experiment 4, higher rates of response were produced in the component of a multiple variable-interval 60-s variable-interval 60-s schedule associated with the higher reinforcement magnitude. It is suggested that on simple schedules greater reinforcer magnitudes shape the reinforced pattern of responding more effectively than do smaller reinforcement magnitudes. This effect is, however, overridden by another process, such a contrast, when two magnitudes are presented within a single session on two-component schedules.  相似文献   

17.
Preference in concurrent variable-interval fixed-ratio schedules   总被引:10,自引:10,他引:0       下载免费PDF全文
Five pigeons were trained on concurrent variable-interval fixed-ratio schedules in three experiments. Experiment 1 used two variable-interval schedules and one fixed-ratio schedule, and the ratio requirement was varied. Using the generalized matching law, sensitivity to reinforcement was close to 1.0, but performance was biased toward the variable-interval schedule with the lower reinforcement rate. In Experiment 2, which used one variable-interval and one fixed-ratio schedule, the interval schedule was varied. All birds showed sensitivities to reinforcement of less than 1.0 and of less than the values obtained in Experiment 1. The performance was also biased toward the fixed-ratio schedule. Because the generalized matching law could not account for the differences in the data from Experiments 1 and 2, an extension of this law was suggested and successfully tested in Experiment 3. The proposed dual-sensitivity model was also shown to clarify some previously reported results.  相似文献   

18.
Pigeons' pecks were maintained on concurrent variable-interval 1-min variable-interval 3-min schedules of reinforcement, with a changeover delay of 2 sec. When changeover delay was increased successively to 5.0, 7.5, and 12.5 sec (Exp. I) the actual relative rate of reinforcement for the variable-interval 3-min key decreased progressively for two birds, abruptly for two other birds, and the subjects devoted proportionately less of their time and responding to that key. However, the relative performance measures (relative time and relative responding) approximated the actual relative rate of reinforcement, with a maximum discrepancy of 11%, over all changeover delay values investigated. Experiment II attempted to lengthen response-run durations on the variable-interval 3-min key so that they were long enough to meet the changeover delay requirement at each new changeover delay value, by progressively increasing the changeover delay by 0.5-sec increments. With this procedure the actual relative rate of reinforcement approximated more closely the scheduled relative rate as changeover delay increased. As in Exp. I, relative performance measures approximated the actual relative reinforcement rate (maximum discrepancy 17%).  相似文献   

19.
Three experiments are reported in which two pigeons were trained to detect differences in stimulus duration under varying levels of absolute rate of reinforcement. Two red stimuli, differing in duration, were arranged probabilistically on the center key of a three-key chamber. On completion of the center-key duration, the center keylight was extinguished and the two side keys were illuminated white. Correct responses were left-key pecks following the shorter duration and right-key pecks following the longer duration. In Experiment 1, relative rate of reinforcement for correct responses was held constant and absolute rate of reinforcement was varied in seven conditions from continuous reinforcement to a variable-interval 90-second schedule. In Experiment 2, relative rate of reinforcement was manipulated across three different absolute rates of reinforcement (continuous reinforcement, variable-interval 15-second, and variable-interval 45-second). Stimulus discriminability was unaffected by changes in absolute or relative rates of reinforcement. Experiment 3 showed that discriminability was also unaffected by arranging the same consequences (three-second blackout) for unreinforced correct responses and errors.  相似文献   

20.
In two experiments, key-peck responding of pigeons was compared under variable-interval schedules that arranged immediate reinforcement and ones that arranged unsignaled delays of reinforcement. Responses during the nominal unsignaled delay periods had no effect on the reinforcer presentations. In Experiment 1, the unsignaled delays were studied using variable-interval schedules as baselines. Relative to the immediate reinforcement condition, 0.5-s unsignaled delays decreased the duration of the reinforced interresponse times and increased the overall frequency of short (<0.5-s) interresponse times. Longer, 5.0-s unsignaled delays increased the duration of the reinforced interresponse times and decreased the overall frequency of the short interresponse times. In Experiment 2, similar effects to those of Experiment 1 were obtained when the 0.5-s unsignaled delays were imposed upon a baseline schedule that explicitly arranged reinforcement of short interresponse times and therefore already generated a large number of short interresponse times. The results support earlier suggestions that the unsignaled 0.5-s delays change the functional response unit from a single key peck to a multiple key-peck unit. These findings are discussed in terms of the mechanisms by which contingencies control response structure in the absence of specific structural requirements.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号