[From Bill Powers (951212.0600 MST)]
Bruce Abbott (951211.1830 EST) --
Reinforcement as defined in EAB is an empirical fact, not a theory.
On reflection, this is what you might call a deduction from a set of
hypothetico-empirical facts. Hypothetico-empirical facts are facts that
make so much sense within a frame of reference, or are so useful in
supporting a conclusion that simply has to be correct, that there is no
actual need to observe or test them.
An operant is a member of a class of activities having a common
consequence (e.g., depressing a lever to the point of switch-
closure). When this consequence is linked to another (the later is
made "contingent" on the former), the frequency (or some other
aspect, depending on the contingency) of the operant is sometimes
observed to increase as a result. If so, the contingent
consequence of the operant has been demonstrated to reinforce the
operant.
It stands to reason that any consequence could be produced by a number
of different actions (hypothesis). Therefore if key-pressing occurs, it
must have been caused by one of the many actions that can cause
keypressing. If many actions COULD cause keypressing, then many actions
DO, on different occasions, cause keypressing (hypothetico-empirical
fact). There is an internal process that tends to emit such acts, but
the tendency to emit any one of the acts is normally low.
If reinforcement is made contingent on keypressing, then whichever act
currently causes keypressing also causes appearance of a reinforcer, and
the reinforcer causes the tendency to emit that act to increase, a
result called reinforcement. Since the appearance of the reinforcer
COULD HAVE CAUSED the increase in tendency to produce the act, and since
we did observe an increase in the frequency of that act, the reinforcer
DID CAUSE the increase in the frequency of the act. It is not necessary
to observe the causal link, or test to see if it exists, because the
causal link is a hypothetico-empirical fact. Hypothetico-empirical facts
do not need proof.
Now let's try a different set of hypothetico-empirical facts. As can be
observed, when a contingency is established between a consequence of
action and the appearance of a food pellet, nothing happens at first. In
fact, nothing can happen until that consequence occurs in the right
pattern (number of repetitions, timing, etc.) because the contingency by
itself can't cause the reinforcer to appear. And the critical
consequence can't occur until one of the actions that can cause it has
occurred.
It is possible that the tendencies to produce all of the actions that
can cause the consequence are mostly suppressed. Since we observe these
actions occurring only at low rates, it stands to reason that they must
be suppressed by something, which we can call a suppressor of action.
When one of those actions does occur and results in appearance of a
reinforcer, it is possible that the reinforcer acts on the suppressor of
action and decreases its suppression of the act that has just occurred.
Thus the reinforcement does not cause an increase in the tendency to
emit the act, but removes some of the suppression of that act by
weakening the connection between the suppressor of action and that act.
Reinforcement is therefore the wrong name for the effect of the
rewarding object. The correct name is "desuppressor." Since all of the
other actions that could have produced the desuppressor when the
contingency is in effect have been almost totally suppressed, the action
that does produce a desuppressor is suppressed by a smaller amount and
tends to occur more often. This creates even more desuppression, and
that action quickly becomes predominant, while the others that produce
the same effect remain suppressed. So we can say that the contingent
consequence of the operant has been demonstrated to desuppress the
operant.
···
----------------------------------------
There is another way to describe the phenomenon which doesn't assert
either a positive or a negative causal influence of the food pellet, and
relies only on observable causal links.
With the contingency absent, no action can cause a food pellet to
appear. So if there is an increase in the frequency of one of the
actions that can depress the lever, there will be no effect on delivery
of food pellets.
With the contingency present, if (and only if) there is an increase in
the frequency of one of the actions that can produce the critical
consequence, the frequency of delivery of food pellets necessarily
increases because of the contingency, which is an observable causal
path. If the frequency of the behavior that creates the consequence
continues to rise, the frequency of delivery of food pellets also
continues to rise as a result. If that behavior rises to a final level
and remains constant at that level, the rate of delivery of food pellets
also rises to a final level and remains constant at that level. If the
behavior rate fluctuates, the food delivery rate also fluctuates, always
in the way described by the form of the contingency.
There is one more general observation we can make. When the contingency
is not in effect, it is seldom observed that the operant in question is
produced significantly more often than other operants (other actions
with a different common effect). With the contingency in effect, the
same operant is often observed to occur at a higher frequency than
ordinarily seen. So the presence of the contingency is associated with
increases of the frequency of occurrance of a particular operant, and
thus also with effects that depend on the operant such as delivery of
food pellets. However, merely establishing a contingency is not enough
to predict an increase in frequency of the operant, so the contingency
itself has no causal relation to the frequency of the operant.
In this account, we use only true empirical facts, actual observations,
and no hypothetico-empirical facts. We propose no causal links that we
can't observe, while we take into account the one causal link that we
can observe in complete detail, the contingency. We have said everything
non-theoretical that can be said about the observations.
There is one proposition in particular that does not appear in this
factual account. Nowhere is it proposed that occurrances of food pellets
have any effect on the frequency of occurrance of the operant act. On
the contrary, the changes in frequency of the food deliveries are
treated as strictly dependent on changes in frequency of the operant
acts, as determined by the form of the contingency.
---------------------------------
Given this factual account of the observations, there is, of course,
nothing to prevent our proposing mechanisms that might exist inside the
organism which explain the changes in frequency of the operant act. We
might propose, for example, a positive feedback connection inside the
organism, such that when the food delivery rate increases, the average
rate of production of the operant act increases, thus further increasing
the food delivery rate. Or we could propose a negative feedback
connection inside the organism, with the frequency of the operant act
depending on the difference between delivered food rate and some
reference rate defined by a system constant. We could propose a system
that detects both actions and food deliveries, and goes through a
systematic or stochastic search process to find the act that produces
the greatest yield of food.
There is, in short, any number of theories that could be offered to
explain the changes in operant rates. With several theories of equal-
appearing plausibility, we then have to think of experimental
manipulations that should have different effects according to each
theory, and use them to eliminate the theories that predict incorrectly.
---------------------------------
You claim that reinforcement is an empirical fact. It is not. It is a
conclusion drawn from a mix of actual observations and imagined or
hypothetico-empirical facts. When you present a description based only
on observable facts, the causal role of reinforcers never appears.
-----------------------------------------------------------------------
Best,
Bill P.