[From Bill Powers (2000.12.16.0345 MST)]
Bruce Abbott (2000.12.05.2215 EST)--
... observations reveal the following:
I want to demonstrate to you that observations do no such thing. You
interpret the observations in a particular way. They can be interpreted in
other ways you do not mention. I will show another way for each point you
make, not because I defend it, but just to show that more than one
interpretation is possible.
1. Initially, when a bit of the rat's behavior (depressing the lever) did
_not_ produce a food pellet, the rate at which that act occurred remained low.
A. Initially, when pressing the lever did not produce food, the rat moved
about the cage, sniffing, licking, and scratching at things including the
lever. The rate of lever depressions remained low because most of the rat's
actions that could depress the lever were being applied elsewhere.
2. When it was then arranged that the same bit of behavior now produced a
food pellet each time it occurred, the rate at which that act occurred
increased over the rate observed in (1).
B. When depression of the lever produced a food pellet each time it
occurred, the animal would pause and eat the food, and spend slightly more
time in the vicinity of the lever before moving on. As the animal spent
increasing amounts of time in the vicinity of the lever, the rate at which
the lever was depressed increased (over the rate observed in (1)), which
caused the rate of food delivery to increase.
3. When it was then arranged that the same bit of behavior once again did
_not_ produce a food pellet, the rate at which that act occurred diminished
to the level observed under (1).
C. When pressing the lever no longer produced food, the animal changed its
behavior, first to an increase in lever pressing (you left that out), and
then to an increasingly variable pattern until it was back to the original
pattern of sniffing around the cage and scratching and licking at things.
After the initial rise in lever pressing, the decreasing amount of time the
animal spent at the lever resulted in a lower and lower number of presses
until the lever was pressed only when the animal happened to be moving
around near it.
Because nothing else changed except whether lever-pressing produced a pellet
or not, and because these same changes occur reliably every time these tests
are performed, it can be inferred with a high degree of confidence that it
is the delivery of the food pellet as a consequence of a lever-press that is
responsible for the observed increase in the rate of lever-pressing in (2),
and that the loss of this consequence is responsible for the observed
decrease in the rate of lever-pressing in (3). This can be stated with
confidence because nothing else has changed between conditions that could
explain the change in rate of lever-pressing. We can thus conclude that the
increased rate of lever-pressing in (2) relative to (1) and (3) is due to
the fact that in (2) the lever-press produces pellets, whereas in (1) and
(3) it does not.
You can state that the rate of lever pressing in (2) is higher than in (1),
but you can't say it is higher _because_ pressing produces food pellets. In
an exactly parallel way, you can say that the speed of a car is higher when
pressing the accelerator feeds gas to the engine than when pressing it
fails to feed gas to the engine. But we would never say that the fact that
the accelerator works properly is what makes the car accelerate.
In fact, given the availability of food via lever presses, the rate of
pressing can still increase and decrease without any further change in the
availability, increasing and decreasing the rate of food delivery and
consumption. This shows that the availability of food via the lever is a
factor in the changes of behavior, but only an enabling factor, not a
cause: to know whether food is available is necessary, but not sufficient,
to predict subsequent behavior (a "cause" is both necessary and sufficient
to produce its effect, which is why physics does not deal with causes: they
are extremely rare). This is analogous to the way having gas in the gas
tank is necessary for a car to be driven away, but not sufficient to
account for its being driven away. The car is not driven away _because_
there is gas in the tank. A rat does not press a lever _because_ doing so
would produce food. The car cannot be driven away if there is no gas in the
tank, and the rat cannot produce food if the lever is the only means of
producing food and is not working. But given the gas and the contingency,
the behavior is not yet accounted for.
The basic difficulty here is in treating a contingency, which is the
_possibility_ for an act to produce an effect, as if it were a causal
variable. But the possibility of an effect is not the same as occurrance of
an effect. Before the effect can be produced, the possibility of producing
it must exist, but that is not enough. The action that produces the effect,
given the possibility, must also occur. And even that is not enough: the
system must be so organized that the _lack_ of the effect motivates an
attempt to produce it.
This effect of the contingency between
lever-pressing and food-pellet delivery on response rate is termed
"reinforcement" and if delivering food-pellets for lever-presses has this
effect on lever-presses, then the pellets are said to serve as
"reinforcers."
I hope I have shown that this is an insufficient description. The
contingency makes it possible for an action to have a certain effect, but
is insufficient to account for production of the action. It is not
"delivering food for lever presses" that has the effect of increasing lever
presses; it is the increase in lever presses that makes the contingency
into an actual increase in production of food. It is the driver of the car,
not the gas in the tank, that puts the car into motion and drives it away.
What we actually observe is that when the apparatus is connected so that a
lever press, if it occurred, would produce a food pellet, both lever
pressing and food pellet production increase. The food pellet production
increases _because_ the lever pressing has increased; of that there is no
doubt. Why the lever pressing increases can't be explained without a model.
I'm only speaking for myself. If there are EABers out there who think that
reinforcement involves something else (e.g., linear cause-effect), then I
say nail 'em. But first we need a model that demonstrates how a living
control-system generates those changes in behavior referred to as
"reinforcement," "extinction," etc. when exposed to the changes in
contingency under which those effects are demonstrated. Thus far we haven't
got one.
Well, then, let's get one. Offline, you and I have already made a start
with the "furnace" models. In my model I included a "search" function
which, right now, merely sets a "failed" flag false to indicate success. We
could expand that to include an actual search process which is terminated
when it results in production of a rise in temperature. Of course you would
want it to terminate when a command produced a flash, but the same
principle would apply. This model is becoming a representation of the basic
situation outlined in this post. When we have it working well enough, we
can explore the appropriateness of the way EABers would interpret the
visible part of its behavior.
When that happens, we can also go public with it and perhaps, for a change,
know what we are talking about.
Best,
Bill P.