LAw of Effect and PCT

[From Bill Powers (941210.0735 MST)]

Martin Taylor (941209.1210) --

Appreciate your support re reinforcement as an explanatory construct.
However, there must be _something_ I can argue with you about...

Your discussion of reorganization and its relationship to apparent
reinforcement was inspirational. In fact I can't find anything to argue
with, so let me add some arguments on your side.

In a perceptual control UNIT (a single ECU with one scalar perceptual
signal and one scalar output signal) there CANNOT be different acts to
bring about the same perceptual consequence. All there can be is a
greater or lesser degree of the same act.

This is true, but it involves a rather advanced concept of what
"different acts" means. In operant conditioning, I believe it is thought
that different degrees of reinforcement are what cause different degrees
of the same act: the rate of behavior is said to be maintained by the
rate of reinforcement. Thus when the scheduled ratio is changed, the
result is said to be reinforcment of a different rate of behaving, and
that is why the behavior rate is different under different schedules.

Just what do we mean by "the same act?" Suppose a horse has been trained
to pull a cart at a steady speed. The Law of Effect would say that the
horse has learned that pulling at a certain speed when commanded to walk
has produced rewards in the past, and so when commanded to walk again it
should pull at the same speed to get the same reward. The consequence of
getting the reward has selected the response of walking when the command
"walk" is heard (or a slight tug of the reins is felt, etc.). Have I
stated the three-term contingency correctly?

But now the horse trained on level ground is taken out into the country,
where it is commanded to walk while the cart is going uphill. Now the
"act" of walking at a certain speed requires a considerable increase in
the muscle efforts that propel the horse forward. Nevertheless, the
well-trained horse emits the actions that result in moving at the
prescribed speed up the hill. It increases the muscle forces just enough
to counteract the disturbance due to the tendency of the cart to roll
back down the hill. It has never been rewarded for doing this, but it
does it anyway.

And now the horse reaches the top of the hill and starts down the other
side. Being well-trained, it continues to pull the cart at the same
speed when told to walk. In order to do this, however, it must
completely reverse all its muscle forces, because now the cart is
tending to go down the hill faster and faster and must be held back. So
the horse proceeds down the hill at the same pace, while exerting forces
in the reverse direction.

So -- is the horse producing "the same act" throughout this scenario? If
we view walking speed as the measure of the act, then the answer is yes.
If we look at the means by which this walking speed is maintained, the
answer has to be no, because surely pulling forward and pushing backward
are different acts, shifting the greatest efforts to different muscles
and reversing the effect on the environment.

This is an elaboration of your comments on microscopic and macroscopic
ways of seeing behavior. But I think that we need to focus even on the
behavior of a single ECU, because as far as I can see, different degrees
and directions of the same output effect are treated as different acts
under the Law of Effect, as I said above with respect to schedules of
reinforcement.

Behind the law of effect is a concept of causality in the environment:

1. Behavior -->physical laws--> Consequence

But as a general model, this is incorrect. A more correct model would be

             Behavior -->physical laws--> Consequence
                                               ^
2. |
                                     Variable Disturbances

Consequences are not produced by behavior alone. They are a joint
function of influences from behavior and influences from independent
processes in the environment. An even more correct general model would
be

             Behavior -->physical laws--> Consequence
                               ^ ^
                               > >
3. | Variable Disturbances
                               >
                      Variable disturbances

With model 3 in mind, we can see the mistake in supposing that an
organism can -- or should -- learn a particular behavior that will
always produce a particular consequence. If the disturbances arise from
invisible sources, as they most often do, they become known only because
the SAME degree of behavior produces, or seems to produce, a DIFFERENT
degree of the consequence.

Under conventional concepts of causality in behavior, it is impossible
to explain how a constant consequence can be created under the above
conditions. The Law of Effect can't apply, because if the same
consequence is to be repeated, a different degree of behavior must occur
every time, and it must be exactly that degree that will offset, moment
by moment, the effects of the variable disturbances. There can be no
association between a given amount of behavior and a given amount of the
consequence.

I have noticed, incidentally, that in some engineering treatments of
optimal control, the above situation is only partially recognized. The
variable disturbances are treated as statistical variables, in terms of
their means, variances, and spectral distributions, but there seems to
be no direct treatment of the case in which the behavior actually does
vary so as to offset the detailed changes in the variable disturbances,
leaving the consequence, in fact, essentially constant.

So persuasive is the concept of the first model above that
experimenters, without even thinking much about it, almost always set up
their experiments to exclude the variable disturbances shown in the
second and third models. At the very least they try to make sure that
nothing extraneous can directly alter the consequence independently of
behavior and the physical laws connecting the behavior to the
consequence, as in the second model. So the experimental conditions are
normally chosen to support the concept that behavior is directly
connected to its consequences.

When variable disturbances are successfully excluded, so that Model 1
correctly describes the situation, it is possible to proposed models
that explain the observations. The Law of Effect is such a model. It
proposes that the consequence has a causal effect on the way the
organism behaves in response to stimuli. The result of this causal
effect is to increase the probability of the behavior that previously
produced the same consequence. Since nothing but behavior can influence
the consequence in Model 1 above, this explanation will appear to be
correct -- that is, it will fit the observations.

But models must be tested under more than one condition to see if they
continue to work. When we introduce variable disturbances, it is no
longer true that the organism can produce the same consequence by
repeating the same behavior that previously accompanied it. Under the
Law of Effect, we would have to predict that with the disturbances
acting, the best the organism could learn to do would be to produce the
behavior that had the right effect _on the average_. But of course that
behavior would sometimes add to the effects of large disturbances and
sometimes small disturbances, sometimes disturbances in one direction
and other times disturbances in the opposite direction. So the
prediction would be that the consequence would vary according to the
size and direction of the disturbances, with only its mean value, over
many trials, being reliably related to the behavior.

What actually happens when disturbances are introduced is that the
consequence remains essentially the same, but the behavior that seems to
produce it becomes highly variable. That result is so unexpected, so
impossible from the point of view of Model 1 and the explanations of
behavior built on Model 1, that behaviorists seem unable to accept that
it actually happens.

The fact that it DOES happen tells us that we need a different model of
behavior. The Law of Effect, while plausible under certain conditions,
is not a general law, and in fact predicts incorrectly when disturbances
are allowed back into the picture.

ยทยทยท

-----------------------------------------------------------------------
Best.

Bill P.

[Martin Taylor 941212 16:50]

Bill Powers (941210.0735 MST)

Your discussion of reorganization and its relationship to apparent
reinforcement was inspirational. In fact I can't find anything to argue
with, so let me add some arguments on your side.

Thank you. Maybe jet lag is not such a bad thing, after all.

Now let me reciprocate, by adding further to your side of the discussion.

Behind the law of effect is a concept of causality in the environment:

1. Behavior -->physical laws--> Consequence

But as a general model, this is incorrect. A more correct model would be

            Behavior -->physical laws--> Consequence
                                              ^
2. |
                                    Variable Disturbances

Consequences are not produced by behavior alone. They are a joint
function of influences from behavior and influences from independent
processes in the environment. An even more correct general model would
be

            Behavior -->physical laws--> Consequence
                              ^ ^
                              > >
3. | Variable Disturbances
                              >
                     Variable disturbances

With model 3 in mind, we can see the mistake in supposing that an
organism can -- or should -- learn a particular behavior that will
always produce a particular consequence. If the disturbances arise from
invisible sources, as they most often do, they become known only because
the SAME degree of behavior produces, or seems to produce, a DIFFERENT
degree of the consequence.

Let's go further than saying the same DEGREE, and note that the "variable
disturbances" applied to the physical laws can produce a different KIND
of consequence (as seen by the external analyst who has more perceptual
input functions than just the one). For example, when we are walking,
certain muscular forces result, through the physical events in the world,
in propelling us forward; but when (as today), the surface is icy and
irregular, the same forces result in the back of our heads banging the
ground (no--this didn't happen to me, but it nearly did). The consequence
is of a different kind. Above some threshold, less force, in this situation,
means more propulsion, more force means falling down.

This happens even when the source of the disturbance is visible (to a quite
different ECU), if the effect of the disturbance is not correctly pereived.
But this kind of thing is guaranteed to happen when the source is invisible
(as Bill specifies), though control is normally maintained quite happily
if the changes in the physical laws are slow enough for the particular
control system in question.

Under conventional concepts of causality in behavior, it is impossible
to explain how a constant consequence can be created under the above
conditions. The Law of Effect can't apply, because if the same
consequence is to be repeated, a different degree of behavior must occur
every time, and it must be exactly that degree that will offset, moment
by moment, the effects of the variable disturbances.

Is this so, according to the APPEARANCE of the Law of Effect? Wouldn't the
Law say that in the context of icy ground, the consequence of more force
is to be avoided, so less would be applied? If the walker cannot see the
ice, PCT does not guarantee control that prevents falling--as far as I can
see, it is strictly a question of perceptual bandwidth as to whether the
control system falls or not--and if the walker can see the ice, the Law of
Effect as propounded by Bruce Abbott seems to say that the walking forces
would be reduced, avoiding the fall consequent on the use of normal forces.

More generally, control will occur and be maintained without higher-level
setting of reference levels based on perception of "icy terrain," provided
that the bandwidths of the two kinds of variable disturbances are low enough
in comparison to the transport lag of the control loop. Control is
maintained, in other words, with NO perception of the source of the variable
disturbance, and (as Rick would have it) quite possibly NO perception that
there even IS a variable disturbance. This cannot (as I understand it)
happen with the Law of Effect. If the perceptual context is the same, the
outputs should be the same but ineffective under the Law of (In)Effect, and
THAT is what Model 3 above seems intended to show.

But if the information rate of either kind of disturbance is too high, it
control is lost--we fall on the icy bumps.

Incidentally, I may be wrong, but in all the skimming of the Law of Effect
discussion, I have not read Bruce Abbott as claiming that it is a correct,
or even reasonable, explanation of behaviour. I was quite surprised to
read Rick's comments that seemed to indicate that he saw Bruce as other
than a Devil's Advocate. If you are going to make a Saint, you HAVE to
have the strongest possible Devilment position to falsify. It's no good
setting up an easily destroyed straw man. I think I made this argument
a few times over the last few years: One of the key things that PCT
proponents should do is to take the STRONGEST non-PCT description/explanation
of phenomena, and to show how that position fails when PCT continues to
work. Bruce, it seems to me, has been trying to make the strongest case
he can for the Law of Effect, without expecting that it will be shown to
be as effective as PCT (at least, he said so, several times). This seems
to me to be the best way of reinforcing PCT.

Appreciate your support re reinforcement as an explanatory construct.
However, there must be _something_ I can argue with you about...

Will this do?

Martin