Reorganization?

[From Bruce Abbott (941207.1540 EST)]

It's good to be back among the living. I trust that everyone has taken the
opportunity to rest up--you're gonna need it. We have some serious issues to
tackle. Take this one, for starters....

Bill Powers (941206.1205 MST)

This is why we were so impressed by the E. coli phenomenon: it is a
direct demonstration of the principle of reorganization.

Am I delirious with fever? Did Bill Powers actually state in the above that
an innately-organized, pre-programmed, negative feedback loop illustrates
REORGANIZATION? I am totally mystified.

According to B:CP, reorganization is a different process altogether, one that
results in the CREATION of a new control system. I quote:

     Reorganization is a process akin to rewiring or microprogramming a
     computer so that those operations it can perform are changed.
     Reorganization alters behavior, but does not produce _specific
     behaviors_. It changes the parameters of behavior, not the content. . .
     . Reorganization is an operation _on_ a system, not _by_ a system.
     [B:CP, p. 179]

Is reorganization now to include the ordinary operation of a control system in
response to perceptual error? Please explain...

When E. coli
goes in the right direction, it does NOTHING to change the direction.
Only while it is going in the wrong direction is there any process that
alters its direction. The right direction is defined by default: it is
that direction that leads to no change in direction.

Bruce's model actually illustrated that point, more or less by accident.
The effective logical path that resulted in the correct changes in
probabilities was the one that "punished" swimming in the wrong
direction.

You are ignoring the other "half" of the discrimination. When e. coli was
swimming the wrong way (dNut negative), the effective logical path that
resulted in correct changes in probability was the one that "reinforced"
swimming in the right direction. The effects that occur when swimming in the
"right" and "wrong" direction are perfectly symmetrical and thus actually
contradict your point.

Furthermore, it was EXPERIENCE with the consequences of a tumble under these
two conditions that determined the tumble rates under the two situations, not
a pre-programmed servomechanism qualitatively no different from the spinal
reflex. If the term "reorganization" is to have any worthwhile meaning, it
must be restricted to the situation where control is GAINED over a perceptual
variable, not where such control is merely executed. My "learning" model e.
coli shows reorganization by your (old) definition; your original (static) e.
coli most definitely does not.

The concept of reorganization gives us a new understanding of "reward."
It is not that getting a reward causes the behavior that leads to the
reward; it is that lack of the rewarding thing creates a state of
reorganization that will cease only when the lack is corrected.

Now this seems more reasonable, although it has nothing to do with the earlier
discussion of e. coli behavior. However, it needs to be restated slightly.
It is the fact that a behavior PRODUCES the reward that leads to the
repetition of the behavior. However, "the behavior" is NOT to be defined as a
particular set of muscle contractions, but rather as a goal-directed activity
mediated by several layers of perceptual control system (e.g., pulling on a
string). And whether some consequence of behavior serves to "reward" or
"punish" behavior depends, of course, on the effect of that consequence on
controlled perceptual variables, relative to their reference levels.

Which brings up the question of "incentives." Rats will lever-press for non-
nutritive sucrose solution, even though this substance has no effect on an
intrinsic error signal for nutritional level. Does lack of sucrose create a
state of reorganization that will cease only then the lack is corrected? If
not, why do rats learn to press levers in order to get it?

Regards,

Bruce

Tom Bourbon [941208.1107]

[From Bruce Abbott (941207.1540 EST)]

It's good to be back among the living. I trust that everyone has taken the
opportunity to rest up--you're gonna need it. We have some serious issues to
tackle. Take this one, for starters....

he said, feverishly. :wink:

Bill Powers (941206.1205 MST)

This is why we were so impressed by the E. coli phenomenon: it is a
direct demonstration of the principle of reorganization.

Am I delirious with fever? Did Bill Powers actually state in the above that
an innately-organized, pre-programmed, negative feedback loop illustrates
REORGANIZATION? I am totally mystified.

According to B:CP, reorganization is a different process altogether, one that
results in the CREATION of a new control system. . . .

Bruce, my understanding of Bill's comment was different from yours. I took
him to be saying that the organization of _the PCT model_ for E. coli
locomotion can serve as a theoretical mechanism that drives reorganization
in the manner described by Bill way back at the beginnings of PCT. Back
then, he had an idea of what reorganization should look like: it results in
new reference signals, or in changes in the parameters of existing
perceptual input functions (PIFs) or of output functions (OFs), or in the
creation of new PIFs; it converges on good control (p = p*) but has no plan
or rule for how to get to that stipulated end there -- arrives there by way
of random tinkering with parameters; and so on. Back then, Bill didn't have
a working model for that process. The model for E. coli locomotion looks
like that kind of mechanism. Bill will correct my interpretation, if it is
wrong, but I believe something like this history was what he had in mind
when he wrote the message above. Maybe your recently variable reference
signals for core temperature prevented you from thinking of that
interpretation. :wink:
. . .

Bill:

When E. coli
goes in the right direction, it does NOTHING to change the direction.
Only while it is going in the wrong direction is there any process that
alters its direction. The right direction is defined by default: it is
that direction that leads to no change in direction.

Bruce's model actually illustrated that point, more or less by accident.
The effective logical path that resulted in the correct changes in
probabilities was the one that "punished" swimming in the wrong
direction.

Bruce:

You are ignoring the other "half" of the discrimination. When e. coli was
swimming the wrong way (dNut negative), the effective logical path that
resulted in correct changes in probability was the one that "reinforced"
swimming in the right direction. The effects that occur when swimming in the
"right" and "wrong" direction are perfectly symmetrical and thus actually
contradict your point.

There they are again -- those direct changes in probabilities of behaviors.
I'm not trying to be nasty. I'm simply emphasizing the point that in your
models of E. coli locomotion, the consequences of various actions act
directly to change a descriptive statistic -- the probabilities of various
actions; in the pre-Bruce PCT model for E. coli locomotion, the
probabilities of various actions were uncontrolled, incidental side effects
of changes in the delay between tumbles. The organization of the
un-Abbotted E. coli model did not control the probabilities of tumbles or
non-tumbles.

No, I was not asleep. Yes, I know you are trying to model learning, as you
have come to know and love it aftr years in EAB. All I am saying is that
I see a very big difference between models in which the superficial
descriptive statistical properties of behavior simply "fall out" from the
behavior of the model, and those in which the model "succeeds" in large
part because the modeler directly alters the statistical properties of the
model's actions.

I'll be a happy guy when I see a model of a control system that learns
without benefit of arbitrary adjustments to the descriptive statistical
features of its behavior. I wish you success in your attempt do create that
model -- it will be a wonderful thing.

Later,

Tom