Info theory

_Martin_Taylor · December 22, 1992, 1:32am

[Martin Taylor 921221 20:00]
(Bill Powers 921221.1500)

Bill, your posting just arrived, as I was on the way out to go home, so this
must be short (our phone lines have got very bad recently, and I can no
longer try to do it from home).

As usual, you are an acute observe, especially in relation to the question
of model types. But I really do think that I can use information theory
to identify that the PCT structure was correct, at least feasible. When
you put in the appropriate perceptual input functions, gains, and delays,
you get the same model that you and/or Tom would produce without information
theory, so it should make the same predictions in any specific case. So
why should I try to do better, when I anticipate the result being identity?

What I do want to do is to get some deductions about the structure and its
behaviour that are not obvious, even though they may (should) agree with
what you have found to work in practice. I find that it makes much more
sense to me to have a good theoretical underpinning that allows me to
generalize from a practical result than just to see the practical result
and wonder what might happen if some little thing were changed.

Martin, the difference that Tom is talking about, I believe, is
between a descriptive model and a generative model. A descriptive
model provides a general picture of which a specific behavior is
only one example. A generative model actually generates
(simulated) behavior for direct point-by-point comparison with
real behavior.

Yes, I understand. I have a bit of a problem with limiting myself to
either kind of model exclusively, though, and it is a problem that has
been with me since undergraduate days. If a generative model does
predict reality well, without excessive use of parameters, then it
produces strong evidence of the plausibility of the theory that
underlies it. But if the generative model fails, it does not give
evidence against the underlying theory, because the failure could
have been only in the choice of parameters. So the generative model
is a one-sided kind of support.

On the other hand, the theory by itself is only plausible unless it
can be shown to predict reality, and that can be done only through
generative models or mathematical analysis. In the case of your and
Tom's models, the prediction is very good. So I see little point in
trying to create generative models from what I see as a theoretical
support for the same structure on which your models are based. It
is conceivable that in some situations the information-theoretic
approach might produce numerical statements of more precision or using
fewer parameters, but those situations probably will not be easy to
find. They will be at higher levels in the hierarchy, most probably.
I'm not even going to look for them at present, at least not until
I can see some problems with your practical approach that are resolved
in the information-theoretic approach.

In the end, the generative model will explain behavior, while
descriptive models show that the behavior thus explained and the
structure of the successful model are consistent with general
laws.

Wise.

I obviously must either drop the information-theory thread or take it
much further back to first principles. In response to what Tom seemed
to want in his revised version of his challenge, I anticipated doing
just that. But perhaps it would help if I contradicted one of your
assumptions:

You seem to be taking the position of an external observer who
has one probe on the reference signal and another on the
perceptual signal.

No. Throughout, I am trying to take the position that the only
probabilities that can be observed are based within the observing entity.
Sometimes I slip, I acknowledge. But that's a simple mistake, not
a failure of principle. In this case, the reference signal and
the perceptual signal are both known within the ECS. If you remember
a long way back, this came up. There is no need for an external
evaluation of the probability distribution, any more than there is a
need for an evaluation of a neural current that is based on a rate
of neural impulses. I suppose it might be possible for an external
observer with a probe to make the analyses, and sometimes it is
didactically easier to posit such an observer. But in practice there
isn't one, and it is not necessary to think of one.

One didactic problem is that WE are external to the ECS in question, so
that WE externally observe (imagine) what is going on. But we have to
try to imagine ourselves being in the ECS. It's not easy.

As you can see, we do not

begin with the
phenomenon of messages passing between behaving systems.

I've been playing with information at an intuitive level for as long as
you've been playing with control systems. It's hard for me to get back
to basics (or even to exact formulae, since I don't use them much), but
it will be a good exercise for me to try.

If the reference signal and the perceptual signal are both
varying in a pattern that requires a bandwidth of, say, 2 Hz,
doesn't this mean that both signals are carrying information at a
rate corresponding to that bandwidth?

They could be, but they need not be. The problem is in thinking of
information as being carried, as if an external observer could see it.
That's the same problem as the "codingism" problem. The information
arriving at a receiving point depends on the probabilities of the
received pattern as believed by the receiver, which are unknown to
any other party. The information the originator thinks is being sent
is based on the originator's beliefs as to the probabilities held by
the receiver. In the 2Hz case, the presumption is that all these
probabilities are flat (actually Gaussian) distributions, maximizing
the information that could be transmitted.

Anyway, I'll try to put together a discussion of this, but it may take
some time. I'm sorry I annoyed Rick with my comment. Information
theory clearly isn't as easy to understand as I thought, but in any
case I had no right to say soemthing that could be perceived as insulting.
I'm afraid I may have done it again, in a somewhat hasty reaction to
Rick's riposte. Rick--if you are reading this--let's try to keep things
more technical, if we can.