[From Bruce Abbott (990510.1945 EST)]
Bill Powers (990510.1146 MDT) --
Hmm. Thinking out loud.
Apparent matching _could_ be explained by saying that animals basically
select the best key and use it, but as the total payoff declines, they
begin more and more often trying the other key. Actually, with concurrent
schedules I think that there is always a net gain from trying the other key
(this just occurred to me). An occasional press on the less productive key
does not substantially reduce the delivery rate from pressing the best key.
In fact, it need not reduce it at all unless the animal uses the other key
too often. The schedules are set up, according to Bruce, so that once the
interval runs out, the key is primed to produce a reinforcer on the next
peck, so anytime between the start of an interval and its termination, the
other key can be pecked without any penalty at all.
As a simple example, assume that one schedule is VI 30-s and the other is VI
60-s. By switching between keys often enough, our pigeon could collect
every reinforcer programmed on both schedules, netting itself an average
reinforcement rate of 3 reinforcers per minute (2/min on VI 30-s plus 1/min
on VI 60-s). So in this case the pigeon can increase reinforcement rate 50%
by switching between keys, as opposed to staying with the key offering the
higher payoff rate (VI 30-s).
This means that pressing the best key exclusively is NEVER the best
strategy, under the schedules as defined. The animal should peck one key
for a short time, and if that doesn't produce a bit of food immediately, it
should peck the other key for a short time, and so on back and forth as
quickly as possible. This will produce what looks like a tendency to
matching, because the animal will give up on the less productive key
without receiving a reward sooner than on the more productive one.
Hmmmm. This needs to be spelled out. Why would the animal give up "sooner"
on the less productive key? Maybe it switches keys if the current key
hasn't paid off for some relatively fixed period of time. Because there are
more "excessively long" intervals on the longer VI schedule, switching from
the longer schedule (less productive key) to the shorter one (more
productive key) would take place more often than the reverse. The pigeon is
"giving up" on a key after the same lapse of time without reinforcement,
regardless of schedule; it's not giving up on the less productive key
sooner, it's giving up on the less productive key more often.
Hmm again.
Hey, it should use the e. coli strategy! The "tumbles" here are switches
from one key to another. If there is no food, switch to the next choice; if
there is food, delay the switch to the next choice -- all the while pecking
away at a constant rate at whichever key is the current choice. That will
lead to delivering more pecks on the side that produces the most
reinforcement.
I think you mean "if there is no food _by some interval_, switch." If so,
that sounds reasonable.
We can generalize: one of the "keys" could be "no key" -- i.e. going away
from the keys and pecking at different places in the cage. Of course the
yield there is zero, so there will be a quick switch to the next place, and
a return to pecking on keys. Since the keys do usually provide food on the
first peck after a long delay, the time spend on keys as opposed to away
from them will increase. It should be easy to set the parameters of an e.
coli model to get the best match of model behavior to the data.
Yes, except that some things a pigeon does other than key-pecking may help
to control other variables; to that extent those other behaviors will
compete with keypecking. Herrnstein allowed for this possibility in his
equation for single-key responding: P = kr/(r + ro), where k is a constant
and ro is the rate of reinforcement for "other behavior," scaled in
grain-access-equivalent units.
I expect that by the time I get back from Boston, three people will have
this model working.
Hmmm. I wonder which three? . . . Anyone care to draw straws?
Regards,
Bruce