[Dan Miller (980210)]
From Richard Kennaway (980219.1717 GMT):
My goodness, but I suppose I should have expected it. Flames beget
flames (sorry for the probable/causal allusion). I can only answer a
few of your queries and address some of your concerns, but the
readers will get enough of it, I'm sure.
If you would be more convinced by a new piece from someone else's file
drawer, I can email you a paper of mine I completed a couple of months ago
on the use and practical meaning of correlations, in PostScript format, or
snailmail you a paper copy.
When you first came onto CSGNet a few months ago, I read your papers,
including the one you note. If you want to send it to me, then go
ahead.
In my reply to Bill Powers I wrote:
> Amount of reading (IV) is statistically related to Political
> Progressivism (DV) with a correlation of, r = 0.62.
>
>You say that nothing can be said about this because for nearly half
>of the subjects the relationship does not hold. Actually, this is
>not how population statistics work. A correlation of .62 tells us
>that the relationship is quite strong for the sample studied.
Richard Kennaway speaking for Bill writes:
Firstly, what do you mean by "quite strong"? For what purpose is it
"strong"? What can you do with it?
In the social sciences this is a finding that would raise eyebrows.
So, my measure of strong is level of eye-raising. The stronger the
statistical relationship the higher the eye-raising. BTW, I would
only make such a statement if I had the measures of at least fifty
sociologists. Only then would I make the kind of statement I do.
The fifty has a meaning, right, Richard?
Richard identifies himself as a psychologist by writing the
following:
Secondly, what do you mean by "the relationship"? The observation of the
population correlation does not imply anything at all about, say, what will
happen to an individual's political views if they read more. It doesn't
even make a probabilistic statement about that. So what is this
"relationship", other than a word to suggest that the experimental result
is an observation of a real entity?
It is a statistical relationship - one derived by using a statistic
on variables in a sample. I am talking about the sample,
specifically, and perhaps the population generally. I am not now,
nor never would use the relationship to discuss an individual.
Richard Kennaway then asserts and asks:
What you do is presuppose a mechanism to connect the two variables: the act
of reading creates a context where certain ideas can thrive. You don't
even put it forward as a hypothesis or speculation, but slide it in as a
presupposition. Why?
PCTers have already established the control mechanism (are these the
right words?). This is really all they have established, which I
think is great. It is an outstanding demonstration of how behavior
works. So, I presuppose control. Do we have to establish it in each
case?
Richard, read my posts carefully. This will help you in school. You
ask what I claim.
Are you claiming that this assumed something about the act of reading is
present in individuals? If you are, you are applying population statistics
to individuals. If you are not, where is it? And whatever it is, how do
you account for the 29% of people in whom this mechanism appears to be
inoperative?
I used the finding as an example of possibly interesting IV-DV
research - research that might sensitize people, raise an eyebrow,
get them off their computer. I could have used any of several other
findings that I find interesting. I never for a moment wondered if
you would find it interesting. BTW, how can you claim that the
mechanism is not operative for 29% of the people if you have not seen
the data?
> Because it isn't science, it's coffee-table chat.
Richard, we are lucky to have you to tell us the difference.
Einstein first thought of the special theory of relativity while
in a street car, riding away from a town clock. I don't mean to
equate myself with Einstein, but would you say that such a silly idea
expressed to a friend would be coffee table chat or science? It was,
at the time, only idle speculation.
Have you
looked at scatterplots of bivariate normal distributions of given
correlation? I have, and I wouldn't call the cloud you get with c=0.62
"pretty tight". You can see scatterplots for various values of the
correlation from 0 to 0.99995 in the paper I mentioned.
Yes, the r=.62 is a cloud, but one with a definite shape, wouldn't
you say?
The paper also computes some formulas concerning the probability of making
a successful prediction about an individual's value of one variable given
the other. These calculations assume a bivariate normal distribution, but
I doubt the results would be substantially different in other situations.
Richard, I'm sorry, but why would you want to do the ecological
fallacy? That is, why would you think you could predict "an
individual's value of [sic] one variable given the other"? These are
population statistics. We can only say something about the
population (or sample) we have observed or measured.
> (1) The proportion of individuals which have "Amount of reading"
above
average and "Political Progressivism" below average (i.e. the opposite of
the overall direction of the correlation) is arccos(0.62)/pi = 29%.
Accepting your formula, this is well short of one-half.
(2) The mutual information between the two variables is log (1/(1-c*c))
(using binary logs). For c=0.62, this is 0.35, or about one-third of a
bit. For an individual, there is not enough information in the value of
one variable to even make a single yes-no prediction about the value of the
other.
There you go trying to talk about individuals, again. This is
reductionism at its most flagrant, and an ecological fallacy. Surely
you have the scientific proof for establishing an ecological fallacy?
If so, then use it.
(3) Knowing the value of one variable reduces the standard deviation of the
other by a factor of 1.27.
(4) The proportion of variance in one variable which is "explained" by the
other (in the technical sense of "explain" used in analysis of variance, a
sense which bears no relation to its everyday sense), is 38%.
Analysis of variance is an inferential statistic. We've been talking
about population statistic/descriptive statistics, haven't we. Even
in statistical packages for computers, analysis of variance is not
computed like I did it by hand in graduate school. Is this what you
are getting at? I don't follow.
(5) No prediction whatsoever, not even a probabilistic one, can be made
about how an individual's views, or a whole populations views, will change
if they are induced to read more. (You don't describe the experimental
designs, but I'm assuming that the experiments you cite simply survey a
large number of people, measuring their "amount of reading" and "political
progressivism".)
Who was talking about prediction? The only thing I would do,
supposing the finding had any merit, would be to teach kids how to
read (and, if possible, how to love reading).
Again, defending Bill Powers you write:
Considering that in the case of zero correlation, there would be exactly
half, this is not a strong statement. As indicated above, the proportion
is 29%. And you don't know which 29%. For a bivariate normal
distribution, only 3.7% of the population are so far from the mean of one
variable that they are 95% likely to be on the same side of the mean for
the other variable. (This figure is sensitive to the shape of the extreme
tails of the distribution, and it would require an extremely large sample
to have any reason to believe that the distribution is normal out to that
range.)
And your point is what? That the statistical association doesn't
really exist? Have you ever heard of Kaplan's Law of the Instrument?
> We aren't talking
>about an average person, or any particular person, but rather about
>the sample of individuals (and, perhaps, the population of
>individuals). These are population statistics, and not individual
>statistics. Using such population statistics to describe any
>specific individual would be incorrect usage.
We agree on that, at least. But then, what would be a correct use of the
population statistic you cite?
We then can agree that we can only make descriptive statements
about the sample, or perhaps the larger population from which the
sample was derived.
Then, speaking of Perceptual Control Theorists use of population
statistics to tell us that they are scientists, I wrote:
>This is why I wonder
>why you use correlations to measure control as done by a single
>individual. What is the population? What is the sample?
Richard responds:
Have you run, and understood, Rick Marken's demos? Then you will know what
the variables are whose correlations are computed, and why this is a useful
thing to do.
Sorry, but correlations are point in time measures. Were
point-biserial correlations used? Are these the measures of errors
over a period of time? I don't know exactly which demonstrations you
talk about, but I consider them just that - demonstrations. I've
always been impressed with them as demonstrations. I am convinced
about control. You do understand this. My dispute is with the cant
and rhetoric of this forum.
Are we so needy that we have to have statistics to secure our faith
that we are scientists, or to convince others of that status? Darwin
didn't have them. Is he a scientist? Oh, I forgot - Darwin was
proven wrong by Bill.
>No half
>bright sociologist would reduce population statistics to discuss an
>"average" individual. To use statistics to do so is to use them
>incorrectly.
Richard's snappy reply:
There must be a lot of less than half bright sociologists around, or
non-sociologists of various brightnesses. Population statistics are used
to make statements about individuals all over the place. Look at
psychometric testing -- a field whose whole purpose is to use population
statistics to make statements about individuals.
I've been in sociology for a long time - going to meetings, reading
the journals, talking to folks, and one thing I can say is this.
Maybe only a handful of all sociologists I've had contact with in
these ways has stooped to using psychometrics. The only folks I know
who use psychometric testing in association with population
statistics is, you guessed it, psychologists.
Right about now, my late friend and teacher Carl Couch would tell me
that I've given these people enough of my time. He would remind me
of Thomas Pynchon's great quote in Gravity's Rainbow. He calls it
Proverbs for Paranoids and it goes like this: "You don't have to
worry abou the answers if you get them asking the wrong questions."
Psychologists who use psychometrics (measuring psychos?) with
population statistics to talk about an average, above average,
exceptional individual are asking the wrong questions.
You conclude:
Rick, is "PCT is a useful strand of experimental psychology" on your list
of standard reasons for rejecting PCT?
My god, Richard, I'm sorry. Couch was right. I am giving this too
much time. You were talking to Rick Marken and not to me. The
points you were making were with him and not with me. Silly me. I
rechecked and you did address me by name, go ad hominem, the whole
flame, but you were really trying to get some brownie points with
Rick the Bruiser. I didn't even have to do the test to see what
variable you were controlling for.
I think you've found a home.
Adios,
Dan
Dan Miller
miller@riker.stjoe.udayton.edu