Monday, August 30, 2021

9b. Pullum, G.K. & Scholz, B.C. (2002) Empirical assessment of stimulus poverty arguments

Pullum, G.K. & Scholz, B.C. (2002) Empirical assessment of stimulus poverty arguments. The Linguistic Review 19: 9-50



This article examines a type of argument for linguistic nativism that takes the following form: (i) a fact about some natural language is exhibited that allegedly could not be learned from experience without access to a certain kind of (positive) data; (ii) it is claimed that data of the type in question are not found in normal linguistic experience; hence (iii) it is concluded that people cannot be learning the language from mere exposure to language use. We analyze the components of this sort of argument carefully, and examine four exemplars, none of which hold up. We conclude that linguists have some additional work to do if they wish to sustain their claims about having provided support for linguistic nativism, and we offer some reasons for thinking that the relevant kind of future work on this issue is likely to further undermine the linguistic nativist position.

59 comments:

  1. I am getting the sense that once again, there is a lack of distinguishing between OG and UG happening here. It seems to me that much of what Pullum and Scholz are arguing based on is OG, where children can make errors and be corrected. But how does this argument hold up for UG? For example, they argue that genre is an issue that needs to be considered, and what type of input the child is getting. There is talk of all of this variability in the input being received, but this variability should only apply to OG. The point of UG is that it is universal, so variability would be irrelevant. While I found the thought-process behind their argument compelling for taking a critical look at whether linguists have proven their case, I find these articles difficult to wade through when UG/OG distinctions aren't being recognized.

    ReplyDelete
    Replies
    1. I think I agree with you.

      One thing that confused me in this article is that all the acquirenda were rules of English grammar. I thought that what Chomsky says is innate is not those language-dependent grammar rules (or we could say surface-structure rules), but more abstract, deep-structure-level rules that are consistent across languages (even though they get manifested differently). On page 30 (in the pdf), Pullum and Scholz quickly touch upon this, saying Chomsky would argue that the real issue with the auxiliary-initial clauses argument has to do with a more general item of knowledge about constituent structure, not word positions. The authors' answer seemed weird to me: they say that it is unreasonable to suppose that a learner of English needs to know those general deep-structure rules. Of course, most learners of English do not know/learn these rules explicitly, but isn't the whole idea of UG that they are born with those rules innately and unconsciously? Aren't Pullum and Scholz doing something circular here, by dismissing UG in their paper that criticizes the poverty-of-stimulus argument? Or am I just misunderstanding?

      Delete
    2. Both of you are right. Failure to make the OG/UG distinction, and vagueness about what the "poverty of the stimulus" really means. Not "few UG violations" but none at all: no negative evidence. Category members only, no non-members, either heard or spoken by the child.

      Hence trial-error-correction learning (supervised learning) is not possible. (Unsupervised learning even less so! Why?)

      Delete
    3. Supervised learning requires feedback. This is not possible because the amount of corrective feedback is not enough for the child to learn. This is the Poverty of the Stimulus.

      Unsupervised learning is passive. This is especially not possible because we have determined that adult language is far too complex to pick up on the rules of UG by just hearing it. So, it isn't just a matter of not getting the feedback, rather, it is too complex.

      Delete
    4. Children never speak or hear UG errors. So they never get negative evidence: no violations of the rules of UG (just violations of the rules of OG). This is the “Poverty of the Stimulus” for UG rules. Without errors, there can be no error-correction: no supervised (reinforcement) learning. You can't learn a category if you don't ever encounter non-members, just members, because you have no way to find the features (rules) that distinguish members from non-members. So since all children already obey the rules of UG, they must be born already “knowing” them.

      Supervised learning is learning from the correlations distinguishing the features of UG-correct and UG-incorrect sentences (positive and negative evidence; members vs. non-members). Unsupervised learning is learning from the feature-feature correlations in what you hear, passively: no trial and error. Children hear (and speak) only positive evidence of UG. Everything they hear and say obeys the rules of UG. The features distinguishing what does and does not obey the rules of UG cannot be learned from just the feature-feature correlations of what does obey the rules of UG.
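      To make this logic concrete, here is a tiny sketch in Python (toy binary "features" and invented data, not a model of UG) of why positive-only evidence underdetermines a category: with members only, every candidate rule that covers the members fits the data equally well, and only negative evidence can prune the wrong ones.

```python
# Toy illustration: positive-only evidence cannot select among rules.
# Each "sentence" is a tuple of binary features; the true (hidden) rule
# is that feature 0 must be 1. All data here are invented.

positive_only = [(1, 1), (1, 0), (1, 1)]  # everything heard obeys the rule

# Candidate rules a learner might entertain:
rules = {
    "feature0 == 1": lambda s: s[0] == 1,
    "feature0 + feature1 >= 1": lambda s: s[0] + s[1] >= 1,
    "anything goes": lambda s: True,
}

# With positive evidence alone, every candidate rule fits perfectly:
consistent = [name for name, r in rules.items()
              if all(r(s) for s in positive_only)]
print(consistent)  # all three rules survive -- the data cannot choose

# One piece of negative evidence immediately prunes the wrong rules:
negative = (0, 1)  # a rule-violating "sentence" (never actually heard)
consistent_with_negative = [name for name, r in rules.items()
                            if all(r(s) for s in positive_only)
                            and not r(negative)]
print(consistent_with_negative)  # only the true rule remains
```

The point of the sketch is only the asymmetry: members alone leave the distinguishing feature underdetermined; a single non-member does the discriminative work.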

      Delete
  2. It certainly seems valuable to bring into question examples of “unlearnable grammar” that are commonly referenced in support of UG theory, but Pullum and Scholz seemed, to me, to reject too hastily the importance of poverty-of-stimulus arguments other than the one they focus on (their focus being the argument that "People attain knowledge of the structure of their language for which no evidence is available in the data to which they are exposed as children” - pg. 14 -> 6 in pdf).

    At the beginning of their article, P+S recognize that they are concentrating on only one aspect of the “many headed” hydra that is a comprehensive UG-supporting argument from the poverty of stimulus (APS). But, having effectively shown deficiencies in this one “head", at the end of the article they seem to suggest that there has yet to be any successful support for the APS, that "defenders of the APS are still searching for a well-confirmed case of the type they need” (pg. 46 -> 38 in pdf).

    The authors do make clear that they are *not* trying to defend empiricism, but still seem to end on the note that the burden of proof lies on defenders of APS in the nativist (UG) vs. empiricist debate. This seems, to me, to understate the significance of other reasons that APS might be true (all the reasons listed at the beginning of the article that P+S laid out and then said they weren’t going to consider).

    ReplyDelete
    Replies
    1. And, I was thinking, the placement of empiricism as the default might come from an underestimation of how different language acquisition is from other kinds of learning.

      “How do children learn language” seems similar to (or really, a subset of) the question "how do children figure out how to categorize their world such that it is intelligible, and why do we all seem to categorize it similarly”.

      But with language, there’s less feedback for "doing the wrong thing with the wrong kind of thing” (misforming sentences) than there is for interacting with objects in the real world. If you miscategorize a cactus as food, you’ll get some painful needles stuck in your tongue. There is immediate, material reinforcement to guide your learning. But since using language is just an interaction with symbolic terms, there’s no feedback on whether you’re doing the right thing with the right kind of thing unless another person tells you that you’re using these symbolic terms incorrectly (or, if you use them so incorrectly that you’re not understandable and so can’t get what you want; but it seems probable, as discussed by Pinker and Bloom - see pg. 29 - that this would not be enough to guide the development of correct grammar). Additionally, in our general categorization of the world, there aren’t a lot of mistakes that you *never* see occurring. You see kids touching cactuses, eating dirt, or trying to pet wild raccoons all the time, whereas you never hear a kid say "Is the dog that in the corner is hungry?” Basically, it seems that language involves a learning process different in kind from most other learning that occurs during development, so even if assuming empiricism as the default explanation for much of learning might be the way to go, it’s not so clear that, in the case of language, this is true.

      Delete
    2. Hi Caroline, you make a very interesting point when saying that learning language can be seen as a subset of categorizing the world: “How do children learn language” seems similar to (or really, a subset of) the question “how do children figure out how to categorize their world such that it is intelligible, and why do we all seem to categorize it similarly”.

      I agree with your point about the feedback for incorrect use of language being far less noticeable than the feedback for "doing the wrong thing with the wrong kind of thing" with objects in the world. I suppose this has to do with the idea that language is a set of symbols one level removed from the environment, whereas when learning about objects in the environment, we are directly interacting with the environment to form the category. Direct feedback from the environment, such as eating a cactus causing pain, is more dramatic than the feedback often received from incorrect language use.

      Delete
    3. I think you will see what is at issue much more clearly if you think about learning grammar, rather than learning "language," and you distinguish between OG and UG.

      If all grammar were OG, it would simply be learned, by unsupervised learning, supervised learning or instruction, just like the rules of chess or checkers. Right moves, wrong moves, and feedback. But for UG, there are no wrong moves, hence no feedback. That's the Poverty of the Stimulus.

      Can you still make the points you were making once you take these distinctions into account?

      Delete
  3. ''it is calculated that a child in a working-class family will have heard 20 million word tokens by the age of 3, and a child being raised in a family on welfare will have heard only 10 million (p. 132).''


    The wide difference in the number of words that a child hears depending on their environment, coupled with the fact that you don't see more UG errors in children from families on welfare, really shows how strong UG is. The authors try to use this fact to show that there is enough data (even for children from families on welfare) to learn the generalization of language (UG) in a data-driven manner. What they fail to mention is that if that were the case, we would see a difference in (UG) error-making between the two groups of children. In other words, if children do learn the generalization of language in a data-driven way, shouldn't the children from working-class families have a better grasp of the generalization? What we see is only a difference in OG between the two... -Elyass

    ReplyDelete
    Replies
    1. True, but since no one makes UG errors at all, the comparison is irrelevant (unless we don't make the OG/UG distinction, as Pullum doesn't).

      Delete
  4. The steps from acquirendum characterization - lacuna specification - indispensability argument - inaccessibility evidence - acquisition evidence seem to be a solid schema for analyzing the argument from the poverty of the stimulus. In order not to misread the article, at each analysis I needed to think again about the specification of the APS the authors were analyzing, which was a challenge. This framework led me to see that their suggestions, mathematical learning theory and data-driven corpus research, would be meaningful as future directions in research.

    ReplyDelete
    Replies
    1. Besides the UG/OG problem my classmates have brought up, I considered case 2, auxiliaries. On pg. 30, the authors present linguists' correction of Kimball's proposed rule and say 'All of this can be learned from examples containing one item acting as head of the complement of another' (p. 31). Isn't this a 'deeper' structure?
      However, I am not sure if this can be used to counter the authors' analysis, unless children quickly learn how to apply this correctly without hearing many ("sufficiently many") examples of the 'parochial' (p. 31) subcategories.

      Delete
    2. Grammar rules that can be learned by unsupervised or supervised learning are OG. UG is what's left over -- and it's plenty.

      Delete
  5. The discussion in the lecture today, regarding a mechanism that allows our trial-and-error experiences to be successful, was interesting. The fact that we need a mechanism that highlights certain features while ignoring others (in order to categorize correctly) reminds me of the computational Pandemonium model of pattern recognition. Although the model was constructed to describe how we process visual stimuli, there are many similarities between this model and the characteristics necessary for such a mechanism that were described in class. Primarily, the “feature demons”, which respond to specific features, are akin to our feature detectors that recognize differences between two things. The “cognitive demons” are responsible for patterns and ‘listen’ to the feature demons to determine how closely the features align with the pattern. These ‘cognitive demons’ are similar to the distinction between categories: the features will be more closely aligned with some categories than others, so we are able to narrow down the possibilities of membership. Finally, the “decision demons”, responsible for the final decision about which pattern is most likely to be the one we are perceiving, are like the final decision we make regarding a thing's membership in a category. We are most likely to decide that something belongs to one category based on the similarities it shares with other members of that category. To conclude, I think the Pandemonium model of pattern recognition exemplifies what we would need from a brain mechanism in order to have error-free categorization based on sensorimotor experimentation.

    ReplyDelete
    Replies
    1. The test of a learning model is what it can actually learn. Pandemonium was an early precursor of today's unsupervised and supervised learning models. The Perceptron was too. In one way or the other, all "pattern-learning" models are category-learning models, and what they learn is to detect which features distinguish members from non-members.
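      For concreteness, here is a minimal perceptron sketch (invented toy data, not any specific model from the course) of the kind of error-corrective, supervised category learning that Pandemonium prefigured: weighted "feature demons" whose weights are adjusted by feedback until members and non-members are separated.

```python
# Minimal perceptron: supervised category learning by error-correction.
# samples: feature tuples; labels: 1 = member, 0 = non-member.
def train_perceptron(samples, labels, epochs=20, lr=1.0):
    w = [0.0] * len(samples[0])  # one weight per feature "demon"
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            pred = 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else 0
            err = y - pred                      # error-corrective feedback
            w = [wi + lr * err * xi for wi, xi in zip(w, x)]
            b += lr * err
    return w, b

# Toy category: members are exactly those items with feature 0 present.
X = [(1, 0), (1, 1), (0, 1), (0, 0)]
y = [1, 1, 0, 0]
w, b = train_perceptron(X, y)
preds = [1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else 0 for x in X]
print(preds)  # matches y once the separating boundary is found
```

Note that the update rule only fires on errors: with members alone (no non-members, hence no errors of commission to correct), the weights could never come to encode what distinguishes the category.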

      Delete
  6. I am still finding OG and UG a little bit confusing!

    I’m fairly sure that OG can be learned passively, through unsupervised learning. UG, on the other hand, cannot be learned passively. This is because no matter what a baby overhears or witnesses, the poverty-of-the-stimulus issue is pervasive. However, children also cannot learn UG through supervised learning. This is evidenced (according to the readings) by the fact that for this to happen, children would have to make UG mistakes and be corrected and taught the correct grammar. According to the readings from this week and last week, this does not happen. Therefore, there must be some inborn capacity in children to do this before they are ever taught language. Where I am confused is when it comes to the parameters of UG. As discussed last week, an example of a parameter could be whether or not a language drops a pronoun at the end of the word by turning it into inflection. How can parameters of UG be learned when UG is innate?

    If someone could explain this to me I would really appreciate it! I am confused about the components of UG that are universal and inborn versus parameters of UG that are learned and the distinction between these concepts. How are the parameters of UG different from OG?

    ReplyDelete
  7. As brought up by other students, it appears that the authors of this text do not believe that negative feedback from one’s environment is significant/necessary for language acquisition. The text argues that the properties of a child’s environment can only provide them with positive language data exposure, but, as brought up in Pinker’s text, children do get negative feedback in the form of corrections from adults or from the general misunderstanding of others. I would also argue that children learn a lot through imitation (mirror neurons?) and that their use of language will refine itself to conform to the conventions of those around them. While reading this text, I was led to reflect on the necessity of negative data in language acquisition. Personally, I believe that negative feedback is necessary, as UG only provides individuals with the necessary hardware to acquire language and has tremendous flexibility (which leads to linguistic diversity). As much as you can learn through imitation and never have anyone tell you your sentences are not grammatical, someone (who would have started a long chain of imitations) would have needed to have learned or constructed a language which differentiates grammatical from ungrammatical sentences in order for people to understand each other. The way a language is mastered, regardless of the means of acquisition, partly relies, in my opinion, on understanding the distinction between what is right and wrong in a language, which can only be done through negative feedback.

    ReplyDelete
    Replies
    1. Hi Camille, I think you make some very interesting and valid points here! I also believe that negative feedback is necessary for language acquisition, and is quite unavoidable in a child’s environment. The authors state that children’s data-exposure histories are positive only, as they are not given negative feedback on what is ungrammatical. However, I agree that they will be given this feedback if an adult corrects them if they say something ungrammatical. I think there could also be non-verbal cues that could serve as negative feedback, such as facial expression and body language that might indicate to a child that they are not using language correctly. I also think that your point on imitation is very interesting, and I also believe that it plays an important role. For instance, there were many phrases and words that I learned from my parents as a child that peers would not be familiar with. I had believed they were common phrases since I heard my parents saying them often, but not many other people were familiar with them. This shows the direct influence our environment has on our language, and that imitation likely does play a role.

      Delete
    2. I thought your inclusion of the concept of mirror neurons was a really interesting point that I had not thought about! Mirror neurons are inborn and innate, and I think they could potentially provide some support to the internal mechanisms of UG because they allow for the intake of unspoken rules. While I don't think mirror neurons are exactly UG, I do think they could be a supplementary innate feature of cognition that allows for not just imitating sounds and words, but also providing evidence to the brain of what language is, and then maybe there is some kind of feedback system with the UG that helps language skills develop.

      Delete
    3. Camille, Pinker fails to distinguish OG corrections (plenty, because the child makes plenty of OG errors) from UG corrections (none, because the child makes no UG errors). Many OG rules are simple enough so the child can learn them through unsupervised learning too, through mere observation and imitation. (You could learn to play tic-tac-toe with neither supervision/reinforcement [error-corrective feedback] nor verbal instruction, through mere observation; but UG rules are too complex and general for that to work.)

      Evelyn, no error-correction for UG errors, because the child makes no UG errors, just OG errors. (Imitation works for simple OG features, such as vocabulary, pronunciation, idioms, and “sayings,” like the Latin sayings I had often heard my [Hungarian] parents use, but my [Canadian] school-mates had not.)

      Leah, “mirror neurons” can only help with rote imitation, as with phonology, and parrots. It won’t tell you “John is easy to please Mary” is wrong.

      Delete
  8. Something that I found interesting in this reading was the consideration of differences in grammar between dialects in the same language. This is discussed on page 18: “There is some evidence that no universal generalization can describe which plurals can occur in which types of compound… British dialects favor regular plural non-head elements considerably more than American dialects.” They give the example of American English using the phrase ‘a drug problem’, while British English uses the phrase ‘a drugs problem’. Another example of this that springs to mind is the American/Canadian dialect using the term ‘math’, and British English using the term ‘maths’, which would seem grammatically incorrect in Canada. This is interesting since this is still the same language without much difference between dialects, and yet there are still slight grammatical differences. They use this to go against the idea that universal grammar dictates the principle, and I think this shows the limits to UG.
    I also think these slight grammatical differences between dialects are a very interesting consideration in the topic of language acquisition. In particular, I think this is interesting for cultural and environmental influences.

    ReplyDelete
    Replies
    1. These are all just simple examples of learned and learnable OG.

      Delete
  9. Great example Evelyn, I agree that grammatical differences between dialects are a very interesting topic! However, I would argue that this is more an example of Gordon mischaracterizing this principle as part of UG than a dig at UG as a whole. There are various properties of grammar without universal generalizations, and just because varieties in plural elements cannot be included in UG does not mean that other principles cannot be.

    ReplyDelete
    Replies
    1. Yes, the failure to distinguish (learnable) OG from UG runs through both the Pullum article and the two Pinker articles.

      Delete
  10. Universal grammar is intuitive and innate. Despite the diversity of environments in which children grow up acquiring language, UG remains consistent and invariant; the variability exists within OG. Dialects of the same language depend on environment and culture, but these dialects vary only in OG, not in UG. As children learn to speak, they make OG errors despite hearing only correct OG. They will, however, never make UG errors, indicating that they subconsciously “know” UG already. OG, on the other hand, is learned through unsupervised learning, trial and error, and corrective feedback.

    ReplyDelete
    Replies
    1. Hi Lola. I agree with your understanding of the distinction between UG/OG as far as it relates to the necessity of learning their rules. However, in the 9a reading, we learn that UG still has parameters that we must learn to set. This is a question for myself, but perhaps you also know the answer: why is it still called innate if there are still certain boundaries we need to learn? If we can't simply understand them the way we do the other grammar rules covered by UG, wouldn't they be considered not UG but rather OG?

      Delete
    2. See Iris's reply about parameter-setting and UG/OG.

      Delete
  11. This article broadened my understanding of the stimulus poverty arguments which, up until now, I thought were very compelling. What stood out to me the most was Pullum and Scholz's breakdown of the premises that formed the poverty-of-stimulus arguments in the first place, and a proper definition of the term itself, which says that “people attain knowledge of the structure of their language for which no evidence is available in the data to which they’re exposed as children”. By outlining the properties of the child’s environment and accomplishments which have given rise to the poverty-of-stimulus arguments, I now understand that these premises are insufficient to establish the falsity of empiricist claims about language learning.

    However, I still have some unanswered questions and confusions about what’s needed for language acquisition. Remarkably, the authors of this text seem not to accept that negative feedback from one’s environment is necessary, holding that positive feedback is the only thing possible given the properties of the child's environment. But I still think it's important to consider how children typically do receive an abundance of negative feedback from things like their parents' corrections, or learning by observation through conversations with peers. It’s difficult to accept that negative feedback is not at all necessary for language acquisition when it’s such a constant and inevitable part of a child’s development.

    ReplyDelete
  12. I had a question about Wk 9's lecture, when we continued discussing the dictionary.
    My understanding after class and looking at comments from 8b:
    kernel: we get this by removing words not involved in defining other words;
    what remains defines everything inside and outside it
    minset: satellite words + kernel core; the smallest set for grounding;
    not a dictionary, because it cannot define all the words within itself,
    but it has the potential to define every word
    satellite: not a dictionary
    kernel core: defines the in-words
    What lies between the minset and the kernel? How would we characterize this portion of the dictionary? I am probably not getting how we get from the kernel to the minset.

    ReplyDelete
    Replies
    1. Hi April! In regard to your question, I would say that kernel words are the words that form the basis of our language, consisting of satellite words and core words. MinSets, as you have written above, are the minimal grounding sets: with them, theoretically, we are able to ground everything. MinSets are constituted by satellite words and core words, but each uses only a part of all the kernel words; different MinSets use different kernel words, and not all kernel words are included in any given MinSet. As shown in the graph on the slides, the kernel forms the core of the dictionary, and the MinSets exist within the domain of the kernel words.
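      To make the kernel idea concrete, here is a small sketch (with an invented four-word toy dictionary, not real data) of the step April describes: repeatedly discard words that are defined but never used to define any remaining word; whatever survives is the kernel, which can define everything inside and outside itself.

```python
# Compute the "kernel" of a toy dictionary by recursively removing
# words that are not used in any remaining word's definition.
def kernel(dictionary):
    """dictionary: {word: set of words used in its definition}."""
    words = dict(dictionary)
    while True:
        # words appearing in at least one remaining definition
        used = set().union(*words.values()) if words else set()
        removable = [w for w in words if w not in used]
        if not removable:
            return set(words)          # nothing left to prune: the kernel
        for w in removable:
            del words[w]

# Invented toy entries for illustration only:
toy = {
    "good": {"thing", "want"},
    "thing": {"thing"},          # self-referential core word
    "want": {"thing", "good"},
    "bad": {"good"},             # defines nothing else -> pruned
}
print(sorted(kernel(toy)))  # ['good', 'thing', 'want']
```

Finding a MinSet is a harder problem (a smallest set of words that breaks every definitional loop), which is presumably why the course treats it separately from the kernel.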

      Delete
  13. I’m thinking back to what Professor Harnad said in the lecture last Friday (the Nov 12th lecture), where he claims that language in our species did not begin with speech, but began with gestures. There is evidence of gestures used as language in the formation of different sign languages around the world. However, I was wondering if there was any evidence of gestural languages pre-dating the emergence of spoken languages (or if this would even be possible to see in historical documents or artifacts). If this is true, would that mean the human species was mute when we first evolved and then eventually evolved to have spoken languages?

    ReplyDelete
    Replies
    1. I don’t think that using gestures as a language before spoken language would necessarily mean that we were mute when we first evolved. I think it’s possible we still had vocal cords and made noises, just our communication was visual first before it became auditory. We still had the ability to, just not the skill yet. Eventually, we could have realized that verbal communication allowed us to communicate in different ways and thus we adapted to incorporate it more.

      Delete
  14. One thing I’m wrestling with after reading the paper and all the skywritings is the concept of negative evidence. From the readings and the lectures, I understand that we often see violations of OG but not UG, and I understand this to be because there is no such thing as negative examples of UG (the poverty of the stimulus argument). That being said, why do we distinguish positive from negative examples of UG in the first place? If there is no such thing as a negative example of UG (an unthinkable thought), why do we need to specify “positive” vs. “negative”? Isn’t every example of UG we see positive, thus inherently eliminating the need for a “negative” label?

    ReplyDelete
    Replies
    1. Hi Emma!
      I think negative evidence is an indispensable component of supervised learning, and that's how "trial and error" learning happens. For example, if we only receive positive feedback about what is an edible mushroom, we still do not understand the "edible mushroom" category -- we need to know the features of inedible mushrooms to learn what an edible mushroom is. Similarly, if we learned UG through supervised learning, we would need negative evidence. But since we never receive negative evidence for UG while acquiring language, we did not learn it through supervised learning. Thus, we must be born with UG.

      Delete
  15. Certain elements of the poverty of stimulus argument, as well as of the distinctions between OG and UG were made clearer to me by the discussion around this week’s reading (although the reading itself did not retain a strong enough distinction between OG and UG). I still, however, have some reservations about accepting the poverty of stimulus argument for UG. So certainly, without negative feedback it would have been impossible for a child to learn the rules of UG, such as the example Harnad gave last week:

    UG:
    John is eager to please Mary (UG+)
    *John is easy to please Mary (UG-)

    This is because this kind of sentence is never produced by the child. I suppose maybe my confusion can be chalked up to a lack of knowledge (explicit knowledge, that is) about the rules of UG. For this particular example, I feel as though the incorrectness of the second sentence could stem from the fact that “easy” is a word to describe actions/activities, and not people (unless we are referring to their sexual proclivities). Maybe there is something I am missing here -- or perhaps I need to hear more examples of violations of UG in order to have a better grasp of the strength of the argument for its existence. Does the fact that some sentences never occur necessarily have to point to a giant underlying structure like UG? Or can these examples of UG-violating sentences that would never occur to a child be explained by something else, perhaps more related to the semantic content of the words than to the structure of UG?

    ReplyDelete
    Replies
    1. Hey Sofia,
      I think it is perfectly natural to have questions like yours. As we repeatedly discussed language throughout the course, I think the [confusion between UG & OG] and [doubts about UG] arise from the fact that we all speak a language. Thus, we all have an intuitive idea of what language is and how we learn it, because we have gone through it ourselves (for both 1st and 2nd languages) and seen different kid-sibs go through it numerous times as well. As seen in the readings, even academics like Pinker & Bloom and Pullum & Scholz struggle to make the distinction between UG and OG, and I believe it is for similar reasons.
      It is a bit counterintuitive to think that there could be a part of language that is unlearnable and hardwired. And it is harder because as you said, other than Chomskian linguists, we do not know explicitly what UG is. UG follows different rules from what we do explicitly know, OG, and hence causes all this confusion and doubt.
      Prof Harnad even mentioned that UG was divisive amongst psychologists when first proposed by Chomsky. Pinker (kind of like a middleman), who had to explain what UG actually is to psychologists, wanted to calm the hostility and hence wrote, in essence, “Most of language is learned anyway, and the part that isn’t learnable (UG) could have evolved” (so calm your horses, essentially). And while conveying that message, Pinker oversimplified the problem of the “evolution of language” and completely skipped the controversial argument from the “Poverty of the Stimulus”.
      Also, remember that Chomsky initially started out by criticizing Skinner’s book “Verbal Behavior”, which accounted for language acquisition only through the behaviorist lens of conditioning. Imagine how controversial UG must have been amongst psychologists when the most dominant theory of its time, behaviorism, and its pioneer, Skinner, were being criticized.

      Delete
    2. Now to (sort of) answer your other question: “Does the fact that some sentences never occur necessarily have to point to a giant underlying structure like UG?”.

      So first off, the argument from the poverty of the stimulus (POS) is an “underdetermination of theory by data” (Chomsky, 1975). In other (simpler) words, the language output during language acquisition is underdetermined by the primary linguistic data available in the environment. This is the basis of the POS argument. It is the fact that only UG-compliant sentences are heard and spoken (even during language acquisition) that points to the underdetermination. Language acquisition is SUPER underdetermined by the linguistic data available.

      Second, the fact that “some sentences never occur” shows precisely how UG is underdetermined and cannot be learned. As we saw in the categorization lectures, there are three ways of learning: unsupervised, supervised, and hearsay/instruction.

      The absence of negative evidence makes unsupervised learning impossible because everyone speaks UG-compliantly. Children cannot passively induce the features that distinguish UG-compliant from UG-violating sentences when everything they are exposed to is UG-compliant and they never encounter any UG-violating input.
      The absence of negative evidence makes supervised learning impossible because there can be no error-correction if children only ever speak UG-compliantly. Thus, because this “absence of negative evidence” (the poverty of the stimulus) shows that UG cannot be learned, it follows that such grammatical rules must be inborn. Hypothetically, if we were NOT born with UG, we could still learn it through our usual learning methods (thanks to lazy evolution) if we had enough negative evidence. However, because we do not have any negative evidence, we know that it is NOT learned.
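      To make the positive-evidence-only point concrete, here is a toy sketch (my own illustration, not from the paper or the lectures): two hypothetical “grammars”, one strictly narrower than the other, both accept every sentence the learner ever hears. Positive data alone can never decide between them; only negative evidence (a sentence marked as ungrammatical) could.

```python
# Toy "sentences": strings over the alphabet {a, b}.
# Two candidate grammars, both purely illustrative:
narrow = lambda s: s == "ab" * (len(s) // 2)   # accepts only (ab)^n
broad  = lambda s: set(s) <= {"a", "b"}        # accepts any a/b string

# Everything the "child" ever hears is compliant with BOTH hypotheses:
heard = ["ab", "abab", "ababab"]

consistent_narrow = all(narrow(s) for s in heard)
consistent_broad  = all(broad(s)  for s in heard)

# Both hypotheses fit all the positive data, so the data cannot
# decide between them -- the grammar is underdetermined.
print(consistent_narrow, consistent_broad)  # → True True
```

      A single negative example, such as being told that "ba" is unacceptable, would rule out the broad grammar; the POS argument for UG is precisely that no such negative evidence ever arrives.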

      Delete
    3. Just to add one more thing, this absence of negative evidence, or the presence of only positive evidence, is one of the reasons why UG is met with an evolutionary problem (where other aspects of language are not). Just like how UG cannot be learned by us, it could not have been learned by evolution either: “It is not at all clear what would serve as error-correction, and what would count as right and wrong, in order to shape UG in the usual Darwinian way: through trial and error genetic variation, and adaptive selection on the basis of advantages in survival and reproduction” (Harnad, 2008).

      Another reason why UG is met with an evolutionary problem is because it has “no apparent adaptive advantages. The absence of a biological advantage for UG is an even greater handicap than the poverty of the stimulus” (Harnad 2008).

      Although I can’t comment on whether UG-violating sentences relate to the semantic content of the words, hopefully this clarifies the argument of the POS.

      Delete
    4. Thank you, Iris, for this thoughtful and very helpful response !!

      When we talk about the evolutionary problem for UG, the problem sounds a lot like the one we met when discussing propositionality: how did the ability to talk about something in the abstract/generally/that is not immediately present arise so suddenly in the evolution of humans? Or, how did gestural communication evolve into propositional language (which is just a fancy way of saying language !!)? Could a similar explanation be possible for both UG and the emergence of propositionality -- are they essentially the same thing?

      Anyway, your answer as to POS gave me a lot of food for thought !!

      Delete
    5. I have been having similar questions and I found your clarification, Iris, to be very helpful in resolving them for me. I had been questioning for quite a while the extent to which any linguist actually knows the rules of UG. In all the readings we have done so far, I have yet to see a rule described that is shared by all languages. As you point out, it may rather be that we cannot describe the rules of UG because they are inherently, as you say, underdetermined and cannot be learned. In adulthood, as scientists and linguists, we struggle to identify the features of UG that are so deeply ingrained. For want of negative evidence of UG, we cannot define it or distinguish it from what is not UG, hence the difficulty in its description. This can lead to criticisms that UG and the poverty of the stimulus arguments are unfalsifiable, and therefore pseudoscientific by Popper’s demarcation criterion. Some may argue that UG rules are simply observed features defined after the fact, rather than hypotheses that are predicted and tested. But in defense of UG, one may rebut this criticism by noting that the poverty of the stimulus predicts what children may or may not be exposed to in their language experience, and what they may or may not learn. Based on the arguments you highlighted above, it must logically follow that UG cannot be learned.

      Delete
    6. This thread was incredibly helpful for understanding and clarifying the doubts I was facing in fully grasping the distinction between OG and UG. Given that there is no negative evidence from which one could learn UG, how would one then go about teaching or coding this when trying to get a Turing machine to learn language? Since there is no negative evidence for UG, one not only has to assume that it is an innate mechanism; I also question how one could “teach/integrate” it into a robot. When trying to reverse-engineer a machine similar enough to a human to pass the Turing test, I wonder how teaching it to learn language will be affected if the UG humans have cannot be “passed on” or integrated into the robot, seeing as we are unable to teach it.

      Delete
  16. I feel like I have a better understanding of the reasoning for and against linguistic nativism after reading both papers, but I still have some fundamental doubts about how this material applies to reverse engineering cognition. The overall theme of these readings seems to be whether we can conclude that people learn the structure of their language despite the paucity of evidence available to children during the language acquisition process. This raises the question of how much language acquisition depends on intrinsic mental systems. I am confused about how knowing the degree to which linguistic nativism is correct helps us build T3. T3 obviously needs language to do everything we can do, but do we need to worry about how we acquire language in order to derive its mechanism? I understand that language serves a wide range of functions, including symbol grounding and categorization. Language supplies the symbols that are “grounded” in components of the actual world, hence the ability to acquire categories through instruction is obviously language dependent. As a result, T3 needs language skills (but we already knew that). However, if Pinker’s linguistic nativist argument turns out to be right, or wrong, what changes about how we conceptualize T3?

    My guess is that UG is what we’re really interested in, because if UG is needed to pass T2, then it must also be built into T3. The implication might be that a T3 robot that has categorization capacity (i.e. can learn by induction or by unsupervised learning) might still not have language capacity.

    ReplyDelete
    Replies
    1. Hi Melissa! I enjoyed reading your comment because I am thinking in a similar way. I think I have strengthened my understanding of UG as an innate feature we all have to understand and use grammatical rules. Because there are no contradictions to UG, there is no negative evidence for it to be due to learning (unsupervised or otherwise), so it is chalked up to an innate feature of human development.
      I wonder then, like you said, how this relates to the issue of reverse-engineering consciousness. In order to develop a T3 robot, we need it to be able to cognize and use language indistinguishably from us. To do this, we may need to understand more about the modularity of UG. Looking at the functions of UG (a way to increase the speed at which we develop language, which is meant to serve as a social tool for survival and reproduction, as well as a way to represent complex thoughts) could help us get there, as could considering theoretical advantages to UG (although I understand from earlier skywritings and Harnad’s writing that there is “no apparent adaptive advantage” to UG, which creates an even larger problem than the poverty of the stimulus argument).
      My thinking is that the OG/UG distinction complicates what we already know about reverse-engineering consciousness because it suggests there is an inborn feature that we know very little about, and that there is no apparent way for it to have evolved. This casts more doubt in my mind that we are on our way to having sufficient understanding to reverse-engineer consciousness without first answering more of these problems. Additionally, could the concept of UG suggest that there exist more inborn features of human experience that we are not aware of, due to the absence of negative evidence? How would this further complicate the issue of reverse-engineering consciousness successfully?

      Delete
    2. Hi Melissa and Madelaine, Prof. Harnad posed a question before about whether GPT-3 has UG, to which I respond yes, because GPT-3 seems to be able to command the syntax of many languages, not just English, which means its ability to learn the syntax of any language is at least partially on par with humans’. Of course, this does not mean it actually understands language, because of the symbol grounding problem. But since UG implies only the ability to command the syntax of any language, I am prone to believe GPT-3 already has UG.

      Delete
    3. Hi Zilong. On the contrary, I don't think GPT-3 has UG. From my understanding, since UG is something inherent, we cannot even describe its rules. To simulate the process of language acquisition in a computer program, we usually build in some assumptions about the units of language, which cue our 'machine' to look for the patterns worth paying attention to. However, since UG is not describable, I can't see how we could enumerate all of UG and put this information into GPT-3, not to mention that we don't even have enough negative evidence to learn UG. Therefore, from my point of view, GPT-3 is still an input/output T0 robot.

      Delete
  17. “What if we locked 8 babies in a room for 20 years and see if they make their own language”
    This question was raised in a meme posted in the chat after the lecture in which Prof. Harnad asked: what is the thing that we have, but animals don’t, that gives us language? Surely, it is neither practical nor ethical to answer the question by locking babies up for 20 years. However, it reminds me of the case of Nicaraguan Sign Language that Prof. Harnad mentioned, which can be seen as evidence for the gesture theory, suggesting that human language developed from gestures instead of vocal signals.
    In our lecture, Prof. Harnad proposed that it’s a mistake to look for the origin of language in the origin of speech. However, when I look into the case of Nicaraguan Sign Language, I wonder if it could actually be seen as evidence for the gesture theory within the frame of our class discussion. One issue is that, without doubt, this ‘experiment’ shows that children collectively possess the capacity to learn and create ‘language’, but it does not explain how these students developed the pidgin-like form of the language into a sign language with a higher level of complexity.
    Later analysis of the language the young children developed shows that its spatial modulations, which are the building blocks of grammar in sign language, were signed more frequently by the later-exposed signers. More significantly, when describing complex motions, the early-exposed signers signed the motions simultaneously while the later-exposed signers signed them sequentially, indicating that this combinatory change is the ‘thing’ that denotes a shift from gestural to language-like expressions. But how did they learn not only the signs for words and ‘verb agreement’ but also the other conventions of grammar so fast?
    To answer this, I tried to investigate this case further to find out when this more complex system emerged, and whether it was spontaneous. Before they came to school, these deaf children were using simple home-sign systems and gestures to communicate with others: they do not seem to have had the ability to claim something or say that something is true. And the language scheme that the teachers first applied to these children was not suitable; many children failed to grasp the concepts of words. So, did they learn ‘proposition’ just by communicating with one another instead of being linguistically connected (under supervised learning) with their teachers? Clearly, the case of Nicaraguan Sign Language, the process from pantomime to sign language, favors the gestural origin of language. However, given the question I had, I think gestures might be necessary, but not sufficient, to give us the power of creating and using language.

    ReplyDelete
  18. From what I’ve understood in learning about the poverty of the stimulus argument, Chomsky’s primary coup in formulating it was that it was a refutation of previously dominant behaviorist theories of language acquisition through operant conditioning. Pullum and Scholz seem to get bogged down in the grammatical analysis of components of English sentences such as word order, which have nothing to do with Chomsky’s APS and UG, but do have to do with ordinary English grammar. I will admit that some of this text was lost on me, since it was very dryly written. But I do wonder, since it is impossible to formulate a sentence that violates UG, how one would even formulate a proper argument against it.

    ReplyDelete
    Replies
    1. Hi Milo,

      I agree with your final statement that it may be difficult to formulate a proper argument against UG if it is impossible to create a contradictory sentence - this would thus be a paradox of sorts. Therefore, I believe this is why the main arguments against UG are based on biological and evolutionary considerations - as Iris mentioned above, "just like how UG cannot be learned by us, it could not have been learned by evolution either", as well as the fact that there is no clear biological advantage.

      Delete
  19. When Chomsky put forward the APS, he intended to show that behaviorism alone is not enough to explain how children learn language. Children seem to abide by the rules of UG without any supervision, and they learn language so fast that merely receiving positive or negative feedback would not be enough to account for such a rate of learning. Chomsky coined the term UG for a form of grammar that is already there at birth.

    However, I find it questionable whether this is a completely different approach from behaviorism. For behaviorism, what happens in the brain is part of the ‘Black Box’, and of no major concern to the behaviorist – all they care about is the input and the output. Chomsky says it’s not a Black Box, it’s UG; but does giving it a different name make it any clearer how and why this innate language ability is there? True, he did say that behaviorism alone cannot be the answer, but what UG truly is still remains elusive.

    ReplyDelete
    Replies
    1. Hi Shandra, I really relate to the points you make about the mysteriousness of UG, and whether giving a label to something we can’t define/examine is useful at all.
      Clearly, Pullum & Scholz failed to distinguish UG from OG in this article — and while this is a pretty significant thing to overlook on their part, deep down, I can’t help but sympathize. UG is so elusive, as you said, so there really is no way to properly argue for or against it.
      In fact, how can we distinguish UG from OG when we don’t know the mechanism by which UG operates? And if we don’t know what UG is not — because of the absence of sentences that are not UG-compliant — how can we distinguish UG from non-UG? As you mention, calling this black box “UG” doesn’t really shed light on what goes on inside it, and doesn’t help distinguish UG “members” from “non-members”.
      It seems that speaking of UG in relation to OG is the most comprehensive way of speaking of UG at all; but in doing so, anyone runs the risk of conflating the two, which I think is what happened here with Pullum & Scholz.

      Delete
  20. The poverty of the stimulus argument states that when a child first ‘learns’ a language, there are not enough environmental/linguistic stimuli for them to learn the language by trial-and-error, or by purely associative learning. Children do not learn language through ordinary grammatical structure (the way adults learn a second language); rather, an inherited language-acquisition ability allows them to learn their mother tongue at a surprisingly high speed. I think the reason why a newborn, or anybody, will face the problem of the poverty of the stimulus in language learning is that there are infinite ways of expression, and there will never be enough stimuli for language, even when we are learning OG, not UG, as adults.

    ReplyDelete
  21. Basically, this paper wants to show that linguists have not provided the reasoning behind the claim that children are born with knowledge of language without having to learn it through experience. As Chomsky suggested, the best way to study this would be to search for cases where it would be most likely impossible for children to experience the language structure tested. Another important thing is to determine what is enough experience to learn a language structure and what is not. To answer this question, we need to know the utterances the infant is exposed to and where the infant’s attention is. Lastly, the paper mentions that two specialties will be required to resolve the problem: mathematical learning theory and corpus linguistics. Mathematical learning theory is useful for determining bounds on how much experience is needed for children to learn language structure, and corpus linguistics is useful for understanding the typical content of language (corpora) and being able to say what is accessible and inaccessible to children.

    ReplyDelete
  22. So based on this reading, and on past readings, my understanding is that the “something innate” which helps children not make errors is Universal Grammar, whereas Ordinary Grammar is learned. Since UG is assumed to be innate, reverse-engineering it would be futile since, again, it is innate. And if it’s not possible to reverse-engineer, then it wouldn’t be possible to deploy this in a robot?

    ReplyDelete
  23. I am still a little bit confused about the poverty of stimulus argument that universal grammar (UG) is unlearnable. The argument sounds circular to me.
    According to this argument, ordinary grammar (OG) can be learned through unsupervised or supervised learning or through instruction. However, UG cannot be learned, and thus is innate. The reason is that, in the case of OG, we can make mistakes and be corrected by parents. However, in the case of UG, we cannot make mistakes, and thus have no negative evidence—everything we say is in accord with UG, so we do not know what does not count as UG. However, without negative evidence, we cannot learn a categorization. Hence, the conclusion is that UG is not learned, but inborn.
    The part I do not fully grasp is why we cannot make mistakes on UG. If it is because, by definition and construction, it is something we must follow and on which we cannot make mistakes, then we have already presupposed it to be something inborn. It seems to me that we have presupposed from the beginning that UG is a set of abstract principles, and that under the guidance of these principles, the environment sets the parameters and we learn OG. Hence, by hypothesizing the existence of UG, we have already hypothesized something unlearnable.

    ReplyDelete

PSYC 538 Syllabus

Categorization, Communication and Consciousness 2021 Time : FRIDAYS 11:35-2:25  Place : ZOOM Instructors : Stevan Harnad & Fernanda Pere...