The following misconception seems to have been held uncritically by some English
language teachers for the past few decades and has undoubtedly hindered learners from
acquiring basic speaking competence.
Misconception: The purpose of learning pronunciation is 1) to locate the vowel that is
pronounced with a primary stress in a given word, and to locate the word with a major
stress in a given sentence, and 2) to distinguish the individual vowel and consonant
sounds or phonemes.
The source of the misconception
Entrance Examinations
Since English is one of the core subjects at school, English examinations are given at
various times: at mid-term and at the end of the term in high school, and upon entrance to
senior high schools, colleges, and universities. These English examinations have
frequently included pronunciation questions in written form which generally require
finding the stress in a given word or sentence and comparing individual vowel and
consonant sounds. Appendices 1 and 2 contain typical examples. (The term
"pronunciation questions" is used hereafter to mean "pronunciation
questions in written form.")
Appendix 1 is from the National Entrance Examination (English Portion) which was given
January 17, 1996. This is the examination administered by the National Center for
University Entrance Examinations, a central government organization for all high school
students who want to enter national, prefectural, municipal, and some private colleges and
universities. In the examination there are 51 questions, eight of which are related to
pronunciation. Appendix 2 is from the first National Entrance Examination given in 1979.
It is clear that there hasnt been much change since then in the way pronunciation
questions are given. Type C questions in Appendix 1,
where the word with a major stress in the given sentence is to be selected, are nothing
new. Questions of this kind first appeared in the National Entrance Examination in 1985.
Besides the National Entrance Examination, there is another category of entrance
examinations. This is what each private college and university administers on its own. A
survey conducted by the author shows that examinations in the private sector usually cover
more or less the same pronunciation questions as those found in Appendices 1 and 2.
Accordingly, what passes for pronunciation instruction at senior high school has only
consisted of 1) explaining Daniel-Jones phonemic symbols using a chart of English
vowels and consonants, 2) using minimal pair exercises, 3) having learners memorize
pronunciation patterns to predict on which vowel the primary stress is placed in a word,
and 4) distinguishing the differences in the sounds of a vowel or consonant in a given
word (See Appendices 1 and 2 for examples). All this is done so that learners can get higher
marks in the pronunciation sections of university or college entrance examinations. As for
finding the stressed word in a sentence, teachers only try to approach this issue from a
semantic or structural point of view and usually do not have learners orally produce the
sentence. It is no wonder that Cronin (1996:16) states, "Unfortunately, in Japan,
many students will have studied [English] for six years in school with little useful
practice other than of individual words."
A recent trend in the English examinations
As mentioned above, there are entrance examinations for senior high schools which used
to include pronunciation questions. But today, a close look at some of the entrance
examinations for senior high schools will reveal that the questions are different from
what they used to be. Among the prefectural senior high schools, the number that give
pronunciation questions in entrance examinations is decreasing. The statistics (Obunsha
1996) show that in 1996 the number of prefectures which included pronunciation questions
was 11 out of 47, whereas in 1994 it was 18.
In Japan there is an English proficiency test called STEP (the Society for Testing
English Proficiency). In 1995, 3.5 million people (from elementary school pupils to
adults) took this test. The test once included pronunciation questions as an important
component, but the administrators revised the test in 1993 and eliminated all
pronunciation questions.
In brief, the entrance examinations for universities and colleges seem to be the cause
of the misconception, as described above. If pronunciation questions were abolished from
university examinations, as many of the high schools and STEP have done, there would be a
much broader change in the way pronunciation is taught. It is perhaps unfortunate that
recent innovations in high school English examinations have not had any practical
repercussion on institutions of higher education.
Taking a more holistic approach
Shifting towards communicative language teaching (CLT)
It was in the early 1980s that some English education specialists began to discuss the
importance of introducing communication-oriented language teaching. The communicative
approach become a buzzword among people in the fields of education and business in the
late 1980s. The latter complained that students did not usually have a good enough command
of English to communicate in the business world, even after completing ten years of
English training.
In the mid-1980s the Ad Hoc Education Advisory Committee put forward a package of
recommendations for educational innovation. Part of the package was a recommendation for
future English language education. Following the Committees recommendation, the
Ministry of Education decided that the English curriculum should be revised. As a result,
the English curriculum has been geared more toward communication, and textbook publishers
have revised or edited their teaching materials according to the Ministry guidelines. It
is now generally believed that CLT is the right track for future English education; it
seems that both the education administration and school teachers have agreed on the
importance of shifting to CLT. But there seems to be no clear idea of what CLT is all
about, much less what approach is necessary in teaching pronunciation in a CLT setting.
Pronunciation instruction in CLT
Hinofotis and Baily (1980:1245) reported that up to a certain proficiency standard,
the fault which most severely impairs the communication process in EFL/ESL learners is
pronunciation, rather than vocabulary or grammar. Their arguments make pronunciation
instruction all the more important in improving the communicative competence of learners.
Wong reminds us (1993:45) that the most relevant features of pronunciationstress,
rhythm, and intonationplay a greater role in English communication than the
individual sounds themselves. Therefore, teaching speech from the perspective of
suprasegmentals seems indispensable in a CLT setting. Learning pronunciation should not be
limited to finding primary stress and comparing individual vowel and consonant sounds in a
given word, as has often been the case with pronunciation learning in the past. Focusing
on individual vowel and consonant sounds is only the first step in learning English
speech, as Yule, Hoffman, and Damico claim (1987:765).
It is now widely accepted in Japan that the ultimate goal of language teaching is to
encourage learners to acquire communicative competence. This makes it all the more
important that pronunciation instruction should be approached holistically, not
phonetically, and that teaching pronunciation be directed more from the suprasegmental
perspective.
Taking the first step toward change
Because there is so much discrepancy between the holistic concept of teaching
pronunciation and the one already being implemented in Japanese schools, it would be too
difficult to institute a new methodology all at once. So, consider what could be done in
Japanese schools to initiate pronunciation training that goes hand-in-hand with the recent
trend towards CLT.
First, it is essential for English teachers to realize that, besides pronunciation of
individual sounds or phonemes, there are other distinctive aspects of the English sound
system that are not found in the Japanese system: English is a stress-timed language,
whereas Japanese is a syllable-timed language. As is often the case with a syllable-timed
language, Japanese is also an open-syllable language (words in Japanese usually end with a
vowel). When English is pronounced by a Japanese speaker, this phonological feature is
often transferred, and an additional vowel or epenthesis is the result.
Linking and assimilation
Japanese is predominantly open-syllabled, that is, syllables end in a vowel with each
syllable having an equal stress. So words are made with one of the following combinations:
CONSONANT+VOWEL, VOWEL alone, or DIPHTHONG, with the exception of the consonant /n/, which
is the only consonant that can be found either at the beginning or at the end of
syllables. If this system is transferred to an English sentence, "Look at the red
doll!" each word would sound like /lUkuw/, /eetow/, /D@/, and /rEdow/, and /daluw/.2
All of the final consonants are pronounced with an additional vowel. This phenomenon is
often found among English learners whose native tongue is mainly open syllabled. When a
vowel is added in this way, it is unlikely that the speaker will produce English speech
with linking and assimilation occurring at the right places.
Linking is a phenomenon in which two sounds are connected between adjacent words and
pronounced in a continuous flow from one word to another. Avery and Ehrlich (1992:84) say
that linking occurs between the following combinations:
1) Consonants and vowels (e.g., When we pronounce "Come on," we dont
say the two words separately but we say /k@man/ as if they were one word. This is known as
"liaison.")
2) Consonants and consonants (e.g., When we pronounce "root beer," the stop
consonant at the end of the first word is usually kept unreleased, and there seems to be a
pause or a sudden stoppage of breath in between. This is known as "open
juncture.")
3) Identical consonants (e.g., When we pronounce "red deer," the two
identical and adjacent consonants are pronounced as one long consonant. This is another
case of "open juncture.")
Assimilation is a phenomenon in which two adjacent consonants are pronounced and the
articulation of the first consonant is altered under the influence of the second
consonant. (e.g., When "at this" is pronounced, the alveolar /t/ in
"at" becomes an interdental /D/ influenced by the following sound.)
The sounds in each of these three combinations of linking and assimilation are all
consonants. If those consonants are pronounced with an additional vowel, quite often with
a tense vowel, rather than a reduced schwa as shown above, there is no way the speaker can
produce fluent and comprehensible English speech.
It is quite clear what happens when a fluent and comprehensible speaker utters the
sentence: "Look at the red doll!" When pronounced individually, the words are
pronounced /lUk/, /{t/, /D@/, /rEd/, and /dal/. But when those words are pronounced in
normal and comprehensible speech, a liaison linking occurs between /lUk/ and /{t/, and an
assimilation occurs between /{t/ and /D@/ and an open-juncture linking occurs between /rEd
and /dal/. So the whole sentence sounds something like [lUk{t9D@rEd*dal#] ("9"
and "8" denote "dentalized" and "unreleased," respectively.)
The author believes that it is of vital importance for Japanese English learners (and
others whose native language tends to be an open-syllable type) to practice linking and
assimilation. There are three reasons for this. First, because these two features do not
exist in the their first language sound system, learners cannot easily produce the correct
English sounds. As a result, as Yule, et al. (1987) imply, they have a hard time
perceiving, still less comprehending the phrases or sentences they cannot produce.
Second, by practicing these two features, students will become aware of a basic
difference between English and Japanese: English is stress-timed and Japanese is
syllable-timed. Students will then be interested to know about other features of the
English sound system, like rhythm and intonation, both of which are profoundly influenced
by stress timing.
Third, mastery of liaisons and junctures serves as a useful test of whether or not
learners have learned to produce the individual English consonants correctly. This is the
first necessary step in learning English pronunciation. When some consonants are
pronounced with an additional vowel, linking and assimilation will not be possible.
English teachers should begin by having learners realize how important it is to
practice linking and assimilation in English speech. Such a realization is not only
important in helping learners upgrade their oral production of English but also in
improving their listening comprehension skill.
Implications for teacher training
In the proceeding sections it has been argued that in order to upgrade learners
pronunciation skill, it is necessary for learners and teachers to be aware that
1) communicative effectiveness can best be promoted through not a segmental but
suprasegmental approach to pronunciation learning, and that
2) it is practical to begin early on to teach linking and assimilation in English.
It is important for teachers to raise the awareness of their students. But what can
help to encourage teachers awareness in the first place? Inservice and preservice
teacher training courses can insure that teachers will be aware of the state of the art
and be confident in their teaching.
Inservice teacher training
Japan has 47 prefectures and 11 jurisdictional municipalities. A local inservice teacher
training center has been established in each of those administrative divisions and each
gives organized training courses. Much of the achievement in school education is due to
the inservice teacher training system. For example, novice English teachers are required
by the ordinance to attend a one-year-long course in their first year on the job.
The problem with inservice training is that English teachers, like other teachers, find
themselves too busy with daily school chores to attend any of the courses provided, even
though teachers are granted the right to attend courses by the ordinance. It was once said
that the English teaching curriculum was revised too drastically for trainers to catch up
and that trainers were groping in the dark. That seems not to be the case any more and
there are well-qualified teacher trainers who provide well-organized and quality courses
or programs in every training center. So the problem with inservice training is that most
teachers do not have enough time to get new insights at the center even though they want
to. The administration needs to insure that teachers have the time to attend inservice as
well as the opportunity.
Preservice teacher training
To get a high school teaching credential, students attend four-year colleges or
universities and take a required program. The core courses are "English Teaching
Methodology," which is usually a four-credit course and lasts one year, and
"Teaching Practice," which is a two-week course in which students do practical
teaching with the help of practitioners in an authentic classroom setting. "English
Teaching Methodology" consists of theoretical and practical study in which students
are engaged in micro-teaching in a laboratory setting.
The problem with the preservice training is that the instructors lecture on
"English Teaching Methodology." Because the study of TESOL is quite recent here,
there are not enough instructors trained and knowledgeable in that area. Therefore, in
some instances, non-TESOL specialists are giving the course.
Conclusion
The misconception about pronunciation instruction undoubtedly stems from the type of
pronunciation questions used in entrance examinations for universities or colleges. Many
teachers still adhere to the kind of pronunciation practice once predominant during
audio-lingualism which is not consistent with the recent trend towards CLT. Even though
the suprasegmental approach is what is seriously needed in todays language learning
setting, allowance is to be made for a transitional and more practical approach, and
instruction emphasizing phonological linking and assimilation must be initiated. This
approach is essential for those learners whose native language is an open-syllable type.
At the same time, it is important to raise the awareness of all the people involved in
English language learning on implementing this approach in their daily classroom
activities.
References
Avery, P. and S. Ehrlich. 1992. Teaching American English pronunciation. Oxford: Oxford
University Press.
Cronin, J. 1996. Pointers on teaching English stress patterns. The Language Teacher,
19, 6, pp. 1617.
Hinofotis, F. and K. Baily. 1980. American undergraduate reaction to the communication
skills of foreign teaching assistants. On TESOL 80: Building Bridges: Research and
Practice in TESL. Alexandria, VA.
Bronstein, A. 1960. The Pronunciation of American English. Englewood Cliffs:
Prentice-Hall.
Obunsha (Ed). 1996. Zenkoku Koukou Nyushi Mondai Seikai. Tokyo: Obunsha.
Wong, R. 1993. Pronunciation myths and facts. English Teaching Forum, 31, 4, pp.
4546.
Yule, G. and P. Hoffman and J. Damico. 1987. Paying attention to pronunciation: The
role of self-monitoring in perception. TESOL Quarterly, 21, 4, pp. 765768.
(Special thanks go to Dr. Schaefer at Temple University Japan for valuable advice in
completing this article.)