Difficult Japanese Sounds by Native Language: An L1-by-L1 Pronunciation Guide

Difficult Japanese sounds by native language (L1) are not a single ranked list. They vary by which contrasts your L1 already trains and which it ignores.¹ This guide sorts the pitfalls L1 by L1, so you can spend pronunciation effort on the real gap, not on items your L1 already handles.²

Overview

Japanese has a relatively small phoneme inventory by global standards: about fifteen core consonants plus the special morae /N/ and /Q/, and five vowels with length contrasts. As a result, the number of new segments any learner has to acquire is small.³⁴⁵ The persistent difficulties tend to live in the prosodic layer (mora-timing, vowel length, pitch accent) rather than in any single segment.⁶⁷

The problem looks different for each L1. An English speaker stumbles on the ら-row tap and on phonemic vowel length; a Mandarin speaker handles the affricate つ cleanly but over-tones the pitch accent; a Spanish speaker has a near-perfect vowel match and the right tap but no length contrast; a Korean speaker has the prosody but the wrong stop-voicing template.¹²³⁸

What L1 transfer is, and what it is not

In pronunciation, L1 transfer has two well-known causes in second-language speech-acquisition research: perceptual assimilation (the L1 phoneme system filters incoming L2 sounds into the closest native category) and articulatory inertia (the motor routines for the L1 are reused to produce L2 forms).¹² The Perceptual Assimilation Model (PAM) describes the perception side; the Speech Learning Model (SLM) describes how a new L2 category may or may not form.¹²

According to the SLM, "the greater the perceived phonetic dissimilarity between an L2 speech sound and the closest L1 sound is, the more likely learners will be to discern the difference between the L1 and L2 sounds and show measurable progress in production and/or perception."² The corollary explains why close-but-not-identical contrasts (English /l/ vs Japanese /ɾ/, Spanish /i/ vs Japanese /i/) often resist learning longer than wildly different ones: similarity triggers category assimilation rather than category formation.²⁹

According to the PAM, listeners assimilate a non-native sound to a native category in one of several patterns. Discrimination difficulty depends on whether two L2 sounds map to the same L1 category (Single Category, hardest), to one L1 category at different goodness levels (Category Goodness), or to two different L1 categories (Two Category, easiest).¹ The English /r/–/l/ contrast for Japanese listeners is the canonical Single Category case: both English liquids assimilate to the Japanese /ɾ/ category.¹⁰⁹

Here is a useful picture of how PAM sorts the assimilation outcomes:

Iverson, Kuhl, and colleagues describe the underlying mechanism as a perceptual warping of the acoustic space. Experience with the L1 "is described as 'warping' perception, producing a distortion that decreases perceptual sensitivity near category modes and increases perceptual sensitivity near the boundaries between categories."¹⁰¹¹ The L1's prototypes act as magnets; sounds near them are pulled toward the prototype and lose their perceptual identity.¹¹

Two consequences follow, and the rest of this page relies on them. First, the same Japanese sound can be easy for one L1 and hard for another, with no overall "hardness" ranking. Second, positive transfer (an L1 feature that helps) is just as real as negative transfer (an L1 feature that hurts).¹²

Segmental and prosodic problems behave differently

Segmental difficulties (individual consonants and vowels) and prosodic difficulties (mora-timing, vowel length, pitch accent, geminates) have different L1-to-L1 distributions and different remediation paths. Mora-timing in particular is a prosodic property that any non-mora-counting L1 speaker has to learn explicitly, even if the consonants are easy.⁶⁷

The Japanese term for native-language interference is 母語干渉 (bogo kanshō). The term for second-language phonology is 第二言語音声 (dai-ni gengo onsei).⁴ The terms L1 transfer, negative transfer, positive transfer, interference, and cross-linguistic influence are used interchangeably in the literature; this page uses "L1 transfer" by default.¹²

How to use this page

The page is organized L1 by L1. Each section names the L1, lists what transfers cleanly (positive transfer), and lists what bites (negative transfer). End-to-end reading is mainly for readers who teach Japanese to mixed-L1 classrooms. Most learners can read just their L1's section plus the cross-L1 prosody discussions.

A short triage flow for finding the right entry point:

When a problem cuts across L1s (vowel length, mora-timing, pitch accent), the same concept may appear in several sections with L1-specific framing.¹²

The L1-difficulty matrix at a glance

The cells below summarize the source-attributed positions established in each L1 section. They condense the literature for teaching purposes; they are not measured scores.¹²³⁴

For readability on narrow screens, the matrix is split into two tables: one for segmental contrasts (single consonants and vowels) and one for prosodic contrasts (timing, length, pitch). The row order is the same in both.

Segmental contrasts

L1	ら-row /ɾ/	つ [tsɯ]	ふ [ɸɯ]	Vowel inventory match
English	hard (no tap; /r/–/l/ collapse)¹⁰⁹	hard (no native [ts] onset)³	hard (defaults to labiodental /f/)³	medium (length not phonemic)¹²
Mandarin	medium (no tap; no /r/–/l/ collapse)⁸	easy (pinyin "z" is [ts])⁸	medium (closer than English /f/)⁸	medium (/ɯ/ vs /u/ mismatch)⁸
Cantonese	medium (no tap)¹³	medium	medium	medium
Korean	medium (/l/–[ɾ] allophone helps)¹⁴	medium	medium	easy (close five-to-seven match)¹⁴
Spanish	easy (alveolar tap [ɾ] exists)¹⁵	medium (no native [ts])¹⁵	hard (labiodental /f/)¹⁵	easy (five-vowel match)¹⁵
Italian	easy ([ɾ] exists in some contexts)¹⁶	medium	hard (labiodental /f/)¹⁶	easy (close)¹⁶
Vietnamese	medium	medium	medium	medium (duration cues bound to tense-lax)¹⁷¹⁸
French	medium (uvular /ʁ/ in standard)¹⁹	hard (no native [ts])¹⁹	hard (labiodental /f/)¹⁹	medium
German	medium (uvular /ʁ/ standard)⁵	easy (native [ts] "z")⁵	hard (labiodental /f/)⁵	medium (length contrast exists)⁵
Thai	medium (alveolar tap exists)²⁰	medium (native [tɕ])²⁰	hard	easy (phonemic length exists)²⁰
Hindi/Urdu	hard (retroflex /ɽ/ habit leaks)²¹	medium	hard	medium (length partly contrastive)²¹

Prosodic contrasts

L1	Long vs short vowels	Mora-timing	Geminates /Q/	Mora-N /N/	Pitch accent
English	hard (no phonemic length)²²¹²	hard (stress-timed)⁷	hard (no native gemination)²³	medium (no native moraic nasal)⁵	hard (English stress model misfires)²⁴²⁵
Mandarin	medium (no phonemic length, no reduction)¹⁷	medium (syllable-timed)¹³⁷	hard (no native gemination)¹³	medium (final /n ŋ/ only)⁸	hard (tone-language model misfires)²⁴
Cantonese	medium	medium (syllable-timed)¹³	easy-medium (checked-syllable length cue)¹³	medium (final /m n ŋ p t k/)¹³	hard (tone-language model misfires)
Korean	medium (length largely neutralized in modern Seoul)¹⁴	easy (mora-equivalent timing)¹⁴	medium-easy (tense-consonant analogue)¹⁴	easy (final /n ŋ m/, glottal release)¹⁴	medium (Seoul Korean prosody no longer lexical)¹⁴
Spanish	hard (no phonemic length)¹⁵	medium (syllable-timed; closer than English)¹⁵	hard (no native gemination)¹⁵	medium (only /n/ word-finally)¹⁵	hard (no lexical accent of Tokyo type)¹⁵
Italian	hard (no phonemic length on vowels)¹⁶	medium (syllable-timed)¹⁶	easy (native geminate inventory)¹⁶	medium	hard
Vietnamese	medium (duration cues exist for tense-lax)¹⁷	hard (sesquisyllabic, not mora-counted)¹⁸	hard (final /p t k/ unreleased)¹⁸²⁶	hard (limited final-nasal inventory)¹⁸	hard (six-tone system misfires)¹⁷
French	hard (no phonemic length)¹⁹	hard (phrase-final stress)¹⁹	hard	medium (nasal vowels, no moraic nasal)¹⁹	hard
German	medium (length contrast exists)⁵	hard (stress-timed)⁷	medium	medium	hard
Thai	easy (phonemic length exists)²⁰	medium-hard (syllable-timed, tonal)²⁰	medium-hard	medium	hard (five tones misfire)²⁰
Hindi/Urdu	medium (length partly contrastive)²¹	hard (stress-timed)²¹	hard	medium	hard

Reading the matrix

Cells label articulatory and perceptual difficulty for an L1 speaker who is producing or perceiving Japanese. They do not label the absolute properties of the Japanese sound. A "hard" cell means the L1 lacks a category that maps cleanly; an "easy" cell means the L1 already supplies the category.

English-speaker pitfalls

Segmental problems: the ら-row, つ, ふ

English speakers face three single-segment hurdles that recur in the literature: the ら-row tap [ɾ], the つ-cell affricate [ts] at the start of a word, and the ふ-cell bilabial fricative [ɸ].³⁵

In PAM terms, the English /r/–/l/ contrast and the Japanese /ɾ/ form a canonical Single Category assimilation. Japanese speakers assimilate English /l/ into their flap category more strongly than they assimilate /r/, and reciprocally the Japanese /ɾ/ "is difficult for Japanese speakers to perceive and produce; it is not used in the Japanese language" from the English-listener side because the English /r/ and /l/ categories cover the perceptual space where /ɾ/ would sit.¹⁰¹¹⁹ For an English speaker producing Japanese, the matching error is to insert an English /r/-like approximant or an English /l/-like lateral. Neither is the apical tap.³⁹

来週らいしゅう日本にほんに行いきます。³
"I'm going to Japan next week."

In the example above, ら carries [ɾa]. An English speaker's default substitution often turns it into the approximant [ɹa]. The atom article on the ら-row owns the drills.⁹

English does not have /ts/ at the start of a word. The Japanese つ requires an alveolar affricate [ts] in onset position, at the start of the syllable. English uses this articulation only word-finally, as in cats. The default English transfer is to drop the [t] and produce a sibilant fricative [s], yielding "soonami" for 津波.³

津波つなみに注意ちゅういしてください。³
"Please be careful of tsunami."

English /f/ is a labiodental fricative (upper teeth on the lower lip); the Japanese ふ is a voiceless bilabial fricative [ɸ] (lips brought close together, no teeth contact).³⁵ The Hepburn romanization "fu" steers English readers toward the labiodental. That is the closest English-spelling approximation an English reader will recognize, but it is not the actual Japanese articulation.³⁵

The blow-out-a-candle gesture for ふ

The articulation target for ふ is "lips together, no teeth contact, like blowing out a candle." If the lower lip touches the upper teeth, the result is English /f/, not Japanese [ɸ].³⁵

富士山ふじさんに登のぼります。⁵
"I climb Mt. Fuji."

Vowel-length blindness

English does not use vowel length phonemically, meaning length alone does not distinguish words. English "tense vs lax" pairs (e.g., /iː/ in beat vs /ɪ/ in bit) differ primarily in vowel quality, with duration as a secondary cue.¹² Japanese, in contrast, contrasts vowels by duration alone while keeping vowel quality fixed.³¹²

American English listeners prior to perceptual training identify Japanese vowel length poorly (39% accurate).²²¹² Hirata (2004) showed that targeted training raised this accuracy substantially when learners "were instructed to count the number of morae in each training" item. This anchored the duration cue to a counted-beat framework.²²

A cross-dialect finding sharpens the picture. Australian English listeners performed better than American English listeners because "duration cues play a prominent role across all vowel categories, even nonnative, for Australian English listeners," whereas American English listeners "categorized Japanese long and short vowels (e.g., /ii, i/) as most similar to AmE tense vowels regardless of length."¹²

The English problem is not articulatory. English speakers can hold a vowel for any duration. The problem is perceptual: English listeners hear quality as the diagnostic and miss length.²²¹² The consequence is mishearing minimal pairs like おばあさん obāsan "grandmother" vs おばさん obasan "aunt."³

おばあさんは元気げんきです。¹²
"Grandma is well."

The second mora here is a held /a/ (long vowel あ + あ, for three morae total before -san). Beginners often hear and produce obasan (aunt) instead.²²¹²

病院びょういんに行いきます。⁵
"I am going to the hospital."

びょういん is four morae (びょ / う / い / ん) with a long [oː]. An English speaker who drops the length hears びよいん or ぴょいん, neither of which is a word.³⁵

Mora-timing vs. stress-timing

English is stress-timed: stressed syllables recur at roughly equal intervals, and the unstressed syllables between them compress with vowel reduction (the schwa).⁷ Japanese is mora-timed: every mora occupies roughly the same duration, with no reduction of unstressed morae.⁶⁷

A growing body of research classifies the world's languages into three rhythm classes: mora-timed, stress-timed, and syllable-timed. English exemplifies stress-timing, Japanese mora-timing, and Mandarin syllable-timing.⁷

For the English-L1 learner, the consequence is foot-driven vowel reduction leaking into Japanese. Morae the learner perceives as "unstressed" get compressed, flattening the beat count that distinguishes minimal pairs.⁷ The English speaker's two prosodic problems (vowel length and mora-timing) are one problem at the root: English does not use duration phonemically, so the perception system is not tuned to time-counted beats.²²¹²⁷

学校がっこうに行いきます。⁵
"I'm going to school."

がっこう is four morae (が / っ / こ / う): the geminate /Q/ and the long /oː/ each fill a mora. An English speaker often produces a two-syllable [ˈɡakoʊ] with stress on the first syllable; the four-beat count is lost.⁶⁷

Pitch accent and the stress habit

English bundles four cues into "stress": pitch, length, loudness, and vowel quality.²⁴⁷ Tokyo Japanese pitch accent uses one of these cues, pitch, with at most one falling-pitch transition per accentual unit.²⁴ When English stress transfers to Japanese, it produces a four-cue exaggeration where one cue is needed.

Production data on English-speaker pitch accent is sobering. English speakers "produced only 43% of words with accent types matching Standard Japanese norms, regardless of experience level. The highest-performing individual reached just 52% accuracy," and "approximately 60% of words showed inconsistent accent types across different contexts."²⁴ More-experienced learners "showed identical accuracy and stability to less-experienced learners," suggesting that "additional experience does not contribute to increased accuracy."²⁴

The dominant error patterns are not random. "Many learners defaulted to penultimate accent placement, while others favored unaccented production, neither reflecting Standard Japanese distribution. Learners with high stability had a dominant accent type that they used irrespective of a word's accent type."²⁴

Experience alone does not move pitch-accent accuracy

The Muradás-Taylor (2022) data shows that English speakers' accent accuracy plateaus near 43% and does not climb with general immersion. Targeted training, not exposure, is what shifts the curve. Treat pitch accent as an explicit study item.²⁴

English speakers do not lack the ear for pitch. They lack the habit of encoding pitch as part of the word. The four-cue stress model substitutes for the one-cue pitch model unless drilled explicitly.²⁴²⁵

雨あめが降ふっています。²⁴
"It's raining."

あめ "rain" is atamadaka (HL); ame "candy" is unaccented (LH-flat in citation). English speakers default to penultimate stress and produce both as LH or HL inconsistently; minimal-pair accuracy stays at chance.²⁴

Mandarin and Cantonese-speaker pitfalls

What transfers cleanly

Three Japanese features have closer Mandarin analogues than English analogues. Mandarin has the affricate /ts/ as the initial pinyin z, so the つ-cell is mostly a non-issue at the onset.⁸ Mandarin /f/ is labiodental, but Mandarin speakers tend to transfer it less aggressively to the ふ-cell than English speakers do. The substitution sounds less foreign because the surrounding vowel /u/ matches well.⁸ Mandarin lacks the English /r/–/l/ contrast that "kidnaps" the ら-row. The tap is still new, but the bidirectional confusion is not present.¹⁰⁸

Cantonese has a separate positive transfer that Mandarin does not: long-consonant sequences in checked syllables (codas /p t k/) that act as a hook for Japanese geminates. Ren (2022) shows the result quantitatively: at the beginner level, Cantonese learners' geminate identification accuracy is 89.23% versus Mandarin learners' 63.89%.¹³

The dialect background matters: "The short-long consonant contrast only appears in some Chinese southern dialects, such as Cantonese, Fukienese, and Hakka." And: "Mandarin-speaking learners who have neither checked tones nor long consonant sequences in their dialect were shown to be less accurate than Cantonese-speaking learners at the beginner stage."¹³

Tone does not transfer to pitch accent

Mandarin assigns lexical tone to each syllable (high-level, rising, low-dipping, falling, plus neutral), so pitch contour helps distinguish words.⁸ Japanese pitch accent assigns at most one falling-pitch transition per accentual unit, with the rest of the word filling in unaccented L or H morae.²⁴ The two systems use F0, the acoustic measure of pitch, differently: per-syllable contour in Mandarin, one drop per unit in Japanese.²⁴⁸

The common Mandarin-L1 error is over-toning: assigning a contour to each syllable as if it were a tone-bearing unit. This makes Japanese sound "sing-song" to a native listener. The atom article on pitch accent owns the corrective frame.²⁴

Mandarin speakers have the prosodic ear, since pitch is a lexically relevant cue in their L1, but they have the wrong template. In SLM terms this is a category-formation problem: the Japanese accent unit is a new category that does not align with any Mandarin tone.²²⁴

Mora-timing for syllable-timed speakers

Mandarin is syllable-timed; Japanese is mora-timed.⁷ The consequence is that special morae (long vowels, geminates, and the moraic /N/) get compressed because, in Mandarin terms, they are smaller than a syllable.¹³⁶

Beyond the Cantonese advantage above, Mandarin beginners' Japanese geminate identification sits at 63.89% versus native speakers' 95.74%.¹³ Production is similarly affected: "when mispronouncing a singleton as a geminate, the consonant closure is longer and the preceding vowel segment is shorter than in correctly pronounced stops. Conversely, mispronouncing a geminate as a singleton results in shorter closure and a longer following vowel segment."¹³

切手きってをください。¹³
"A stamp, please."

きって contains the moraic /Q/ (the small っ). Mandarin learners often produce きて (three morae, no geminate), changing the word.¹³⁶

今日きょうはいい天気てんきです。⁵
"It's good weather today."

きょう is two morae (きょ / う); いい is two morae (い / い). Mandarin speakers tend to produce single-mora versions, halving the beat count.¹³⁷

Vowel length and the high-front vowel

Mandarin has no phonemic vowel length.⁸ Japanese has phonemic length on every vowel.³¹² The Mandarin /i/ is close to Japanese /i/, but the Japanese /ɯ/ (compressed /u/) is distinct from Mandarin /u/ (rounded).⁸³

Mandarin vowel-length errors are slightly less severe than English-L1 errors because Mandarin keeps vowels close to their target quality (no reduction). This gives the listener a cleaner sequence to count. The duration problem remains the same; the masking is lighter.¹²⁸

いいえ、違ちがいます。⁵
"No, that's different."

いいえ is three morae (い / い / え); ち is [tɕi]. A Mandarin speaker may produce いえ (two morae), changing the word.⁵¹²

Korean-speaker pitfalls

What transfers cleanly

Among major learner L1s, Korean is the closest typological neighbor to Japanese. The vowel inventory overlaps substantially: Korean has /i e ɛ a o u ɯ/, Japanese has /i e a o ɯ/, with overlapping articulations.³¹⁴

Korean is mora-equivalent in timing: it "exhibits mora-timing rather than strict syllable-timing."¹⁴ The Korean /l/–[ɾ] allophonic pattern means the alveolar tap is already in the L1's surface inventory. In Korean, "the liquid /l/ shows substantial allophonic variation, it's a flap [ɾ] between vowels but a lateral [l] word-finally."¹⁴

Korean has tense consonants /p͈ t͈ k͈ s͈/ that act as a partial analogue for Japanese geminates: both involve a longer closure than the plain counterpart. The mapping is imperfect: Korean tense consonants involve glottal tension, while Japanese geminates involve pure length. But the perceptual cue, a longer hold, overlaps.¹⁴

A Korean learner who has been told that "Japanese is hard" is often warned about the wrong things. The articulation problems are mostly already solved by the L1; the remaining problems are in voicing and lexical accent, not in segments.

Where it bites: voicing and devoicing

Korean obstruents have a three-way contrast (plain/lenis, aspirated, tense). This does not map to the Japanese two-way voiced/voiceless contrast. "The language features a three-way contrast between unvoiced segments, which are distinguished as plain, tense, and aspirated."¹⁴ "Plain stops (lenis) like /p, t, k/ become voiced intervocalically."¹⁴

The systematic Korean error follows from that pattern: word-initial Japanese voiced obstruents (が, ば, だ) come out as voiceless lenis. Korean lenis stops are voiceless in initial position and voice only between vowels. Holliday (2019) shows the mirror direction quantitatively: "naïve Japanese listeners consistently perceive Korean fortis stops as voiced, and Korean lenis and aspirated stops as voiceless, novice second language learners do not produce any significant difference among the three stop categories."²⁷

The voicing error is predicted, not measured

No direct experimental study of L1 Korean speakers producing Japanese voiced obstruents was located. The claim above is supported by Korean phonology¹⁴ and by the mirror-direction Holliday data²⁷; treat it as predicted by the Korean stop system rather than as a measured percentage.

One useful interaction is that Japanese vowel devoicing ("/i/ and /u/ devoice between voiceless consonants or before a pause," producing です as [des])⁵ aligns with the Korean lenis pattern. Korean speakers often produce devoiced です correctly without instruction.⁵

Korean speakers do internalize Japanese rendaku with the right constraint. "Native Chinese (N=32) and Korean (N=32) speakers learning Japanese, matched for their lexical and grammatical knowledge, avoided applying rendaku in compounds with a medial voiced obstruent in the second element, indicating that Lyman's Law is an active principle even in L2 acquisition."²⁸

学校がっこうに行いきます。⁵
"I'm going to school."

が at the start of the word is voiced [ɡ]. Korean speakers may default to voiceless lenis [k], producing what sounds like カッコウ. The voicing contrast must be tagged consciously.²⁷¹⁴

友達ともだちと勉強べんきょうします。⁵
"I study with a friend."

ど in ともだち and べ in べんきょう are voiced obstruents in the middle of a word. Korean intervocalic voicing applies cleanly here, so this is not a problem; the problem is word-initial.²⁷¹⁴

Pitch accent vs. Seoul intonation

Modern Seoul Korean does not have lexical pitch accent. It "shows high pitch that gradually comes down in subsequent syllables, with ongoing tonogenesis where consonant distinctions increasingly rely on pitch rather than voice-onset time."¹⁴ The prosodic phrase has a characteristic LH initial contour, but that contour does not distinguish words.¹⁴

Korean learners have a sharp prosodic ear but no template for storing accent as part of each word. This is closer to the English-L1 situation than to the Mandarin-L1 situation: the cue (pitch) is present in the L1, but encoding-on-lexical-items is not.²⁴¹⁴

Spanish and Italian-speaker pitfalls

What transfers cleanly

Spanish has a five-vowel system /i e a o u/ "that appears in both stressed and unstressed positions."¹⁵ Japanese has /i e a o ɯ/. The match is one-to-one in number and almost one-to-one in quality, with /ɯ/ as the only non-trivial difference.³¹⁵

Spanish has the alveolar tap [ɾ] as one of its two rhotics. "The [alveolar trill] and the [alveolar tap] are in phonemic contrast word-internally between vowels."¹⁵ The Japanese /ɾ/ is the same articulation.¹⁵ Italian has phonemic geminate consonants throughout its native vocabulary, making Japanese geminates a one-to-one transfer at the articulation level.¹⁶

Romance learners have the strongest segmental positive transfer of any major L1 group. Vowels match, the ら-row matches (Spanish, Italian to a lesser extent), and Italian has gemination. The remaining problems are prosodic and orthographic.

Where ら-row and ふ still bite

Spanish [ɾ] is articulated at the alveolar ridge, the right place for Japanese. But Spanish speakers may carry a trill habit ([r] for the double "rr" grapheme) into Japanese when reading romaji that uses double letters in context.¹⁵ The trill is not a Japanese sound; in PAM terms, it is an L1 articulatory variant that is structurally adjacent but unavailable as a substitution.¹¹⁵

Italian /f/ is labiodental, like English /f/.¹⁶ The ふ-bilabial gap is the same as the English-L1 gap.³¹⁶

The Romance-L1 segmental work list is short: drop the trill for Japanese /ɾ/, and switch /f/ to bilabial for ふ.³¹⁵¹⁶

Vowel length is new

Spanish and Italian do not use vowel length phonemically.¹⁵¹⁶ The Japanese long-short contrast is new for both. The framing differs from the English-L1 case. Spanish and Italian speakers keep their vowels at full quality, which makes the perceptual count easier than for English speakers. The production problem remains.¹⁵¹⁶

Italian has phonemic consonant length (geminates), which gives Italian learners a length-cue prior that does not exist in Spanish.¹⁶ The transfer is uneven: consonant length transfers, but vowel length does not.¹⁶

Romance-L1 vowel-length errors tend to be of the "I forgot" variety rather than the "I can't hear it" variety. Remediation is closer to habit tagging than to perceptual retraining.¹²¹⁵¹⁶

病院びょういんは遠とおいです。¹²
"The hospital is far."

びょういん is four morae with [oː]. とおい is three morae with [oː]. Spanish speakers produce [bjoin] and [toi] (dropping length), recognizable but shorter than native.¹²¹⁵

Pitch accent: the closest analogue is intonation, not lexical accent

Spanish has stress, marked orthographically and signaled by pitch and loudness. But assignment follows predictable rules of penultimate or marked stress; it is not a lexical-accent system.¹⁵ Italian is similar.¹⁶ Romance learners have the cue (pitch involvement in stress) but not the lexical-accent template.²⁴¹⁵

For Romance L1s, the pitch-accent problem patterns more like the English-L1 problem than the Mandarin-L1 problem. The prosodic system is there, but pitch accent on lexical items is new.²⁴

Vietnamese-speaker pitfalls

Tone does not transfer to pitch accent

Vietnamese has six tones differentiated by "pitch contour, duration, intensity, and phonation type rather than pitch alone." Phonation type means voice quality, such as creaky or breathy voice.¹⁸ The system assigns tone syllable by syllable, like Mandarin.¹⁸ The Japanese pitch-accent system is per-accentual-unit, with at most one fall.²⁴

The mismatch is sharp. "Despite L1 experience with lexical tones, Vietnamese learners struggled to use pitch as a secondary cue. Tone languages like Vietnamese are canonical tone languages where pitch variations occur on individual syllables, while pitch-accent languages like Japanese have pitch that varies across consecutive syllables rather than individual ones."¹⁷

Vietnamese speakers face the same over-toning problem as Mandarin L1 speakers. It is compounded by phonation-type cues (creaky and breathy voice) that are not part of Japanese accent and may leak as unwanted voice-quality contrasts.¹⁷¹⁸

Mora-timing for a sesquisyllabic L1

Vietnamese is morphosyllabic and based on tone-bearing units, not mora-counted. The relevant unit for L1 Vietnamese is the tone-syllable.¹⁸ Japanese morae include sub-syllabic units (long-vowel halves, geminate halves, the moraic /N/) that have no Vietnamese counterpart.¹⁷¹⁸⁶

Le, Kondo, and Tsukada (2025) tested 75 Vietnamese learners across N1–N3 levels. They found that "Vietnamese learners primarily relied on duration cues similar to native speakers, but demonstrated delayed categorical boundaries, suggesting incomplete acquisition of the distinction. L1 experience with contrastive duration in Vietnamese tense-lax vowels may enhance durational sensitivity, but it does not fully resolve challenges in distinguishing Japanese contrasts." Only N1 learners showed "modest adaptation in integrating pitch with duration."¹⁷

Vietnamese duration sensitivity is a partial positive transfer: the cue is available, but it is bound to L1 tense-lax contrasts, not to Japanese mora-counted lengthening.¹⁷

Voicing, final consonants, and the mora-N

Vietnamese final stops are "restricted to labial, coronal, and velar stops and nasals /p, t, k, m, n, ŋ/" and are "pronounced with no audible release," meaning there is no clear burst at the end.¹⁸ Japanese has no final stops in native vocabulary except the moraic /Q/ (geminate) and /N/ (moraic nasal), which behave as morae, not codas.³⁵

In an L1 Vietnamese mispronunciation study, Tsushima et al. (2022) document that Vietnamese learners produce Japanese singleton/geminate distinctions with systematic durational compression compared to native speakers. The geminate side undershoots more than the singleton side.²⁶

The moraic /N/ assimilates to the place of the following segment. For example, it becomes [m] before /p b m/, [ŋ] before /k g/, and [ɴ] at the end of an utterance.⁵ Vietnamese's nasal-coda inventory /m n ŋ/ does not include the place-assimilated variant or the moraic count. As a result, Vietnamese production of ん tends to drop the mora count and pick one nasal place arbitrarily.⁵¹⁸

Vietnamese has implosive stops /ɓ ɗ/ at the bilabial and alveolar places. Implosive stops are made with inward airflow rather than the outward airflow of ordinary stops.¹⁸ These are not in Japanese. Vietnamese production of ば and だ may show implosion as a residual L1 articulation cue, recognizable as accented but not as a misidentification of the word.¹⁸

三人さんにんで本ほんを読よみます。²⁶
"Three of us read the book."

さんにん is four morae (さ / ん / に / ん), with two /N/ slots plus a long /n/ at the boundary. Vietnamese speakers compress the mora count, producing sannin as two syllables.⁵²⁶

切符きっぷを買かいました。²⁶
"I bought a ticket."

きっぷ has the /Q/ slot. Vietnamese speakers may compress it to きぷ (no gemination) because Vietnamese final /p/ is unreleased and does not lengthen the closure.¹⁸²⁶

Other L1 contexts (brief)

French, German, and other European L1s

Standard French has uvular /ʁ/, produced near the back of the mouth, which is not the Japanese /ɾ/.¹⁹ French has three nasal vowels (/ɑ̃ ɛ̃ ɔ̃/) but no moraic nasal. French nasal vowels are vowels with a [+nasal] feature, not a separate coda mora.¹⁹ French speakers may map the Japanese moraic /N/ to a nasal vowel, dropping the mora count.⁵¹⁹

French places phrase-final stress at the end of the prosodic group, not lexically.¹⁹ The Japanese pitch-accent template is foreign in the same way as for English L1.²⁴¹⁹

German has the affricate /ts/ (the letter "z"), which gives a clean positive transfer to the つ-cell.⁵⁸ German /r/ is uvular in the standard variety. The ら-row is foreign in the same way as for French speakers.⁵¹⁹ German has phonemic vowel length (with a quality difference), which gives a partial transfer for the long-short contrast.⁵ German is stress-timed, so mora-timing remains foreign.⁷

European L1s broadly share the vowel-length and mora-timing problems of English L1, with isolated wins (German /ts/, French nasal vowels weakly helpful for ん). The atom articles cover the segments; this article does not duplicate them.

Tonal South-East Asian L1s (Thai, Lao, Burmese)

Thai has five tones and phonemic vowel length. This gives Thai the strongest positive transfer for Japanese vowel-length contrasts among the tonal L1s.²⁰ The pitch-accent problem is the same as for Mandarin and Vietnamese: per-syllable tone is the wrong template for one-drop accent.²⁴²⁰

Lao and Burmese fit a similar tone-vs-pitch-accent profile based on the L1 inventory. Direct L2 studies on Japanese were not located in the research pass. The framing here is by typological inference from the general tone-language transfer logic.¹²²⁰

Indo-Aryan and Dravidian L1s (Hindi, Urdu, Tamil)

Hindi/Urdu has a four-way stop contrast (voiceless unaspirated, voiceless aspirated, voiced unaspirated, voiced aspirated) plus a retroflex series /ʈ ɖ ɳ ɽ/.²¹ The retroflex /ɽ/ habit can leak into the Japanese ら-row. The place is wrong (postalveolar to retroflex versus Japanese apical alveolar), and the contact pattern is different.⁵²¹

Hindi has phonemic vowel length on some pairs (/ɪ/ vs /iː/), but with a quality difference, similar to English tense-lax.²¹ The Japanese vowel-length problem is closer to the English-L1 problem than to the Thai problem.²¹

Tamil and other Dravidian L1s carry retroflex /ɭ ɻ/ and, in some varieties, a strong tap-trill distinction. The L1 articulation habits for the rhotic-lateral space differ from Japanese. The framing for Tamil is based on the L1 inventory; direct L2 studies on Japanese were not located. Phonemic vowel length in Tamil should give a positive transfer for the long-short contrast comparable to Thai.

Good to know

Positive transfer is real and underused

The L2 phonology literature focuses on negative transfer (L1 features that cause errors); pedagogical sources rarely flag positive transfer (L1 features that help).¹² As a result, learners spend study time on items their L1 already handles.

The easy lanes by L1 are sourced inside their sections: Spanish ら-row,¹⁵ Italian geminates,¹⁶ Mandarin つ,⁸ Cantonese geminates,¹³ Korean mora rhythm and ら-row,¹⁴ Thai vowel length.²⁰ A learner who knows which cells are green for their L1 can route study time to the red cells.

The "no L1 matters" claim is wrong, but the "Japanese is easy" claim is not

Japanese has a relatively small phoneme inventory by global standards: roughly fifteen core consonant phonemes plus /N Q/, five vowels with length contrasts.³⁴⁵ This is fewer than English (twenty-four or more consonants, eleven or more vowels), Mandarin (with retroflex and palatalized series), and Hindi/Urdu (with retroflex and aspirated series).⁵⁸²¹

For most L1s, the number of new segments to acquire is small. The persistent difficulties (vowel length, mora-timing, pitch accent) are not about new segments. They are about new prosodic categories. "Japanese pronunciation is easy" is true at the segmental level for most L1s and false at the prosodic level for every L1.¹²⁶⁷

Romaji hides L1 problems

Every L1 reads romaji through its own letter-to-sound map. An English reader who sees fu in Fuji pronounces it with labiodental /f/. The actual Japanese articulation is bilabial [ɸ].³⁵

Hepburn romanization was designed from an English-speaker baseline so that "speakers unfamiliar with Japanese will generally be more accurate when pronouncing unfamiliar words romanized in the Hepburn style"⁵. Readers of other L1s map the same letters through their own letter-to-sound rules. The same romanization that helps English speakers misleads Spanish speakers (who read ji as [xi]) and German speakers (who read ji as [ji]).³⁵

The corrective habit is to anchor pronunciation on the kana, not on the romaji.

富士山ふじさんに登のぼります。⁵
"I climb Mt. Fuji."

ふ here is [ɸɯ̟] regardless of how the romaji renders it.

Heritage speakers and partial L1

Heritage Japanese speakers (raised in a Japanese-speaking household abroad, often with a non-Japanese dominant language) have inconsistent transfer profiles. Some features are L1-Japanese-quality (vowel length, mora-timing), while others are dominant-language-quality (consonant inventory, pitch accent).²

The SLM predicts this asymmetric outcome. Features acquired before the perceptual narrowing window (around twelve months for vowels)¹¹ resemble L1 categories; features acquired later resemble L2 categories. Treat heritage speakers as a separate case rather than slotting them into the "L1 = dominant language" column of the matrix.²¹¹

Acoustic-cue weighting per L1

Mandarin and Cantonese listeners "paid more attention to the pitch contour, while Japanese listeners attended more to the pitch height" in pitch-perception tasks.¹⁷ Vietnamese listeners use duration primarily for vowel-quality contrasts.¹⁷ English listeners use duration as a secondary cue to spectral quality.¹² These are not preferences. They are cue-weighting profiles built by years of L1 input.¹¹

A teacher who knows a learner's L1 cue-weighting profile can target the missing cue directly. For English L1, the target is duration as a primary cue. For Mandarin L1, it is pitch height as a primary cue (not contour). For Vietnamese L1, it is duration on vowels independent of vowel quality.¹¹¹⁷¹²

Training studies actually work

Targeted perceptual training can substantially improve L2 perception, even in adults. Hirata (2004): English-L1 listeners' Japanese vowel-length identification rose from 39% with counted-mora training.²² Tajima et al. (2008): English-L1 listeners' length identification improved with multimodal audio-visual training.²³ Iverson, Kuhl et al. (2003): Japanese-L1 listeners' /r/–/l/ identification improved with prototype-stretched training, though the effect is partial.¹⁰

The pessimistic framing, "you can't learn it after the critical period," is overstated. The realistic framing is that adult perceptual reorganization is slower and requires explicit, cue-targeted training. It is not impossible.¹⁰¹¹²²²³

References

Best, Catherine T. "A direct realist view of cross-language speech perception." In Speech Perception and Linguistic Experience: Issues in Cross-Language Research, edited by Winifred Strange. York Press, 1995, pp. 171–204. (PAM, the Perceptual Assimilation Model.) Authoritative summary in Mizera, "Second language phonology and perceptual assimilation of foreign sounds." https://www.keiwa-c.ac.jp/wp-content/uploads/2012/12/kiyo21-2.pdf ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³
Flege, James E. "Second language speech learning: Theory, findings, and problems." In Speech Perception and Linguistic Experience, edited by Winifred Strange. York Press, 1995. (SLM, the Speech Learning Model.) Revised version: Flege, J. E., and Bohn, O.-S. "The Revised Speech Learning Model (SLM-r)." In Second Language Speech Learning: Theoretical and Empirical Progress, edited by Wayland. Cambridge University Press, 2021. https://www.cambridge.org/core/books/abs/second-language-speech-learning/revised-speech-learning-model-slmr/7A720FCB65B653B00C766A436908B1A7 ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶
Vance, Timothy J. The Sounds of Japanese. Cambridge University Press, 2008. https://www.cambridge.org/core/books/sounds-of-japanese/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸ ↩¹⁹ ↩²⁰ ↩²¹ ↩²² ↩²³ ↩²⁴ ↩²⁵ ↩²⁶
Kubozono, Haruo (ed.). Handbook of Japanese Phonetics and Phonology. De Gruyter Mouton, 2015 (Handbooks of Japanese Language and Linguistics 2). ↩ ↩² ↩³ ↩⁴
"Japanese phonology." Wikipedia. https://en.wikipedia.org/wiki/Japanese_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸ ↩¹⁹ ↩²⁰ ↩²¹ ↩²² ↩²³ ↩²⁴ ↩²⁵ ↩²⁶ ↩²⁷ ↩²⁸ ↩²⁹ ↩³⁰ ↩³¹ ↩³² ↩³³ ↩³⁴ ↩³⁵ ↩³⁶ ↩³⁷
"Mora (linguistics)." Wikipedia. https://en.wikipedia.org/wiki/Mora_(linguistics) ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸
Liu, Sha, and Takeda, Toshifumi. "Mora-timed, stress-timed, and syllable-timed rhythm classes: Clues in English speech production by bilingual speakers." Acta Linguistica Academica 68, no. 3 (2021): 350–369. https://akjournals.com/view/journals/2062/68/3/article-p350.xml ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶
"Standard Chinese phonology." Wikipedia. https://en.wikipedia.org/wiki/Standard_Chinese_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷
Aoyama, Katsura, Flege, James E., Guion, Susan G., Akahane-Yamada, Reiko, and Yamada, Tsuneo. "Perceived phonetic dissimilarity and L2 speech learning: The case of Japanese /r/ and English /l/ and /r/." Journal of Phonetics 32, no. 2 (2004): 233–250. https://www.sciencedirect.com/science/article/abs/pii/S0095447003000366 ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶
Iverson, P., Kuhl, P. K., Akahane-Yamada, R., Diesch, E., Tohkura, Y., Kettermann, A., and Siebert, C. "A perceptual interference account of acquisition difficulties for non-native phonemes." Cognition 87, no. 1 (2003): B47–B57. https://ilabs.uw.edu/wp-content/uploads/2022/05/Iverson_Kuhl_2003.pdf ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷
Kuhl, P. K., Conboy, B. T., Coffey-Corina, S., Padden, D., Rivera-Gaxiola, M., and Nelson, T. "Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e)." Philosophical Transactions of the Royal Society B 363, no. 1493 (2008): 979–1000. https://pmc.ncbi.nlm.nih.gov/articles/PMC2606791/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸
Yazawa, Kakeru, Whang, James, and Escudero, Paola. "Australian English listeners' perception of Japanese vowel length reveals underlying phonological knowledge." Frontiers in Psychology 14 (2023): 1122471. https://pmc.ncbi.nlm.nih.gov/articles/PMC10639153/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸
Ren, Aiwen. "Perception of Japanese singleton and geminate contrasts: A case of Chinese learners with different dialectal backgrounds." Frontiers in Psychology 13 (2022): 1070107. https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2022.1070107/full ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵
"Korean phonology." Wikipedia. https://en.wikipedia.org/wiki/Korean_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸ ↩¹⁹ ↩²⁰
"Spanish phonology." Wikipedia. https://en.wikipedia.org/wiki/Spanish_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸ ↩¹⁹ ↩²⁰ ↩²¹ ↩²² ↩²³
"Italian phonology." Wikipedia. https://en.wikipedia.org/wiki/Italian_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷
Le, Trang Thi Huyen, Kondo, Mariko, and Tsukada, Kimiko. "Cross-linguistic transfer of acoustic cues: Perception of Japanese vowel length by learners from Vietnamese-speaking background." Second Language Research (2025). https://journals.sagepub.com/doi/10.1177/02676583251355203 ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹²
"Vietnamese phonology." Wikipedia. https://en.wikipedia.org/wiki/Vietnamese_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴
"French phonology." Wikipedia. https://en.wikipedia.org/wiki/French_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹²
"Thai phonology." Wikipedia. https://en.wikipedia.org/wiki/Thai_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰
"Hindi–Urdu phonology." Wikipedia. https://en.wikipedia.org/wiki/Hindustani_phonology ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹
Hirata, Yukari. "Training native English speakers to perceive Japanese length contrasts in word versus sentence contexts." Journal of the Acoustical Society of America 116, no. 4 (2004): 2384–2394. ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸
Tajima, Keiichi, Kato, Hiroaki, Rothwell, Amanda, Akahane-Yamada, Reiko, and Munhall, Kevin G. "Training English listeners to perceive phonemic length contrasts in Japanese." Journal of the Acoustical Society of America 123, no. 1 (2008): 397–413. ↩ ↩² ↩³
Muradás-Taylor, Becky. "Accuracy and Stability in English Speakers' Production of Japanese Pitch Accent." Language and Speech 65, no. 2 (2022): 377–403. https://pmc.ncbi.nlm.nih.gov/articles/PMC9014664/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴ ↩¹⁵ ↩¹⁶ ↩¹⁷ ↩¹⁸ ↩¹⁹ ↩²⁰ ↩²¹
Tsurutani, Chiharu. "Intonation of Japanese sentences spoken by English speakers." INTERSPEECH 2011. https://www.semanticscholar.org/paper/Intonation-of-Japanese-sentences-spoken-by-English-Tsurutani/6bb2e650b5f9b929174f5b17af280d0953fed247 ↩ ↩²
Tsushima, Teruaki, et al. "Vietnamese speakers' mispronunciation of Japanese singleton and geminate stops." Acoustical Science and Technology 43, no. 5 (2022): E2172. https://www.jstage.jst.go.jp/article/ast/43/5/43_E2172/_pdf ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶
Holliday, Jeffrey J. "The perception and production of word-initial Korean stops by native speakers of Japanese." Language and Speech 62, no. 1 (2019): 27–63. https://journals.sagepub.com/doi/10.1177/0023830918785649 ↩ ↩² ↩³ ↩⁴
Hwang, Hyun Kyung, Ito, Junko, and Mester, Armin. "Triple operations of rendaku processing: Native Chinese and Korean speakers learning Japanese." Lingua 199 (2017): 28–48. https://www.researchgate.net/publication/318130440_Triple_operations_of_rendaku_processing_Native_Chinese_and_Korean_speakers_learning_Japanese ↩

Overview​

What L1 transfer is, and what it is not​

How to use this page​

The L1-difficulty matrix at a glance​

Segmental contrasts​

Prosodic contrasts​

English-speaker pitfalls​

Segmental problems: the ら-row, つ, ふ​

Vowel-length blindness​

Mora-timing vs. stress-timing​

Pitch accent and the stress habit​

Mandarin and Cantonese-speaker pitfalls​

What transfers cleanly​

Tone does not transfer to pitch accent​

Mora-timing for syllable-timed speakers​

Vowel length and the high-front vowel​

Korean-speaker pitfalls​

What transfers cleanly​

Where it bites: voicing and devoicing​

Pitch accent vs. Seoul intonation​

Spanish and Italian-speaker pitfalls​

What transfers cleanly​

Where ら-row and ふ still bite​

Vowel length is new​

Pitch accent: the closest analogue is intonation, not lexical accent​

Vietnamese-speaker pitfalls​

Tone does not transfer to pitch accent​

Mora-timing for a sesquisyllabic L1​

Voicing, final consonants, and the mora-N​

Other L1 contexts (brief)​

French, German, and other European L1s​

Tonal South-East Asian L1s (Thai, Lao, Burmese)​

Indo-Aryan and Dravidian L1s (Hindi, Urdu, Tamil)​

Good to know​

Positive transfer is real and underused​

The "no L1 matters" claim is wrong, but the "Japanese is easy" claim is not​

Romaji hides L1 problems​

Heritage speakers and partial L1​

Acoustic-cue weighting per L1​

Training studies actually work​

See also​

References​

Footnotes​

Overview

What L1 transfer is, and what it is not

How to use this page

The L1-difficulty matrix at a glance

Segmental contrasts

Prosodic contrasts

English-speaker pitfalls

Segmental problems: the ら-row, つ, ふ

Vowel-length blindness

Mora-timing vs. stress-timing

Pitch accent and the stress habit

Mandarin and Cantonese-speaker pitfalls

What transfers cleanly

Tone does not transfer to pitch accent

Mora-timing for syllable-timed speakers

Vowel length and the high-front vowel

Korean-speaker pitfalls

What transfers cleanly

Where it bites: voicing and devoicing

Pitch accent vs. Seoul intonation

Spanish and Italian-speaker pitfalls

What transfers cleanly

Where ら-row and ふ still bite

Vowel length is new

Pitch accent: the closest analogue is intonation, not lexical accent

Vietnamese-speaker pitfalls

Tone does not transfer to pitch accent

Mora-timing for a sesquisyllabic L1

Voicing, final consonants, and the mora-N

Other L1 contexts (brief)

French, German, and other European L1s

Tonal South-East Asian L1s (Thai, Lao, Burmese)

Indo-Aryan and Dravidian L1s (Hindi, Urdu, Tamil)

Good to know

Positive transfer is real and underused

The "no L1 matters" claim is wrong, but the "Japanese is easy" claim is not

Romaji hides L1 problems

Heritage speakers and partial L1

Acoustic-cue weighting per L1

Training studies actually work

See also

References

Footnotes