Rendaku: When K Becomes G in Compound Words
Rendaku is Japanese sequential voicing. In compounds, it turns the first voiceless obstruent (a stop or fricative such as /k/, /s/, /t/, or /h/) of the second element into its voiced partner.1 2 It is why 紙 kami becomes gami inside 折り紙 origami, why 手紙 tegami writes its second half with が rather than か, and why 人々 reads hitobito instead of hitohito.
Overview
What rendaku is
Rendaku (連濁) is the regular voicing of a compound's right-hand initial consonant from /k s t h/ to /g z d b~p/.1 In writing, the shift appears as the dakuten (゛) on the か, さ, た rows and as the dakuten or handakuten (゜) on the は row. In speech, it is a one-step voicing of the obstruent.1
The term comes from Edo-period kokugaku scholarship. Kamo no Mabuchi (1765) and Motoori Norinaga (1767–1798) independently identified the voicing-blocking constraint now called Lyman's Law.3 Benjamin Smith Lyman's 1894 paper The Change from Surd to Sonant in Japanese Compounds stated the rule in English and gave the constraint its Western name.4
The literature usually describes the structure this way: in an [A + B] compound, the initial consonant of B voices under lexical, phonotactic, and structural conditions.1 2
Where you meet it as a learner
Rendaku is not a JLPT grammar point. It is a phonological process whose results appear in the standard N5 through N1 vocabulary lists.1 A learner first meets it in a small set of high-frequency words whose second half has unexpected dakuten: 手紙 tegami "letter," 折り紙 origami "paper-folding," 人々 hitobito "people," 鼻血 hanaji "nosebleed," 黒砂糖 kurozatō "brown sugar," 花火 hanabi "fireworks."6 7 8 9 10
Counter compounds are the next major place to notice rendaku. The forms 一本 ippon, 三本 sanbon, 六本 roppon sit at the intersection of rendaku and gemination. The underlying counter is 本 hon, and the surface shape depends on the preceding mora.1
Surname and place-name components use the same process: -gawa (川), -jima (島), -bashi (橋), -saki/-zaki (崎). These are lexicalized choices, discussed under "Semantic factors" below.2
The four voicing shifts
Rendaku targets the four voiceless obstruent series at the start of the second element. The pairings are fixed by the writing system: each voiceless onset has exactly one voiced reflex.
| Voiceless onset | Voiced reflex | Kana shift |
|---|---|---|
| /k/ | /g/ | か→が, き→ぎ, く→ぐ, け→げ, こ→ご |
| /s/, /ɕ/ | /z/, /(d)ʑ/ | さ→ざ, し→じ, す→ず, せ→ぜ, そ→ぞ |
| /t/, /tɕ/, /ts/ | /d/, /(d)ʑ/, /dz/ | た→だ, ち→ぢ, つ→づ, て→で, と→ど |
| /h/, /ç/, /ɸ/ | /b/ (default) or /p/ (handakuten) | は→ば |
k goes to g
The /k/ initial of the second element voices to /g/.1 This is the clearest of the four shifts because the underlying /k/ has no allophonic complications, meaning it does not change pronunciation by environment before rendaku. Wago compounds with a second-element /k/ show rendaku at the wago baseline rate, around 87% in Rosen 2001 and 84% in Irwin 2005.11 12
手紙8
"letter (the kind you write and post)"
折り紙6
"origami; paper-folding"
縁組み13
"matrimonial alliance; adoption arrangement"
s goes to z
The /s/ initial of the second element voices to /z/.1 The /sh/ ~ /ɕ/ series (し, しゃ, しゅ, しょ) voices to /j/ ~ /(d)ʑ/, written じ, じゃ, じゅ, じょ. Phonetically, this is [dʑ] in onset position.1 2
黒砂糖7
"unrefined brown sugar"
青空2
"blue sky"
夜桜13
"cherry blossoms viewed at night"
t goes to d or dʒ
The /t/ initial of the second element voices to /d/.1 Before /i/ and /u/, the underlying /t/ surfaces as the allophones [tɕ] and [ts]. The voiced reflexes are ぢ /dʑi/ and づ /dzu/ in historical orthography.3
Modern 現代仮名遣い (the 1946 spelling reform) writes both ぢ and づ as じ and ず in almost every context. The exceptions are rendaku compounds and etymologically transparent reduplications like ちぢむ, where ぢ and づ are retained as visible markers of the rendaku origin.3 The textbook example is 鼻血 hanaji "nosebleed."9
鼻血9
"nosebleed"
三日月13
"crescent moon"
火種1
"spark; kindling; the seed of a fire (literal or figurative)"
h goes to b (and sometimes p)
The /h/ initial of the second element voices to /b/ by default, or to /p/ in the handakuten reflex.1 The /h/-to-/b/ correspondence is regular wherever the structural rules allow rendaku, even though /b/ is phonetically farther from /h/ than /g/ is from /k/.
The reason is historical. The h-row originated as Proto-Japanese _/p/. It shifted to the bilabial fricative [ɸ] before the end of the 16th century, then split into [h] before /a e o/, [ç] before /i/, and remained [ɸ] before /u/.3 The voiced counterpart of _/p/ has been /b/ throughout, and rendaku preserves that ancient pairing.
The /h/-to-/p/ (handakuten) reflex appears predominantly when the first element ends in the moraic obstruent っ or the moraic nasal ん.1 The counter forms 一本 ippon, 三本 sanbon, 六本 roppon are the highest-frequency examples; Sino-Japanese 出発 shuppatsu shows the same conditioning inside a kango compound.1
人々10
"people; various people"
花火1
"fireworks"
川舟1
"river boat"
三本1
"three (long thin objects)"
一本1
"one (long thin object)"
When rendaku fires and when it does not
The conditioning rules are presented below in order of reliability. Position is absolute. Lyman's Law is hard. The strata generalizations are statistical. Branching and dvandva are structural, with attested exceptions.
Position: only the second element
Rendaku targets the initial consonant of the right-hand element of a binary compound. The first element keeps its lexical voicing.1 2
In three-element compounds, only the rightmost morpheme of a right-branching sub-constituent can voice. That constraint is the subject of "Branching" below.14
Lyman's Law: no double voicing
Vance's modern formulation is simple: rendaku is blocked when the second element of a compound already contains a voiced obstruent phoneme (/g/, /z/, /d/, or /b/).1 2 Lyman's 1894 original wording named the same constraint: "the general rule does not apply ... when b, d, g, j, p, or z already occurs anywhere in the second part of the compound."4
Modern theoretical work restates Lyman's Law as the surface effect of a morpheme-internal OCP (Obligatory Contour Principle) constraint. In plain terms, that constraint bans two voiced obstruents inside a single morpheme.5
The positive control: 折り紙 origami above. Kami contains no voiced obstruent, so rendaku fires.6
The canonical negative example:
山風1
"mountain wind (the wind blowing down off a mountain)"
Lyman blocks because kaze already contains the voiced obstruent /z/. The form is yamakaze, not yamagaze.1 4
A second blocked compound:
山火事1
"wildfire; mountain fire"
Lyman blocks because kaji contains the voiced [dʑ] (phonemically /z/). The form is yamakaji, not yamagaji.1
Lexical strata: wago vs kango vs gairaigo
Japanese vocabulary divides into three strata, or historical layers: wago (和語, native Japanese), kango (漢語, Sino-Japanese loans from Middle Chinese), and gairaigo (外来語, later loans, predominantly from European languages and now written in katakana).15
Rendaku rates fall sharply by stratum. The figures below come from Rosen's 2001 University of British Columbia dissertation, Irwin's 2005 Journal of East Asian Linguistics article, and Irwin's 2011 Loanwords in Japanese monograph.
| Stratum | Rendaku rate as second element | Source |
|---|---|---|
| Wago | ~87% of items show rendaku in at least one compound | Rosen 200111 |
| Wago | ~84% (alternative count, restricted to compound-initial second-element position) | Irwin 200512 |
| Kango (mononoms) | ~20% | Irwin 200512 |
| Kango (binoms) | ~10% | Irwin 200512 |
| Gairaigo | near zero | Irwin 201115; Vance 20152 |
Vance explains the stratum split this way: rendaku is a wago-internal historical process. Kango compounds entered Japanese after that process had largely stopped applying productively, and they were borrowed with fixed on'yomi readings that resist further alternation. Gairaigo arrived later still and behave as orthographically frozen units in katakana.1 2
For learners, a practical stratum check is this: a morpheme written in katakana is gairaigo by default; a morpheme whose kanji are read with on'yomi inside a compound is kango by default; a morpheme whose kanji are read with kun'yomi (the native gloss) is wago by default.15 The heuristic breaks for the small set of hybrid compounds (重箱読み and 湯桶読み), so it is a guide rather than a rule.
Branching in three-element compounds
Otsu's Right-Branch Condition (1980) states that rendaku applies only when a potential rendaku segment sits in a right-branching constituent, that is, a grouped unit on the right side of the compound.14 In a three-element compound A+B+C, the structure determines whether the middle element B can voice.
- Left-branching [[A + B] + C]: rendaku fires on C, because C is in the right branch of the top node.
- Right-branching [A + [B + C]]: rendaku is blocked on B; rendaku may still fire on C inside the [B + C] sub-constituent.
The canonical contrast comes from Otsu 1980 and is revisited in every later treatment.14 16
Status of the constraint: psycholinguistic experiments (Kumagai 2014) show only weak speaker sensitivity to the Right-Branch Condition, with average d′ around 0.14. Vance 2015 also questions whether the "loose" versus "strict" compound distinction can be applied reliably.16 2 The constraint remains the standard description in the literature. Treat it as a strong tendency in three-element compounds rather than a hard law.
Semantic factors: dvandva and proper names
Dvandva compounds are coordinate compounds, meaning "X and Y." They tend to block rendaku.1 2 The classic minimal pair uses a single written form:
山川17
"mountains and rivers" (dvandva: no rendaku)
山川17
"mountain river; a river flowing through a mountain" (modificational: rendaku fires)
The kanji and morphemes are the same, but the compounds differ in meaning. Rendaku marks that difference by being present or absent.
Proper-noun compounds (surnames and place names) are lexicalized.2 The same morpheme can take either form in different names: 中島 reads as both Nakajima and Nakashima depending on the family; 大川 reads as both Ōgawa and Ōkawa.2 For learners, the rule is simple: look up the specific name, do not predict.
Working through a new compound
A learner who knows the four shifts and Lyman's Law still has to decide whether the second element voices in an unfamiliar compound. The check below turns the conditioning rules into a fixed order of questions.
A four-question check
The questions, in order:
- Stratum check. Is the second element wago, kango, or gairaigo? If kango, rendaku is unlikely (~10–20%). If gairaigo, rendaku is near zero. If wago, continue.12 15
- Lyman check. Does the second element already contain a voiced obstruent (/g z d b/, including the [dʑ] allophone written じ or ぢ)? If yes, rendaku is blocked.1 4
- Branching check. In a three-element compound, is the structure right-branching? If yes, rendaku is blocked on the middle element.14
- Semantics check. Is the compound a dvandva (coordinate, "X and Y")? If yes, rendaku is blocked.1 2
If all four checks point to "fire," expect rendaku in a wago compound at roughly the wago baseline rate (≈85% per Rosen 2001 and Irwin 2005). Treat the result as a strong prediction, not a guarantee. A dictionary entry overrides the prediction whenever the two disagree.11 12
Worked examples
The three examples below run the four-question check on a predictable wago hit, a Lyman block, and a kango block.
Predictable wago hit.
折り紙6
"origami; the art of paper-folding"
Stratum: wago + wago. Lyman: kami contains no voiced obstruent. Branching: binary compound, so the three-element question does not arise. Semantics: modificational (the paper is the object being folded), not coordinate. All four checks pass. Rendaku fires: kami → gami.
Lyman block.
山風1
"mountain wind; the wind that blows down from a mountain"
Stratum: wago + wago. Lyman: kaze contains /z/; blocked. Stop. The form is yamakaze, not yamagaze. This is the canonical Lyman's Law example used in every Japanese phonology paper since Lyman 1894.4 1
Kango (strata) block.
説明1
"explanation"
Stratum: kango + kango. Kango compounds resist rendaku (~10% rate per Irwin 2005). The on'yomi of 明 is mei, not bei. Rendaku is blocked at the strata check, so there is no need to run the Lyman check.12 The form is setsumei.
Good to know
The dakuten you see in the spelling is the rendaku you hear
Rendaku writes itself with the same dakuten (゛) and handakuten (゜) marks that learners already know from the basic kana chart. The spelling side of the story belongs to the J-Compass article on dakuten and handakuten as voicing marks on hiragana. There is no separate "rendaku diacritic."1 The regular voicing marks do the work.
Why h goes to b, not v
The h-row of modern Japanese descends from Proto-Japanese _/p/. Frellesvig 2010 dates the shift: _/p/ became the bilabial fricative [ɸ] before the end of the 16th century, then split during the 17th–18th centuries into [h] before /a e o/, [ç] before /i/, and remained [ɸ] before /u/.3
The voiced counterpart of */p/ has always been /b/, so rendaku in the h-row preserves the original /p/–/b/ pairing. The modern alternation hito → bito is the surface form of the ancient *pitö → *bitö voicing.3 The same explanation accounts for the handakuten /p/ reflex after moraic っ or ん in compounds such as ippon and sanbon.1
Mountain wind is the mnemonic for Lyman's Law
The compound 山風 yamakaze is the canonical example used in every introduction to rendaku from Lyman 1894 onward.4 1 Memorizing the pair (yama + kaze, blocked because kaze already has a /z/) gives you the rule.
Rendaku is a strong tendency, not a law
Even wago compounds show roughly 13–16% non-rendaku in second-element position (the complement of the 84–87% wago rate cited above), and Lyman has attested exceptions like nawa-bashigo 縄梯子.11 12 1 The four-question check produces a strong prediction. A dictionary entry overrides it whenever the two disagree.13 18
How rendaku probably arose
The dominant historical hypothesis is the genitive-particle fusion account. In Old Japanese, the genitive postposition の no could appear between the two elements of an N+N compound. For example, yama-no-tori "mountain-GEN-bird" gave modern yamadori 山鳥 "copper pheasant," via an intermediate stage where no reduced to a prenasalized voiced segment that fused with the following obstruent.3 19
The reconstruction sits in the Old Japanese period (roughly the 7th–8th centuries CE) and earlier. Frellesvig 2010 is the standard reference for the timing.3 The hypothesis explains why rendaku is partial rather than universal: only the subset of compounds that historically passed through the no-fusion path show the voicing reflex.
A second hypothesis treats rendaku as a juncture marker that originally signalled the boundary between elements of a compound. Both accounts coexist in the literature. Frellesvig 2010 reviews them.3
See also
- Rendaku in Kanji Compounds: Why 紙 Becomes -gami
- Japanese Compound-Word Pitch Accent: How Two Words Combine into One Accent Pattern
- How to Predict the Reading of an Unknown Kanji Compound: The On+On Default, Jūbako, Yutō, and the Look-It-Up Bucket
- The Japanese Vowel Inventory: Five Vowels, Done Right
- Japanese Vowel Devoicing: Why です Sounds Like "Des"
- Stress vs. Pitch: Does Japanese Have Stress?