Skip to main content

Japanese Pitch Accent: A Complete Beginner's Guide

Japanese pitch accent is the system standard Tokyo Japanese uses to assign at most one high-to-low pitch drop to each word. It marks the accented mora through pitch alone, not through tone or stress.12 For a serious learner, getting the category right up front (Japanese is neither Mandarin nor English) helps the rest of pronunciation study click.

Overview

This article frames pitch accent as a language type. It names the binary high/low system that operates on every mora, previews the four Tokyo patterns, and explains why mainstream textbooks leave pitch accent off the syllabus. Every example here uses the Tokyo (NHK broadcast) standard; regional systems are flagged but not taught.34

The deeper goal is to give you a mental model accurate enough for pattern-by-pattern study elsewhere. A single overview article cannot train your ear or your production.

The one fact to leave with

Standard Tokyo Japanese marks at most one pitch drop per word. Find the drop and you have found the accent; everything else in the contour follows from that.25

What pitch accent is, and what it is not

The label pitch accent is a typological term, meaning it classifies languages by how they work. It places Japanese in a third category, separate from tone languages like Mandarin and stress languages like English. Getting this category right matters because beginners often import either a tone model or a stress model into Japanese pronunciation. Both lead to wrong predictions.

Pitch accent vs. tone language

Tone languages (Mandarin, Cantonese, Vietnamese, Standard Thai) assign an independent tonal contour, or pitch shape, to almost every syllable. A three-syllable Mandarin word carries three independent lexical tones, each chosen from the language's tone inventory.6

A pitch-accent language is different in kind. It marks at most one prominent spot per word using pitch, a property the typology literature calls culminativity.6 Standard Tokyo Japanese is the classic example: one accented mora per word (or none), realized as a single high-to-low pitch drop immediately after the accented mora.26

A live debate in the typology literature

Larry M. Hyman has argued that "pitch accent" is not a coherent category and that such systems are better analyzed as a subtype of tone. Mainstream Japanese-linguistics work nonetheless uses the pitch-accent label; both Vance 2008 and Kawahara 2015 treat Tokyo Japanese as a pitch-accent system.126

Pitch accent vs. stress accent

Stress accent (the system used by English, Russian, and Spanish) bundles loudness, vowel length, and vowel quality together with pitch. In the English noun RECord, the stressed syllable is louder, longer, and has a fuller vowel than the unstressed syllable.6

Tokyo Japanese keeps loudness and length largely constant across morae and uses pitch alone to mark the accented mora. The acoustic cue is the F0 fall (a drop in fundamental frequency), not amplitude, or loudness.15 Pierrehumbert and Beckman (1988) characterize Tokyo accent as a bitonal HL accent: a high-low accent realized as an F0 peak followed by a steep F0 fall on (or immediately after) the accented mora.5

A dedicated article on stress versus pitch accent walks through the perceptual consequences for English-L1 learners. Here it is enough to know that loudness is not the cue.

Why "high/low" beats "rising/falling" as a mental model

Each mora is in one of two states: H or L. A Tokyo word's accent shape is the sequence of those binary labels, nothing more.27

The "drop" (the H→L transition, also called downstep) is the audible event the listener hears as the accent. Everything else in the contour is just the H plateau before the drop and the L plateau after it. Here, a plateau means a stretch that stays at the same pitch level.15

Thinking in "rising" and "falling" contours leads beginners to hear Japanese as a stripped-down tone language. Thinking in H and L labels per mora matches both the dictionary notation and the underlying phonology.

Terminology to lock in early

Downstep = the H→L pitch fall that marks the accented spot.25 Accent number (akusento-bangō) = the integer dictionaries print: 0 for unaccented words, n for an accent that falls after the n-th mora.348

The binary high-low system on each mora

If pitch accent is the macro story (one drop per word), the binary high-low assignment is the micro story that fills in every mora's actual pitch. Together, three rules generate every well-formed Tokyo accent contour. They are the only rules a beginner needs.

One mora, one pitch

Pitch in Tokyo Japanese is assigned per mora, a timing unit, not per syllable. Each mora is either H or L, with no in-between values.12

A long vowel, the first half of a geminate (the small っ), and the moraic nasal ん each count as their own mora. Each carries its own H or L label.19 This is one of the places where an article on the mora versus the syllable earns its keep. For now, take it as given that the mora is the unit.

For a worked example, 東京 has four morae:

東京とうきょう4
"Tokyo."

The accepted Tokyo pattern is LHHH: heiban, accent number 0 (unaccented). The breakdown is と (L), う (H), きょ (H), う (H). Note that きょ is a single mora despite being written with two kana.

The "one drop per word" rule

A standard-Tokyo word has at most one downstep: once pitch falls from H to L within the word, it does not rise again inside the same word.29

This single constraint generates exactly n + 1 possible accent patterns for an n-mora word. The drop can occur after any of the n morae, or it can be absent entirely (heiban).29 A 2-mora word has 3 possible patterns, a 3-mora word has 4, a 4-mora word has 5.

The combinatorics are tidy, but the lexicon is not evenly distributed across the patterns. Long words with initial or final accent are rare in actual usage even though the system permits them.2

The diagram gives the whole grammar of Tokyo accent contours in one branch: an initial rise, then either a drop somewhere or no drop at all.

Why the second mora almost always differs from the first

Standard Tokyo enforces an initial-lowering rule: the first and second morae must differ in pitch.12

Combined with the one-drop rule, this creates two cases. If the accent sits on mora 1 (the atamadaka case), the first mora is H and the second drops to L (HL...). In all other cases, the first mora is L and the second rises to H (LH...).12

The only standard exception is the one-mora word plus particle case, where the constraint is satisfied by the particle rather than by an internal second mora.2

Off-by-one when reading accent numbers

Accent number 2 does not mean "the second mora is high and the rest are low." It means the drop happens after the second mora: mora 2 is the last H, mora 3 is the first L. Combined with initial-lowering, a 4-mora word with accent 2 surfaces as L–H–L–L.23

The four patterns at a glance

Together, the one-drop rule and initial-lowering give every Tokyo word one of four named patterns. This section previews each pattern with one standard example and points to dedicated articles for drilling.

Heiban (平板): flat, no drop

Accent number 0. The contour is LHH...H on the word, and the particle stays high: LHH...H–H.234 The pattern is named 平板 ("flat board") because, after the required L→H rise on mora 2, the contour stays level for the rest of the word and into any attached particles.2

さくらきました。4
"The cherry blossoms have bloomed."

In さくら, the contour is LHH; the particle が stays high, giving LHH–H. The whole noun-plus-particle sequence is level after mora 2. A dedicated article on the heiban pattern handles drilling.

Atamadaka (頭高): head-high, drop after mora 1

Accent number 1. The contour is HLL...L, and a following particle stays low: HLL...L–L.234 The pattern is named 頭高 ("head-high") because the high pitch sits on the first mora (the "head" of the word) and falls immediately after.28

ねこいています。4
"The cat is meowing."

In ねこ, the contour is HL; the particle が is L, giving HL–L. The drop is immediate, after mora 1. A dedicated article on the atamadaka pattern handles drilling.

Nakadaka (中高): middle-high, drop somewhere inside

Accent number is any integer from 2 to n − 1 for an n-mora word. A 4-mora word with accent 2 is LHLL; with accent 3, it is LHHL.234 The pattern is named 中高 ("middle-high") because the high stretch sits in the middle of the word and drops before the end.28

たまごいました。4
"I bought eggs."

In たまご, the contour is LHL; the particle を is L, giving LHL–L. The drop is internal, between mora 2 and mora 3. A dedicated article on the nakadaka pattern handles drilling.

Odaka (尾高): tail-high, drop after the final mora

Accent number n for an n-mora word. The contour is LHH...H within the word itself, and the drop lands on the following particle: LHH...H–L.234 The pattern is named 尾高 ("tail-high") because the high stretch extends to the tail (last mora) of the word, and the drop is exposed only by what follows.28

おとこました。4
"A man came."

In おとこ, the contour is LHH in isolation; the particle が exposes the drop, giving LHH–L. Without the particle, this is indistinguishable from heiban. With the particle, the contrast surfaces. A dedicated article on the odaka pattern handles drilling.

Odaka and heiban are identical in citation form

Both surface as LHH...H on the word itself. The contrast lives entirely on the particle: heiban keeps the particle high (LHH–H), odaka drops it (LHH–L). A noun pronounced in isolation cannot, in principle, be classified as one or the other. You need a particle.248

The classic three-way demonstration uses はし:

はしながい。 / はしながい。98
"The bridge is long." / "The edge is long."

橋 (bridge) is odaka, accent 2: はし = LH, particle drops → はしが LH–L. 端 (edge) is heiban, accent 0: はし = LH, particle stays high → はしが LH–H. Add 箸 (chopsticks), atamadaka, accent 1: はし = HL, はしが HL–L. The minimal triple is cited in the Shin Meikai Nihongo Akusento Jiten (Kindaichi, 1981). A dedicated article on pitch-accent minimal pairs takes it further.108

A side-by-side comparison table

PatternKanjiAccent number3-mora schematic (word)With particleExample (3-mora)
Heiban平板0LHHLHH–Hさくら (sakura)
Atamadaka頭高1HLLHLL–Lいのち (inochi)
Nakadaka中高2LHLLHL–Lたまご (tamago)
Odaka尾高3LHHLHH–Lおとこ (otoko)

Schematics and accent numbers follow the conventions of the NHK Nihongo Hatsuon Akusento Shin-Jiten (2016)311 and OJAD.412 The L/H breakdown follows Kawahara 2015.2

Why most textbooks ignore pitch accent

If you arrive at this article, you have probably noticed that mainstream classroom materials do not teach pitch accent. It is worth understanding the "why" before deciding what to do about it.

The standard-textbook stance

Neither Genki nor Minna no Nihongo systematically trains pitch accent in its main lessons. The mainline lessons of Genki I (3rd edition, The Japan Times, 2020) do not mark pitch accent on vocabulary entries. Pitch is mentioned in the pronunciation discussion as the cue that distinguishes Japanese from a stress language, but the four patterns are not foregrounded as a training object. The audio recordings use accurate Tokyo accent, but the textbook does not treat them as pitch-accent training material.13

Minna no Nihongo (Shokyū I, 2nd edition, スリーエーネットワーク) takes a similar line. The audio materials use natural Tokyo accent, and some supplementary materials include pitch annotation for selected vocabulary. But the main textbook does not include a pitch-accent lesson or a notation key for learners.1415

The safest factual summary is that both textbooks expose learners to correct Tokyo accent through audio while declining to teach the system explicitly.

Three reasons given

Three reasons recur in discussions of why pitch accent stays off the syllabus. The research below documents the reasons themselves. The published literature does not provide pedagogy-research rebuttals at the rigor this article targets, so each reason is presented descriptively.

  1. "It varies by region." Kansai (Keihan), Tohoku, and parts of Kyushu have distinct systems, so the worry is that teaching one accent picks a side. The descriptive fact is true. The Tokyo standard is nonetheless well-defined. It is the accent used in NHK broadcasts, on the JLPT listening sections, and in OJAD.349
  2. "Context disambiguates minimal pairs." This is true at the lexical level for many high-frequency homophones in connected speech, especially at beginner levels where vocabulary is small.9
  3. "It's too hard for beginners." This is a common informal claim. The cited references do not engage with it as a pedagogical position.

What ignoring it costs you long-term

The descriptive cost is straightforward and well sourced: pitch accent is the phonological cue native listeners use for lexical contrast in Tokyo Japanese.125 A learner who acquires vocabulary without its accent is acquiring an incomplete lexical representation. The gap is exposed every time a minimal pair like 橋 / 端 / 箸 enters a sentence.

The sources reviewed here do not give the size of that cost in connected speech as a quantifiable intelligibility figure. A dedicated article on stress versus pitch can walk through the perceptual interference English-L1 listeners bring. Here it is enough to say that the lexical gap is real, and closing it later is harder than building the habit early.

The Tokyo-standard caveat

Every accent number, schematic, and example in this article assumes the Tokyo standard. That assumption needs to be explicit, because the standard is not the only system Japanese speakers use.

What "Tokyo standard" means here

"Tokyo standard accent" refers to the prestige dialect derived from Yamanote Tokyo. NHK's Broadcasting Culture Research Institute (NHK放送文化研究所) codified it and published it as the NHK Nihongo Hatsuon Akusento Shin-Jiten (2016). All NHK news broadcasts use this accent.311

The same standard is used by OJAD,412 by the Shin Meikai Nihongo Akusento Jiten (Sanseidō),10 and as the implicit reference for pitch-accent material in JLPT listening passages.

Other dialects use different systems

The Kansai / Keihan system (Kyoto–Osaka type, 京阪式アクセント) encodes two independent dimensions per word: an initial-register contrast (whether the word starts high 高起式 or low 低起式) and a downstep location. This creates more contrasts per word than the Tokyo system.9

For 川 (kawa, "river"), Tokyo is [ka.waꜜ] (odaka, drop on particle); Kansai is [kaꜜ.wa] (initial-high, internal drop).9

Accentless (無アクセント, mu-akusento) dialects have no lexical pitch contrast at all. All words surface with a flat contour, and pitch carries no lexical information. Parts of Tohoku and parts of Kyushu fall into this group.9

A dedicated article on pitch-accent regional variation maps the full inventory.

What this means for self-study

For JLPT preparation, NHK broadcast comprehension, and standard business Japanese, drill Tokyo accent. This is the accent every cited reference dictionary records.3410

For learners headed to Kansai, the practical sequence is still to learn Tokyo first as the reference frame. The dictionaries, OJAD, and every accent-tagged Anki deck assume it. Then layer the Keihan system on top.9

Good to know

"Pitch accent" is the linguistics term, not a hedge

The label pitch-accent language has a precise typological meaning: a language with culminativity (one prominence per word) where the prominence is realized by pitch alone.6 It is not a softer way to say "Japanese is kind of tonal." Tonal languages assign tone to every syllable; pitch-accent languages assign at most one tonal locus per word.6

Heiban is the statistical default for nouns

Across the Tokyo-Japanese noun lexicon, roughly half of common nouns are heiban; among the remainder, accented nouns favour the antepenultimate mora (the "default accent" of McCawley's analysis).27 The practical inference is that heiban is the least-bad blind guess for a noun whose accent you do not know. But Kawahara 2015 characterizes the figure as a statistical tendency, not a productive rule, so the reliable move is always to look the word up in OJAD.24

The drop is what your ear should track

Native listeners cue on the H→L transition (downstep), not on absolute pitch height.15 Training your perception of the drop is the highest-leverage drill before production. Find the drop, name the pattern, then check against OJAD.

Treating odaka and heiban as audibly identical in all contexts

The wrong inference is that はし sounds the same regardless of whether the word is 橋 or 端. In citation form they are identical (both LH), but with a particle they diverge.

はしが / はし98
"Bridge (subject) / edge (subject)."

橋が is LH–L (odaka, drop on the particle); 端が is LH–H (heiban, particle stays high). The minimal contrast lives on the particle, not on the noun.

Compound words and conjugations have their own rules

The accent of かみ + さま is not the sum of the parts. Verbs also shift accent across forms. These behaviors are mapped in dedicated articles on pitch-accent compounds and on verb / adjective accent. They are named here only so the reader knows the system has more to it than the four citation-form patterns.2

Notation appears in several flavours

The NHK dictionary's 2016 edition switched from an overline-with-drop-bar notation (which marked the H span) to a downstep-only notation. The newer notation marks the H→L fall with a "\" symbol and uses " ̄" only when no drop exists. The conceptual shift is from "Where does the H sit?" to "Where does the drop happen?"11

OJAD displays the contour graphically (a stepped diagram showing each mora as H or L, with the drop drawn explicitly). It also prints the accent number.412 Older learner materials still use the overline-with-drop-bar notation. IPA-style transcriptions of Japanese use the downstep symbol (ꜜ).3115 All three encode the same information. Given initial-lowering and the one-drop rule, knowing where the drop is is equivalent to knowing the H span.2 A dedicated article on pitch-accent notation unpacks each convention.

See also

References

Footnotes

  1. Vance, Timothy J. The Sounds of Japanese. Cambridge University Press, 2008. Pitch-accent chapter (the work surveys vowels, consonants, syllable structure, accent, and intonation of standard Tokyo Japanese). 2 3 4 5 6 7 8 9 10

  2. Kawahara, Shigeto. "The Phonology of Japanese Accent." Chapter 11 in Haruo Kubozono (ed.), Handbook of Japanese Phonetics and Phonology, De Gruyter Mouton, 2015. https://user.keio.ac.jp/~kawahara/pdf/HandbookAccentPublished.pdf 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29

  3. NHK放送文化研究所 (ed.). 『NHK日本語発音アクセント新辞典』. NHK出版, 2016. ISBN 978-4-14-011345-5. Publisher product page: https://www.monokakido.jp/en/dictionaries/nhkaccent2/index.html 2 3 4 5 6 7 8 9 10 11 12

  4. Online Japanese Accent Dictionary (OJAD). Minematsu Lab, Graduate School of Engineering, The University of Tokyo. https://www.gavo.t.u-tokyo.ac.jp/ojad/eng/pages/home 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

  5. Pierrehumbert, Janet B., and Mary E. Beckman. Japanese Tone Structure. MIT Press, 1988. (Standard reference for the bitonal HL analysis of Tokyo accent; cited in Kawahara 2015 2.) 2 3 4 5 6 7 8

  6. "Pitch-accent language." Wikipedia. https://en.wikipedia.org/wiki/Pitch-accent_language (tertiary cross-check on cross-linguistic typology). 2 3 4 5 6 7

  7. McCawley, James D. The Phonological Component of a Grammar of Japanese. Mouton, 1968. (Foundational generative analysis of Tokyo accent as a single underlying H–L locus per word; cited in Kawahara 2015 2.) 2

  8. "What is Japanese pitch accent?" sci.lang.japan FAQ. https://www.sljfaq.org/afaq/pitch-accent.html (limitation: community FAQ; used here only for the four-pattern overview where it agrees with 2 and 3, and for the 箸/橋/端 minimal pair where it cites Kindaichi 1981). 2 3 4 5 6 7 8

  9. "Japanese pitch accent." Wikipedia. https://en.wikipedia.org/wiki/Japanese_pitch_accent (tertiary cross-check; primary references therein are Vance 2008, Kawahara 2015, Shin Meikai Nihongo Akusento Jiten, and NHK Nihongo Hatsuon Akusento Jiten). 2 3 4 5 6 7 8 9 10 11

  10. 金田一春彦 (Kindaichi, Haruhiko) (ed.). 『新明解日本語アクセント辞典』 (Shin Meikai Nihongo Akusento Jiten). 三省堂 (Sanseidō). The other major normative pitch-accent reference alongside the NHK dictionary; cited by Vance 2008 and by Kawahara 2015. 2 3

  11. 国際交流基金. 「『NHK 日本語発音アクセント新辞典』」. 日本語教育通信 本ばこ. https://www.jpf.go.jp/j/project/japanese/teach/tsushin/bookshelf/201610.html 2 3 4

  12. Nakamura, Nobuaki; Hirano, Hiroya; Minematsu, Nobuaki; et al. "Development of a Web Framework for Teaching and Learning Japanese Prosody: OJAD (Online Japanese Accent Dictionary)." Interspeech 2013. https://www.isca-archive.org/interspeech_2013/nakamura13_interspeech.html 2 3

  13. Banno, Eri; Ikeda, Yoko; Ohno, Yutaka; Shinagawa, Chikako; Tokashiki, Kyoko. Genki: An Integrated Course in Elementary Japanese I, Third Edition. The Japan Times, 2020. Publisher overview: https://genki3.japantimes.co.jp/en/intro/

  14. スリーエーネットワーク (3A Corporation) (ed.). 『みんなの日本語 初級I 本冊』第2版. スリーエーネットワーク. Publisher resources page: https://www.3anet.co.jp/np/en/resrcs/230020/

  15. Tofugu. "Minna no Nihongo Review." Japanese Learning Resources Database. https://www.tofugu.com/japanese-learning-resources-database/minna-no-nihongo/ (limitation: secondary review; the load-bearing claim is that Minna no Nihongo "explains pitch accents with new grammar you learn"; used here as evidence that pitch is at least present in the audio materials, not as evidence of systematic instruction.)