Japanese Shadowing Materials by JLPT Level: What to Shadow from N5 to N1

Japanese shadowing materials by JLPT level are audio sources screened for one task: reproducing native speech aloud, a beat behind, from sound alone.¹ Picking the wrong source is the quiet reason most shadowing practice stalls. Material that is too fast, too long, or has no transcript turns the drill into parroting noise.¹

Overview

This article is a materials map, not a technique manual. It screens every pick against four shadowing-fitness criteria. Then it routes each JLPT band from N5 to N1 to a level-appropriate source already covered elsewhere on J-Compass.

Two anchors hold the whole map together. The first is the set of four fitness tests below. The second is a small set of measured speech-rate numbers. These let "by level" mean something concrete rather than an "easy/hard" hunch.²³

Every level pick here is a recommendation, not a gate

No published study assigns a specific resource to a specific JLPT level, and the JLPT itself publishes no media or vocabulary list. The per-level picks are calibrated J-Compass recommendations. They are anchored to the official can-do descriptors plus measured speech-rate bands, not official classifications.⁴ Read each row as a starting point you adjust, not a wall you must clear first.

What makes good shadowing material

The four fitness tests: short, clear, transcribed, single-speaker

Shadowing means reproducing heard speech aloud in real time, a short interval behind, from sound alone with no text in view, while the audio keeps playing.¹⁵ The four fitness criteria below come from that drill. They are the spine the rest of this map hangs on.

Criterion 1, short. Material should be a short segment, not a long clip. Kadota models shadowing as pushing processing through the phonological loop, the auditory-rehearsal part of working memory that holds sound traces for only a few seconds before they fade.⁶⁵ A long unbroken clip overruns that buffer, so segments must be short enough to hold. "Short" follows from the working-memory account. It is not a fixed second-count: aim for "short enough to hold," not "exactly N seconds."¹⁷

Criterion 2, clear. Audio must be clearly enunciated. Shadowing's best-supported gain is bottom-up phoneme perception, meaning recognition of sounds from the audio itself. The technique trains the ear to whatever input it is fed.¹⁸ Muddy or noisy audio gives the perceptual system nothing clean to lock onto, and clarity is also what makes a single clear-voiced host comprehensible in the first place.¹

Criterion 3, has a transcript. A transcript lets you check what you mis-shadowed afterward. Shadowing itself is done with no text in view, because reading while shadowing "will change the cognitive process... so it becomes a different practice."⁷ But Kadota and Tamai's stepped procedure depends on having a script available. You use it to mumble and synchronize before the text is removed at the prosody and content stages.⁹ You shadow without the text in the moment, then verify against the transcript afterward. With no transcript at any point, there is no audit trail for your errors.

Criterion 4, single speaker (relaxes as level rises). A single speaker is easiest to track. Single-speaker audio gives you three supports at once: one voice, full untruncated turns, and a read or semi-scripted register.² Overlapping multi-speaker spontaneous speech removes all three. The constraint is strictest at low levels and loosens upward. By N2 to N1, a mature ear can tolerate multi-speaker material. "Single-speaker at lower levels, relaxing upward" is a reasoned recommendation built from that accommodation count, not a sourced per-level rule.⁹²

The unifying principle: shadow slightly below your ceiling

Material should sit slightly below your comprehension ceiling so you can track it in real time. Comprehensible input is input a learner can understand even when it sits slightly beyond their current level. For shadowing, the bar is stricter because production runs alongside perception.¹ Hamada's data show shadowing's listening payoff concentrated in lower-proficiency learners working with level-appropriate material. An over-level text is "too demanding... cognitive overload."¹ So the calibration for shadowing skews easier than for passive listening.

How "by level" is calibrated, not absolute

The match between level and material is a recommendation, not a hard gate. The assignments are built from three evidence-anchored axes rather than impressions:

Approximate speech rate (morae per second). Spontaneous native conversation averages 8.01 morae/s (SD 2.07). Read or careful speech averages 7.11 morae/s. Native broadcast is roughly 450 to 570 morae/min (about 7.5 to 9.5 morae/s). The rate non-native listeners perceive as close to ideal for "easy Japanese" is 320 to 360 morae/min (about 5.3 to 6.0 morae/s).²³ These are the anchors every level recommendation leans on.
Register, ranging from graded-learner through casual, formal or news, to slang and dialect.
Topic-vocabulary load, expressed as a JLPT-equivalent band.

A mora is the rhythmic beat of Japanese; か, ん, and the small っ each count as one mora, which is why morae per second, not words per minute, is the unit here.

The JLPT-audio caveat applies throughout. JLPT listening is deliberately slowed, over-articulated, and contraction-free. The slowing shrinks from N5 ("spoken slowly") to N1 ("natural speed") but is present below N1.⁴ No published morae/s figure exists for JLPT tracks, so none is given for them.⁴

A learner who shadows only JLPT-style audio trains JLPT ears, not native ears.

You may also shadow material one notch below your reading or comprehension level. Because production runs alongside perception, the shadowing-fit level sits below the passive-comprehension level. This is reasoned from the proficiency-dependence finding, not from a study that directly tested difficulty calibration.¹

How to use this map

Shadowing material vs. comprehension-listening material

The two libraries overlap, but the selection bar differs. Comprehension (passive) listening tolerates multi-speaker audio, no transcript, and material at or slightly above your ceiling. Shadowing is stricter: it needs the four fitness criteria and material a notch easier than your comprehension ceiling.¹⁹

The bar is stricter because shadowing is an online, high-cognitive-load task with no pauses. Passive comprehension, and even listen-pause-repeat, are gentler because a pause gives you time to process meaning.⁷ A clip that is fine to listen to passively can be too fast or too long to shadow.

In practice the same resource can appear in both a general listening map and this shadowing map for different jobs. A multi-speaker drama scene is good comprehension input at N2, but for shadowing you isolate a single short turn from it rather than shadowing the crosstalk.

The per-level table below is the heart of the map. Each row pairs a level with what to shadow, why it fits, whether a transcript is available, and the main watch-out.

Level	What to shadow	Why it fits	Transcript?	Watch-out
N5	Textbook audio (Genki, Minna no Nihongo); simplest NHK News Web Easy stories	Scripted, slow, single or clean two-speaker, fully transcribed; all four criteria at once	Yes (printed script; on-screen news text)	NHK Easy audio is synthesized; shadow for pace, not pitch
N4	Teppei Beginner and main Nihongo Con Teppei; YuYu Nihongo (on-screen-transcript version)	Single host, short episodes, clear delivery	Teppei: not confirmed; YuYu: only on the YouTube on-screen-text version	Use short isolated segments; lean transcript-critical work on Sakura Tips / NHK Easy
N3	Sakura Tips; calm single-speaker anime scenes	Scripted, short, single clear host with free JP+EN transcript (Sakura Tips)	Sakura Tips: yes; anime: n/a	Anime carries role language; isolate calm slice-of-life turns only
N2	Contemporary slice-of-life J-drama (single turns); NHK radio/web news	Real vocal-tract prosody (drama); clear formal announcer register (news)	Drama: inconsistent JP subs; news: yes (読むらじる。)	News is formal 書き言葉; does not transfer to casual speech
N1	Native vloggers and to-camera YouTube; short isolated variety turns	Self-recorded single-speaker native audio; full native ceiling	Vlogs: partial (on-screen text / JP auto-captions); variety: no	テロップ are not a transcript; variety sits above the JLPT ceiling

N5: textbook audio and simplified news

The N5 tier carries the strictest single-speaker and transcript demand of the whole map. As everywhere here, the pick is a recommendation, not a gate.⁴

Textbook audio (Genki, Minna no Nihongo)

Textbook audio is the ideal first shadowing source because it is scripted, slow, single-speaker (or clean two-speaker), and fully transcribed, with the dialogue printed in the book. It satisfies all four fitness criteria at once, which no native material does at N5.

Genki: An Integrated Course in Elementary Japanese (The Japan Times) includes accompanying audio for its scripted elementary dialogues and listening exercises.¹⁰ Minna no Nihongo (みんなの日本語, 3A Corporation) likewise provides scripted dialogue and listening audio keyed to its elementary lessons.¹¹

The durable, citable fact is that these series exist and pair printed (transcribed) scripts with recorded audio.¹⁰¹¹ The N5 fit is a calibrated recommendation. Elementary textbook audio is graded for beginners by design. No measured speech-rate figure is published for either series, so none is given.

The printed script is the built-in transcript. The scripted, slow, single or clean-speaker delivery covers short, clear, and single-speaker. Textbook dialogue is polite-register scripted speech, useful precisely because it is clean and checkable; it is not a model of fast casual speech, but at N5 that is the point.

Simplified news and beginner podcasts

NHK News Web Easy is the N5 simplified-news pick. It is a free NHK site that rewrites real news into the simplest Japanese, with furigana on every kanji, an on-demand Japanese hard-word gloss, and a 「ニュースを聞く」 read-aloud audio track.¹²¹³ Because the article text is on screen and the audio reads it, the transcript criterion is automatic. The delivery is slow and clear over short articles, and it is a single voice.¹³¹⁴

One caveat decides how you use it. The read-aloud is synthesized speech (合成音), meaning a computer-generated voice. It is generated automatically and read slowly and evenly with adjustable playback speed.¹⁴ That makes it good for segmentation and reading or articulation pace. But it is not a reliable model of natural prosody or pitch accent.¹⁴

Shadow NHK Easy audio for pace, not for melody

The NHK News Web Easy read-aloud is a synthesized voice, not a human one, read slowly and evenly.¹⁴ Use it to train segmentation and articulation pace, where it excels, and take your model of natural intonation and pitch from a human source instead. A synthesized voice cannot teach you pitch accent.

The manuscript is built for the 中級準備レベル (N3合格) learner drawing on roughly 1,600 words from the old JLPT 3・4級 range, so it sits in N4 to N3 territory. Because it rewrites authentic news, harder proper nouns and terms leak through, and difficulty varies story to story.¹⁴ For shadowing at N5, pick the simplest stories. For absolute-beginner podcasts, start with the recommended-podcasts list and Teppei Beginner, covered in the N4 section. Its single-host, short-episode format is what makes it shadow-able for a near-beginner.¹⁵¹⁶

N4: Nihongo Con Teppei and slow learner podcasts

The N4 tier favors single-host, slow-ish, short audio. Recommendation, not a gate.⁴

Nihongo Con Teppei and Teppei Beginner

Nihongo Con Teppei is a free, single-host Japanese podcast. One teacher, Teppei, talks in natural Japanese about one everyday topic per episode, with little or no background audio.¹⁵¹⁶ The genuine entry point is the spinoff "Japanese podcast for beginners (Nihongo con Teppei)," commonly called Teppei Beginner, calibrated around N5. The main "Nihongo Con Teppei" series sits one band higher at roughly N4 to N3.¹⁵¹⁷¹⁸¹⁶

Teppei Beginner episodes run roughly 3 to 5 minutes (Spotify shows about 3 minutes; Tofugu's beginner review reports about 4 minutes); main-series episodes run roughly 10 to 20 minutes.¹⁷¹⁶ Short beginner episodes satisfy the "short" criterion well. One voice with no co-presenter satisfies the single-speaker criterion that matters most at this level.¹⁵¹⁶

A common myth is worth correcting: Teppei does not slow his speech much. Comprehensibility comes from simple vocabulary, one topic, and repetition of the key word, all delivered clearly by a single speaker. Tofugu notes he "doesn't really slow his speech much," and "simply speaks clearly, only really pausing around the title or any other key phrases."¹⁶ No measured speech rate is published, so none is given. The N5 and N4 to N3 bands describe register and topic vocabulary, not official ratings.¹⁵¹⁸¹⁶

The transcript criterion needs an honest flag. The verified facts establish that the show is free across Apple, Spotify, and RSS. They do not establish a free full per-episode transcript for Teppei Beginner.¹⁵¹⁷¹⁶

So Teppei is a strong fit on short, clear, and single-speaker, but a weaker fit on transcript than transcript-bundled resources. For transcript-critical shadowing, lean on Sakura Tips and NHK News Web Easy instead.

YuYu Nihongo and other comprehensible-input shows

YuYu Nihongo (YUYUの日本語) is a comprehensible-input resource by Yuusuke Takemori (竹森悠介). It is delivered as both an audio podcast and YouTube uploads that show the Japanese transcript on screen; the register is casual single-host monologue.¹⁹²⁰²¹ The on-screen Japanese transcript is the key support: the YouTube version displays the spoken Japanese as readable text. That supplies the transcript criterion and makes it the gentler entry point. The audio-only feed strips that support and is harder.¹⁹²¹

YuYu trends harder than a pure N4 pick. A resource database describes the host as having "very clear audio, and an easy-to-understand way of speaking," but lists the show under intermediate Japanese. Episodes also run long, from about 10 minutes to a half-hour or more.²¹

As an N4 shadowing source, use the on-screen-transcript YouTube version on short isolated segments, not the long audio-only monologues. Treat YuYu as a level curve: gentler with the transcript, advanced without it. The delivery is well-enunciated but at or near native conversational tempo, with no measured rate published.¹⁹²¹

N3: Sakura Tips and clean anime clips

The N3 tier is the first native-adjacent band. Single-speaker audio is still preferred. Recommendation, not a gate.⁴

Sakura Tips: slow-but-natural single-host audio

Sakura Tips is a short, scripted Japanese podcast by a single host, Mari. Every episode comes with a free Japanese-and-English transcript posted on sakuratips.com.²²²³²⁴ Episodes run about 4 to 5 minutes each and are numbered in sequence.²³²⁴

This is the best fit on the whole map for the transcript criterion: scripted (so no filler), short, single clear-voiced host, and a free per-episode transcript. That satisfies all four fitness criteria in one resource, which is why it anchors N3.²²²³²⁴ Mari "speaks in a very clear voice, and slows down her speech, while not overdoing it so much to make it odd," striking "a nice balance between comprehensibility and naturalness."²⁴ No measured speech rate is published. Describe it qualitatively as slower than native conversation yet faster and less robotic than from-zero drilled audio.²⁴

The clear, slowed delivery feels beginner-friendly. The topic vocabulary (family, work, seasons, temples and religion, decluttering, Japan-culture asides) reaches the N4 to N3 band. That label is a J-Compass calibration against the JLPT N4 and N3 can-do descriptors, not an official level.²²²⁴⁴

Anime clips: pick calm, single-speaker scenes

Anime is multi-speaker, fast, and register-distorted, so it does not satisfy the criteria as a whole. The shadowing-fit move is to isolate calm, single-speaker scenes: monologue or narration rather than banter or crosstalk. This recovers the single-speaker and short criteria from an otherwise unfit source.

No corpus measures anime's own dialogue rate. But it is delivered by professional voice actors at or above natural broadcast pace, which sits in or above the 450 to 570 morae/min band. That is well above the slowed 320 to 360 morae/min "easy Japanese" target.³ Anime is fast; that is why you isolate short calm turns and why N3 is the floor for this pick.

The register trap matters most here. Register means the style of language you use in a particular situation. Anime carries 役割語 (role language), a stylized fictional speech tied to character types that is "usually partially or entirely distinct from the real-life language" of the people it depicts.²⁵²⁶ Markers like わし and 〜じゃ (old man), わたくし and 〜ですわ (refined lady), 拙者 and 〜でござる (samurai), and gendered sentence-enders appear much more often in fiction than in real speech.²⁵²⁶

Do not shadow these into your real-speech default. Pick slice-of-life or contemporary-setting scenes, which carry less role-marked speech because role language indexes the archetypes that period and fantasy genres foreground.²⁵

N2: J-drama and NHK news

The N2 tier tolerates multi-speaker audio and widens into formal register. Recommendation, not a gate.⁴

Japanese drama (dorama): realistic conversational rhythm

Live-action drama (実写ドラマ) is performed by real actors. Prosody, articulation, breath, and timing therefore come from a real vocal tract. Contemporary-setting dramas default to 標準語 (standard Japanese) rather than role language.²⁷²⁵²⁶ What transfers is realistic intonation and conversational rhythm. That is why live-action beats anime for transferable prosody.

Scripted drama is cleaner than spontaneous speech. CSJ (Corpus of Spontaneous Japanese) measured spontaneous speech at 8.01 morae/s (SD 2.07) against 7.11 morae/s for read or performed speech. So drama sits one rung below spontaneous speech and above textbook audio.²⁸ Native broadcast and drama pace (about 450 to 570 morae/min) is well above the learner-comfortable band (about 320 to 360 morae/min). That is why N3-plus is the floor and N2 the comfortable shadowing tier.³

Drama is multi-speaker, which N2 ears can tolerate. For shadowing, isolate short single turns and shadow those, not overlapping exchanges. The transcript dependency is real. Japanese subtitles, which shadowing needs, are inconsistent and region-dependent across platforms. English-only subs do not serve shadowing.²⁷

For calibrated picks, contemporary slice-of-life and everyday-speech titles are the shadowing-friendly band, such as 深夜食堂 (Midnight Diner, 2009, slice-of-life 標準語) and 逃げ恥 / 逃げるは恥だが役に立つ (TBS, 2016, everyday adult and workplace speech).²⁷²⁹ Treat these as illustrative difficulty anchors, not a streaming catalog. Avoid theatrical (半沢直樹) or genre-jargon (ドクターX, medical) titles, whose register and vocabulary do not generalize.

NHK Radio/Web News: formal-register shadowing

らじる★らじる (NHK ONE) streams NHK's hourly radio newscasts with a 同時配信 live simulcast and 聴き逃し catch-up, plus a 読むらじる。 read-along text companion. NHK News Web carries the same native register in text with attached audio or video.³⁰³¹ The read-along text is the transcript pairing that makes intensive shadowing practical.

The announcer reads clearly and to a standard. NHK's news-reading standard is about 300 字/分, with a former-announcer working range of 300 to 350 字/分. Converting on the roughly 1 字 ≈ 1 mora basis lands at about 5 to 6 morae/s. This is a calibration from a reading standard rather than a measured newscast rate.³² Clean, standard enunciation is precisely what makes news good shadowing practice once listening is solid.³³

News register does not transfer to casual speech

News audio is formal written-style (書き言葉), heavy with 漢語 (Sino-Japanese vocabulary) and reporting forms like 〜とみられます (reporting passive) and 〜ということです (hearsay). It does not transfer to casual conversation.³³ JLPT listening is also slower and cleaner than a real NHK newscast. Passing N3 listening does not predict you can shadow NHK news.⁴ Shadow it for formal register and articulation, then pair it with casual input.

One access note: らじる★らじる is 日本国内限定 (Japan-only, IP-enforced); from abroad, use NHK News Web, NHK WORLD-JAPAN audio, and NEWS WEB EASY instead.³⁰³¹

N1: variety shows and native vloggers

The N1 tier means full native speed, slang, and overlapping speech. It sits above the JLPT ceiling. Recommendation, not a gate.⁴

Variety shows: the unscripted native ceiling

バラエティ番組 (variety shows) stack every hard variable at once: spontaneous overlapping speech, rapid wordplay, dialect, on-screen テロップ, and dense cultural reference.³⁴ The genre as a whole fails the single-speaker and short criteria. At N1, the shadowing-fit move is to shadow short, isolated single turns, not crosstalk.

Spontaneous speech averages 8.01 morae/s and is far more variable than scripted speech (SD 2.07). In the fast tail, about 0.1% of utterances exceed 14.2 morae/s. That is the measured basis for the rapid-fire feel.²⁸³⁵ CSJ is academic and simulated-speech data. It is an anchor rather than a measurement of variety TV. The JLPT publishes no overlapping multi-speaker, dialect-dense listening section, so passing N1 does not predict variety comprehension; treat variety as above the test ceiling.⁴²⁸

On-screen テロップ are not a transcript. They are selective, stylized, affect-driven open captions pitched at native reading speed and often kanji-heavy. They emphasize and editorialize rather than transcribe, so they do not satisfy the transcript criterion.³⁴

Much TV comedy talent also arrives through the Osaka お笑い pipeline, so variety carries a baseline of 関西弁 (Kansai dialect, such as copula や and negative 〜へん) that standard-dialect study never teaches. One verified show wears it in its title, ガキの使いやあらへんで (NTV, 1989).³⁵³⁶ Recognize that dialect, but do not necessarily shadow it into standard output. For difficulty illustration only, 水曜日のダウンタウン (TBS, 2014) and ガキの使いやあらへんで (NTV, 1989) are real shows cited as anchors. They are not a viewing prescription.³⁷³⁶

Native vloggers and YouTube channels

A to-camera native vlog keeps the single-speaker support that variety removes. So self-recorded single-speaker native audio is the more shadow-able native tier.³⁸ Three picks span the range:

HikakinTV (Hikakin) offers broad daily vlogging at full conversational speed, with frequent on-screen Japanese text but no guaranteed native subtitles. The on-screen text therefore only partially serves the transcript criterion.³⁸
That Japanese Man Yuta runs subtitled street interviews. Interviewees speak at natural conversational speed, but the subtitles keep it followable. This makes it a transcript-supported bridge for isolating short native turns.³⁹
NHK きょうの料理, a long-running NHK cooking program (since 1957), has narrow, repeating cooking vocabulary and a calmer instructional pace. It is a gentler native on-ramp than vlogger channels and a durable public-broadcaster source.⁴⁰

Two flags close the tier. Entertainer and vlogger register (exaggerated reactions, slang, gendered or anime-adjacent phrasing) is authentic but does not transfer cleanly to formal or workplace Japanese. Shadow for the ear, and study register separately.³⁸ For native YouTube, turn on Japanese (not English) auto-captions where available. English subs convert shadowing-prep into reading and do not serve verification.³⁸³⁹

Good to know

The transcript is non-negotiable for shadowing

You shadow from sound with no text in the moment, but you cannot diagnose what you mis-shadowed afterward without a reference text. Kadota and Tamai's procedure builds on having a script available to synchronize against before it is removed. With no transcript at any stage, there is no audit trail for your errors.⁹⁷

Do not waive this criterion at any level. It is why transcript-bundled resources (Sakura Tips, NHK News Web Easy, 読むらじる。, textbook scripts) outrank transcript-poor ones for shadowing specifically.⁹¹³²³³⁰

Shadow one notch below your comprehension level

If the audio is above your level, you fall off the stream and produce noise rather than meaning. Shadowing's listening payoff concentrated in lower-proficiency learners working with level-appropriate material. An over-level text causes cognitive overload.¹ Because production runs alongside perception, the shadowing-fit level sits below the passive-comprehension level. "JLPT-correct but too fast" becomes parroting noise.

The "one notch below" heuristic is a pedagogical recommendation; the supporting datum is the proficiency-dependence finding, not a study that directly tested difficulty calibration.¹

Don't ship anime/variety register into real speech

Anime carries 役割語 (role language) "usually partially or entirely distinct from the real-life language" it depicts (わし and 〜じゃ, 拙者 and 〜でござる, gendered enders). Variety and vlogger speech carry slang and gendered or anime-adjacent phrasing.²⁵²⁶³⁴³⁸ A learner who shadows these as defaults imports a fictional or rough register into real speech.

The fix is to shadow these sources for the ear, then run a "would a real person say this to a real person in my situation?" filter before adopting any phrase. Shadow contemporary 標準語 (drama, news, vlogs) for production-safe register.²⁵²⁶

The JLPT-audio trap

JLPT listening is deliberately slowed, over-articulated, and contraction-free. The slowing shrinks from N5 to N1 but is present below N1.⁴ Shadowing trains the ear to whatever input it is fed. Practicing only slow, over-articulated test audio builds perception of slow, over-articulated speech, not native-rate connected speech.¹⁸ Real speech is much faster: spontaneous speech is about 8.0 morae/s, and broadcast is about 450 to 570 morae/min, compared with the learner-preferred 320 to 360 morae/min.²³

The fix is to keep native-rate material (drama, news, vlogs) in the shadowing rotation alongside any test-style audio. There is no single study pitting JLPT audio against native audio for shadowing; this is a reasoned implication of the mechanism.¹⁸ Put plainly: shadowing JLPT audio trains JLPT ears, not native ears.

References

Hamada, Yo. "Shadowing: Who Benefits and How? Uncovering a Booming EFL Teaching Technique for Listening Comprehension." Language Teaching Research, vol. 20, no. 1, 2016, pp. 35–52. https://journals.sagepub.com/doi/10.1177/1362168815597504 ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹² ↩¹³ ↩¹⁴
Maekawa, Kikuo. "Corpus of Spontaneous Japanese: Its Design and Evaluation." Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR 2003), pp. 7–12. National Institute for Japanese Language and Linguistics (NINJAL). https://www.isca-archive.org/sspr_2003/maekawa03_sspr.html ↩ ↩² ↩³ ↩⁴ ↩⁵
Prafiyanto, Hafiyan; Nose, Takashi; Chiba, Yuya; Ito, Akinori. "Analysis of preferred speaking rate and pause in spoken easy Japanese for non-native listeners." Acoustical Science and Technology, vol. 39, no. 2, 2018, pp. 92–99. The Acoustical Society of Japan. https://www.jstage.jst.go.jp/article/ast/39/2/39_E1731/_pdf/-char/en ↩ ↩² ↩³ ↩⁴ ↩⁵
日本語能力試験 (Japanese-Language Proficiency Test). "N1–N5: Summary of Linguistic Competence Required for Each Level." Official JLPT website, administered by the Japan Foundation and Japan Educational Exchanges and Services. https://www.jlpt.jp/e/about/levelsummary.html ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸ ↩⁹ ↩¹⁰ ↩¹¹ ↩¹²
Takeuchi, Hikaru, et al. "Effects of Training of Shadowing and Reading Aloud of Second Language on Working Memory and Neural Systems." Brain Imaging and Behavior, vol. 15, no. 3, 2021, pp. 1253–1269. https://doi.org/10.1007/s11682-020-00324-4 ↩ ↩²
Kadota, Shuhei. Shadowing as a Practice in Second Language Acquisition: Connecting Inputs and Outputs. Routledge (Routledge Research in Language Education), 2019. ISBN 9781138485501. https://www.routledge.com/Shadowing-as-a-Practice-in-Second-Language-Acquisition-Connecting-Inputs/Kadota/p/book/9781032092836 ↩
Hamada, Yo. "Shadowing for Language Teaching." Contact (TESL Ontario), August 2018, pp. 19–24. https://contact.teslontario.org/wp-content/uploads/2018/08/Hamada-Shadowing.pdf ↩ ↩² ↩³ ↩⁴
Hamada, Yo. Teaching EFL Learners Shadowing for Listening: Developing Learners' Bottom-up Skills. Routledge (Routledge Research in Language Education), 2016. ISBN 9781138935983. https://www.routledge.com/p/book/9780815360902 ↩ ↩² ↩³
Kadota, Shuhei, and Ken'ichi Tamai. English Shadowing. Cosmopier Publishing, 2004. ↩ ↩² ↩³ ↩⁴ ↩⁵
The Japan Times. Genki: An Integrated Course in Elementary Japanese (Banno, Eri, et al.), textbook series with accompanying scripted dialogue audio. The Japan Times Publishing. https://genki3.japantimes.co.jp/en/ ↩ ↩²
3A Corporation (スリーエーネットワーク). Minna no Nihongo (みんなの日本語) elementary course, with accompanying scripted dialogue and listening audio. https://www.3anet.co.jp/np/list.html?g=21 ↩ ↩²
NHK. 「NHKやさしいことばニュース」(NEWS WEB EASY). 公式サイト. https://news.web.nhk/news/easy/ ↩
NHK. 「NHKやさしいことばニュースについて」. NEWS WEB EASY 公式サイト（furigana toggle, hard-word gloss, 「ニュースを聞く」 read-aloud audio button）. https://news.web.nhk/news/easy/about/ ↩ ↩² ↩³
田中英輝（NHK放送技術研究所）.「『やさしい日本語』ニュース NEWS WEB EASY」. 文化庁平成26年度日本語教育大会配付資料, 2014年8月29日（synthesized 合成音 read-aloud; slow, even delivery; manuscript built for 中級準備レベル/N3合格 learners）. https://www.bunka.go.jp/seisaku/kokugo_nihongo/kyoiku/taikai/26/program/pdf/shiryo_05.pdf ↩ ↩² ↩³ ↩⁴ ↩⁵
Teppei. Japanese podcast for beginners (Nihongo con Teppei). Official site / show home. https://nihongoconteppei.com/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶
Tofugu. Japanese Podcast for Beginners (Nihongo Con Teppei) Review. Tofugu Japanese Learning Resources Database (limitation: learning-blog review, tier-5; used for qualitative delivery notes only). https://www.tofugu.com/japanese-learning-resources-database/japanese-podcast-for-beginners-nihongo-con-teppei/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷ ↩⁸
Teppei. Japanese podcast for beginners (Nihongo con Teppei). Apple Podcasts show page (id1471500012). https://podcasts.apple.com/us/podcast/japanese-podcast-for-beginners-nihongo-con-teppei/id1471500012 ↩ ↩² ↩³
Teppei. Nihongo con Teppei (main / intermediate series). Apple Podcasts show page (id1353032168). https://podcasts.apple.com/us/podcast/nihongo-con-teppei/id1353032168 ↩ ↩²
YouTube. "YUYUの日本語Podcast" channel (@yuyunihongopodcast); on-video Japanese transcript text. https://www.youtube.com/@yuyunihongopodcast ↩ ↩² ↩³
Satori Reader. "Contributor: Yuyu." Profile page (host Yuusuke Takemori / 竹森悠介; casual first-person monologue). https://www.satorireader.com/contributors/yuyu ↩
Tofugu. "YUYUの日本語 Podcast Review" (Japanese Learning Resources Database; limitation: learning-blog review, tier-5; used for qualitative delivery notes only). https://www.tofugu.com/japanese-learning-resources-database/yuyu-no-nihongo-podcast/ ↩ ↩² ↩³ ↩⁴
SAKURA TIPS. Official homepage（host "Mari"; podcast plus free JP+EN transcripts on sakuratips.com）. https://sakuratips.com/ ↩ ↩² ↩³
SAKURA TIPS｜Listen to Japanese. Apple Podcasts directory listing（episodes typically 4–5 minutes; scripted; scripts in Japanese on the website）. https://podcasts.apple.com/us/podcast/sakura-tips-listen-to-japanese/id1536540690 ↩ ↩² ↩³ ↩⁴
Tofugu. "Sakura Tips Review," Japanese Learning Resources Database（Mari "speaks in a very clear voice, and slows down her speech, while not overdoing it so much to make it odd"; "strikes a nice balance between comprehensibility and naturalness"; scripted short episodes; JP+EN scripts; limitation: learning-blog review, tier-5; used for qualitative delivery notes only）. https://www.tofugu.com/japanese-learning-resources-database/sakura-tips/ ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶
Kinsui, Satoshi (金水敏). 『ヴァーチャル日本語役割語の謎』 (Virtual Japanese: The Enigma of Role Language). 岩波書店 (Iwanami Shoten), 2003. ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶
国際交流基金 (The Japan Foundation). 「『役割語』とは何か」(連載「日本語・日本語教育を研究する」第41回), 著者金水敏. https://www.jpf.go.jp/j/project/japanese/teach/tsushin/research/201302.html ↩ ↩² ↩³ ↩⁴ ↩⁵
"Shin'ya Shokudō" (深夜食堂 / Midnight Diner). Wikipedia（live-action drama from 2009; food/slice-of-life; contemporary 標準語）. https://en.wikipedia.org/wiki/Shin%27ya_Shokud%C5%8D ↩ ↩² ↩³
Maekawa, Kikuo. "Corpus of Spontaneous Japanese." (CSJ: spontaneous speech averaged 8.01 morae/s, SD 2.07, against 7.11 morae/s for read speech.) https://www.isca-archive.org/sspr_2003/maekawa03_sspr.html ↩ ↩² ↩³
"逃げるは恥だが役に立つ" (Nigeru wa Haji da ga Yaku ni Tatsu). Wikipedia (Japanese)（TBS, 2016; romantic comedy; everyday adult/workplace speech）. https://ja.wikipedia.org/wiki/%E9%80%83%E3%81%92%E3%82%8B%E3%81%AF%E6%81%A5%E3%81%A0%E3%81%8C%E5%BD%B9%E3%81%AB%E7%AB%8B%E3%81%A4 ↩
NHK. 「NHK ONE らじる★らじるラジオ配信アプリ」公式アプリ説明 (App Store / Apple)（同時配信 live simulcast + 聴き逃し catch-up of hourly radio newscasts; 読むらじる。read-along text; 日本国内限定 IP-enforced）. https://apps.apple.com/jp/app/id473937342 ↩ ↩² ↩³
NHK. 「NHK ONE らじる★らじる」(ja.wikipedia, citing NHK service pages)（catch-up audio kept roughly one week; 読むらじる。companion text）. https://ja.wikipedia.org/wiki/NHK_ONE_らじる★らじる ↩ ↩²
矢野香. 『【NHK式＋心理学】一分で一生の信頼を勝ち取る法 NHK式7つのルール』. ダイヤモンド社, 2014（former-NHK-announcer reference: news-reading standard ~300 字/分, working range ~300–350 字/分）. https://diamond.jp/articles/-/56483 ↩
轟里香. 「ニュース番組で用いられる言語の変化について」. 『北陸大学紀要』第32号, 2008, pp. 121–133（radio news as fuller 書き言葉 register; reporting passive 〜とみられます, hearsay 〜ということです verbatim from NHK radio newscasts）. https://www.hokuriku-u.ac.jp/library/libraryDATA/kiyo32/kyou3.pdf ↩ ↩²
バラエティ番組. Wikipedia (Japanese)（comedian-driven multi-segment genre; talk + コント + games; お笑い芸人-centered; limitation: encyclopedia entry, tier-5; used for genre-description context only）. https://ja.wikipedia.org/wiki/バラエティ番組 ↩ ↩² ↩³
Tsutsumi, Hideo. "Conversation Analysis of Boke-tsukkomi Exchange in Japanese Comedy." New Voices in Japanese Studies, vol. 5, December 2011, pp. 147–173（boke-tsukkomi as fast adjacency pair; manzai's Osaka/Kansai roots）. https://newvoices.org.au/volume-5/conversation-analysis-of-boke-tsukkomi-exchange-in-japanese-comedy/ ↩ ↩²
ダウンタウンのガキの使いやあらへんで!. Wikipedia (Japanese)（NTV, first aired 4 October 1989; Kansai-ben in the title やあらへん = 標準語〜ではない／〜じゃない）. https://ja.wikipedia.org/wiki/ダウンタウンのガキの使いやあらへんで! ↩ ↩²
水曜日のダウンタウン. Wikipedia (Japanese)（TBS, first aired 23 April 2014; fast panel talk, comedian slang, dense in-references）. https://ja.wikipedia.org/wiki/水曜日のダウンタウン ↩
HikakinTV. Official YouTube channel（native vlogger; full conversational speed; frequent on-screen Japanese text; no guaranteed native subtitles）. https://www.youtube.com/channel/UCZf__ehlCEBPop-_sldpBUQ ↩ ↩² ↩³ ↩⁴ ↩⁵
That Japanese Man Yuta. Official YouTube channel（subtitled street interviews. Interviewees at natural conversational speed, subtitles keep it followable）. https://www.youtube.com/channel/UCn7LyBvG5LEBXK9I4W5dGdA ↩ ↩²
NHK きょうの料理 / みんなのきょうの料理. Official program & recipe site（long-running NHK cooking program since 1957; narrow repeating vocabulary; calmer native on-ramp）. https://www.kyounoryouri.jp/ ↩

Overview​

What makes good shadowing material​

The four fitness tests: short, clear, transcribed, single-speaker​

How "by level" is calibrated, not absolute​

How to use this map​

Shadowing material vs. comprehension-listening material​

N5: textbook audio and simplified news​

Textbook audio (Genki, Minna no Nihongo)​

Simplified news and beginner podcasts​

N4: Nihongo Con Teppei and slow learner podcasts​

Nihongo Con Teppei and Teppei Beginner​

YuYu Nihongo and other comprehensible-input shows​

N3: Sakura Tips and clean anime clips​

Sakura Tips: slow-but-natural single-host audio​

Anime clips: pick calm, single-speaker scenes​

N2: J-drama and NHK news​

Japanese drama (dorama): realistic conversational rhythm​

NHK Radio/Web News: formal-register shadowing​

N1: variety shows and native vloggers​

Variety shows: the unscripted native ceiling​

Native vloggers and YouTube channels​

Good to know​

The transcript is non-negotiable for shadowing​

Shadow one notch below your comprehension level​

Don't ship anime/variety register into real speech​

The JLPT-audio trap​

See also​

References​

Footnotes​