Onomatopoeia in Manga and Anime: ドカン, バーン, シーン
In manga and anime, onomatopoeia is the dense layer of drawn sound words that fill a panel: the explosive ドカン, the loud バーン, and the famous シーン that paints silence onto a soundless page.1 If you can already sound out katakana, learning to decode these words is the bridge between dictionary-clean study and native Japanese media.
Overview
A manga panel cannot play audio. Where a film carries an explosion or a heartbeat on its soundtrack, the page has to draw that sound. That is why manga relies so heavily on onomatopoeia as lettering inside the art.1 These are the big, stylized katakana that crowd action scenes and the small, thin ones that mark a hush.
This article is a capstone for the onomatopoeia subcategory. The full four-class system (擬声語, 擬音語, 擬態語, and manner words) is taught in the hub article, so those class names appear here only in passing. The focus here is how sound words behave in manga and anime: as drawn art, as the silence paradox, as impact and motion SFX, and as a translation problem.
Why manga overflows with onomatopoeia
Japanese has a very large inventory of conventional mimetic expressions, far more than English, which is part of why manga can lean on them so freely.2 Most of what fills a panel as a sound effect imitates an actual noise: the bang of an impact, a footstep, a heartbeat.3
A smaller but striking group describes a soundless state yet is printed as if it were a noise. The headline case is シーン for silence.31 Manga reaches for so many of these words for a structural reason.
Because manga cannot actually play sounds, drawn onomatopoeia carries what a film would carry on its soundtrack. The word シーン, the "sound" of silence, became necessary precisely to emphasize the complete silence of particular scenes.1
The exact size of the inventory is best stated qualitatively. One practical measure is a community sound-effect (SFX) database that catalogues well over a thousand entries, with 1,329 tied to specific manga examples.2
SFX as drawn lettering, not captions (書き文字 / 漫符)
In manga, onomatopoeia appears as hand-drawn lettering inside the panel, not as set type outside the art.1 This 書き文字 (drawn lettering) is central to creating a vivid panel.1
The lettering is a design element. Artists and editors choose font and size for nuance, drawing loud sounds and intense emotion large and bold while quiet scenes use small, thin characters.14
The precise scholarly term for this is 音喩 (on'yu), which means onomatopoeia depicted as written lettering in manga.5 Manga critic Natsume Fusanosuke popularized the term. His essay analyzes how hand-drawn onomatopoeia conveys information about time and space as part of the narrative flow, a feature he argues is characteristic of Japanese comics.6
One neighboring term is easy to confuse with this, so it is worth keeping straight.
The drawn sound lettering is 音喩 / 書き文字. By contrast, 漫符 (manpu) names a different convention: non-lettering emotion symbols unique to manga, such as sweat drops, the cross-shaped anger mark, and popping veins. These are signs that visualize emotions and sensations.7 The 漫符 convention does not cover sound-effect lettering.7
How manga SFX are written and why they look that way
Katakana vs hiragana: sharp impact vs soft texture
Manga SFX are usually written in katakana, the angular syllabary, which makes them stand out visually from the rounded hiragana of dialogue.3 The script choice also carries a feel.
Sound effects written in hiragana feel softer, while sound effects written in katakana tend to feel sharper.8 Katakana reads as loud, hard, or mechanical. Hiragana reads as gentle or organic, even when the phonetic value is identical.
The clearest illustration is a heartbeat. ドキドキ in katakana reads as a louder, more agitated heart than どきどき in hiragana, though the two are the same word. ドキドキ is also one of the few panel-SFX words common enough in ordinary prose to appear clearly in a corpus.
心臓がドキドキした。9
"My heart was pounding."
This sharp-versus-soft contrast is the practical payoff of the katakana-for-emphasis and katakana-versus-hiragana rules taught in the writing-systems articles. Here, it is simply a feel choice an artist makes for each sound.
Vowel lengthening, small ッ, and the chōonpu ー
Manga SFX are often stretched and clipped on the page, so they may not match any dictionary headword. Four changes do most of the work.
A lengthened vowel, written with the chōonpu ー, drags the sound out. It marks the sound as larger, more intense, or longer-lasting.8 So バーン is a bigger, more sustained バン, and シーン is silence held.
A small ッ (the sokuon) cuts off the airflow. This adds emphasis in a shorter, sharper, more staccato way.8 So ゾクッ is a single sharp shiver against the drawn-out ゾクゾク.
Voicing matters too: a voiceless consonant sounds lighter and softer, while a voiced one sounds heavier and louder.8 ドゴォ reads heavier than an unvoiced alternative, and バキ heavier than its base. An artist may also write a single beat instead of the standard reduplicated form. That makes the action read as one momentary event.8
The chōonpu ー taught in the long-vowels-in-katakana article does concrete work here: it is the visual lever for "hold this sound." Reading バーン as a longer バン and シーン as held silence uses the same mechanic in drawn lettering.8
Reading SFX that bend or wrap around the art
SFX lettering is composed as part of the illustration, positioned within the panel rather than apart from it.18 Because it is integrated into the art and styled freely, beginning readers often fail to notice it as text at all and read it as background pattern.
The practical consequence is that an SFX can be oriented or styled in ways that hide it from a reader expecting horizontal type. Part of learning to read manga is learning to see the lettering as language in the first place.
The core SFX families you will meet
Most of these words live only as drawn panel lettering, so running-text corpus sentences are naturally scarce. Where a verified corpus sentence exists, the entry below includes a two-line example block. Where the word is essentially panel-only, it appears as a sourced gloss in a table. The class names and formal definitions route to the hub.
Impact and explosion: ドカン, バーン, ドゴォ, ドン, バキ
These are the main action sound effects: the noises of hits, blasts, and breaks. They form a family that scales from a single heavy thud up to a sustained explosion.
| SFX | Gloss | Typical panel |
|---|---|---|
| ドン | a heavy thud or boom, the foundational impact sound3 | a forceful landing, a dramatic reveal |
| ドカン | a large explosive blast or heavy impact10 | an explosion, a big collision |
| ドゴォ | a heavy, weighty impact; voiced ゴ and small ォ make it read heavier and more forceful than a plain ドカ8 | a crushing blow |
| バーン | a loud bang or blast; the ー marks it as a bigger, more sustained バン8 | a gun, a door flung open, a burst |
| バキ | a hard crack or snap of something breaking; voiced and sharp83 | a bone, a branch, armor giving way |
The nuance between near-twins comes from script and lengthening rather than separate vocabulary: ドン (single heavy thud) intensifies to ドカン (explosive burst), and バン (sharp bang) lengthens to バーン (a bigger, held bang).8
Among these, only ドカン is attested in running text here. The corpus shows it for dynamite going off.
ダイナマイトがドカンと爆発した。10
"The dynamite went off with a bang."
Motion and action: ダッ, ザッ, ヒュン, ゴゴゴ
This family marks movement and manner of motion. All four are panel-lettering SFX with no clean running-text attestation, so they appear as sourced glosses here.
| SFX | Gloss | Typical panel |
|---|---|---|
| ダッ | a sudden dash; the small ッ clips it to a single sharp burst of motion8 | a character bolting off |
| ザッ | a single crunch or swish of footsteps, gravel, or a brisk movement; clipped by the small ッ8 | a sharp turn, boots on gravel |
| ヒュン | a quick whoosh, something slicing through the air3 | a thrown object, a fast swing |
| ゴゴゴ | a low, ominous rumble; a sustained reduplicated low sound3 | mounting tension or dread |
The manner-of-motion words themselves belong to the Movement Onomatopoeia article, which covers them in depth. Here, they are just the panel labels.
The sound of silence: シーン (and the gitaigo-as-SFX paradox)
シーン (also written しーん, し〜ん, or しんと) is printed in a panel as if it were a sound, yet it names a state in which no sound is made. The dictionary defines しいん as a word for the state of not a single sound being heard: a hushed, quiet state. Its example is a temple hall that has fallen completely silent.11
Because it describes a soundless state rather than a noise, it works as a state-describing mimetic placed where a noise word would normally go. The formal class label routes to the hub. Manga adopted it precisely because the page has no soundtrack, so silence had to be made visible.1
Tezuka Osamu said he was the one who started writing シーン in silent scenes. He put it as "writing シーン in scenes where not a single sound is made is, to come clean, something I myself started," and dated the first use to his work 新世界ルルー (Shin Sekai Rurū, 1951–1952).12 This is his own reported testimony, not an independently verified origin. Other authorities discuss シーン and Tezuka's role in manga's verbal play without confirming that he coined the word specifically.56
The hiragana しんと and しいん are the dictionary headwords for running prose. The katakana シーン with the chōonpu is the panel styling of the same word.11 For English readers, it can be glossed as very loud silence or dead silence.13
部屋はシーンとしていた。14
"There was quiet in the room."
会場内は、シーンと静まり返っていました。15
"The entire venue was silent."
その知らせを聞いて、皆シーンとしてしまった。16
"A blanket of silence fell over everyone when they heard the news."
辺りはしんとしていた。17
"It was quiet all around."
Atmosphere and emotion in the panel: ドキドキ, ゾクッ, ニヤッ
This family carries the emotional weather of a scene. ドキドキ, the pounding heart of nervousness or excitement, is one of the few with abundant running-text attestation. It appeared earlier in the katakana-versus-hiragana section.9 The other two are panel-only glosses.
| SFX | Gloss | Typical panel |
|---|---|---|
| ゾクッ | a single sharp shiver of chill, dread, or thrill down the spine; the small ッ clips ゾクゾク to one beat8 | a sudden scare, a thrill |
| ニヤッ | a sudden smirk or sly grin spreading; a facial expression drawn as if it had a sound3 | a scheming or knowing look |
The feeling-word inventory itself belongs to the Emotion Onomatopoeia article. This section names the panel use and points there rather than re-teaching the words.
From manga page to anime soundtrack
Voiced and spoken onomatopoeia in anime
When manga is animated, the words split by what they name. Sound words that name an actual noise, such as an explosion or heartbeat foley, tend to become real sound on the soundtrack.1 State-describing words that name a soundless condition cannot be played. Instead, they surface as dialogue, with characters literally saying the mimetic word aloud.1
This is why anime characters can be heard saying words like ドキドキ (heart-pounding nervousness) and ニコニコ (beaming, smiling), which describe states rather than noises. Spoken aloud, they read as playful narration of a character's inner state.43
トムはいつもニコニコしてるね。18
"Tom is always smiling."
The mechanic of turning a mimetic word into a verb or adverb belongs in the Onomatopoeia + する article. Here it is enough to note that the panel word and the spoken word are the same word.
Reading translated manga: why the SFX look different (or stay Japanese)
Four ways translators handle SFX: leave, transliterate, gloss, redraw
Localizers, whether official publishers or fan scanlation teams, handle each drawn SFX in one of four ways. The choice depends on how deeply the lettering is fused into the art and how much the reader needs the meaning spelled out.
Leaving the SFX untranslated means keeping it on the page as it is. This is often chosen because the lettering is beautifully integrated into the artwork and removing it would damage the panel.193 Transliteration respells the Japanese script in Roman letters, so ゴクン is written as gokun.19
Glossing places a small translation beside the original, either describing the meaning or substituting the closest English onomatopoeia.19 Redrawing replaces the Japanese lettering with English drawn back into the art. It is the most labor-intensive option precisely because the lettering is fused into the illustration. A common compromise leaves the original and adds small English text beside it.193
Some licensed publishers also include an index at the back of the book listing sound-effect translations.19 For decoding on your own, a reading-indexed SFX database exists for exactly this problem. It is a free online dictionary for Japanese-to-English SFX, searchable by the first katakana character, with well over a thousand entries and 1,329 manga-referenced examples.2
The same forceful-door and banging SFX show how a word can be panel-only in its sharpest forms but still appear in ordinary prose when it also works as everyday vocabulary.
バタンとドアを閉めた。20
"I slammed the door shut."
玄関のドアがバタンと閉まるのが聞こえた。21
"I heard the front door slam shut."
誰かが壁をドンドンと叩く音が聞こえたような気がした。22
"I thought I heard someone banging on the wall."
Good to know
Looking up the panel form instead of the dictionary form
Panel SFX are clipped, stretched, voiced, and reduplicated in ways that may not match a textbook entry. The lengthened vowel, the small ッ, voicing, and the single-beat trick all push the lettering away from its headword.8 This is a documented deciphering step in real translation practice, not a learner superstition.8
Normalize the form before searching: strip the ー, restore the reduplication, and drop the small ッ. Better still, search a reading-indexed SFX database that already indexes the surface forms.2
SFX are an immersion skill, not exam vocabulary
Panel SFX such as ドカン, ドゴォ, バーン, and シーン-as-lettering are not JLPT testing targets and are not leveled on the exam, even though they appear everywhere in native media. The N3+ banding for this topic reflects audience readiness, fluent katakana and prior exposure to onomatopoeia mechanics. It does not reflect exam content scope.
Set the expectation accordingly: reading SFX is a decoding skill for immersion. It is separate from grammar study and best built by reading native material.
シーン: a "sound" drawn in for no sound
The retention hook for シーン is the paradox itself. It names a soundless state, defined in the dictionary as not a single sound being heard. Yet it is printed like a sound effect because the silent page needed silence made visible.111
Tezuka Osamu said he was the one who started writing シーン in silent scenes, in 新世界ルルー (1951–1952).12 Treat that line as a reported claim rather than a proven origin. It still works well as a single memorable anchor for the whole convention.
Where to look an SFX up
Two practical lookup paths cover most cases. Reading-indexed SFX dictionaries let you search by the first katakana character of the effect.2 Licensed releases sometimes include a back-of-volume index that lists the sound-effect translations for that book.19
Not every SFX is in any dictionary
Mangaka sometimes invent their own written forms, and for those you may never find a resource that explains what the sound effect means.8 When both normalization and database lookup fail, read the SFX from panel context: its font, its size, and what is happening in the scene. This is the same way native readers infer an invented sound.1
See also
- Texture and Appearance Onomatopoeia: フワフワ, ザラザラ, ベタベタ
- Mokuro: OCR for Japanese Manga
- Learning Japanese From Anime: The Honest Guide