Skip to main content

The i+1 Principle for Reading Japanese

The i+1 principle for reading says you acquire Japanese fastest from text you almost understand: material one notch above your current level, where you follow the meaning but still meet a thin layer of new words and structures.1 This article defines i+1 input, shows how to find it at each Japanese-Language Proficiency Test (JLPT) level, and gives a clear procedure for when a text turns out to be far too hard.

Overview: what i+1 means and where it comes from

The principle comes from Stephen Krashen's Input Hypothesis, a theory of how people acquire a second language rather than consciously study it.1 Its core idea is brief, but it can reshape how you choose every text you read.

The "i" and the "+1": your level plus one step

In Krashen's notation, i is the acquirer's current level of competence, and i + 1 is the next level, the increment just beyond where the learner is now.1

The hypothesis holds that "a necessary (but not sufficient) condition to move from stage i to stage i + 1 is that the acquirer understand input that contains i + 1." Here, to understand means the acquirer is "focussed on the meaning and not the form of the message."1 Krashen states the mechanism plainly: "We acquire ... only when we understand language that contains structure that is 'a little beyond' where we are now."1

The "+1" is comprehension-driven, not a fixed word count. We can understand structures we have not yet acquired because, in Krashen's words, "we use more than our linguistic competence to help us understand. We also use context, our knowledge of the world, our extra-linguistic information."1

The notation: "i + 1" vs. "i+1"

Krashen's original text writes the notation with spaces, as "i + 1".1 The compressed form "i+1" is the common rendering and is used throughout this article; they mean the same thing.

Krashen's Input Hypothesis in one paragraph

The Input Hypothesis and the "i + 1" formulation are stated in Krashen's Principles and Practice in Second Language Acquisition (1982)1 and developed at book length in The Input Hypothesis (1985).2

In Krashen's own framing, "the input hypothesis relates to acquisition, not learning," and "we acquire by understanding language that contains structure a little beyond our current level of competence (i + 1). This is done with the help of context or extra-linguistic information."1

One nuance matters for choosing reading material. Input "must contain i + 1 to be useful ... but it need not contain only i + 1"; Krashen argues that "if the acquirer understands the input, and there is enough of it, i + 1 will automatically be provided."1 The practical takeaway is simple: read broadly at roughly the right level rather than hunting for the one perfect next word.

It is a hypothesis, and a contested one. SLA (second language acquisition) research argues that Krashen's constructs are hard to define and to falsify, and that input alone may be insufficient for acquisition. Treat i+1 as a widely applied heuristic, not settled fact.1

Why reading is the easiest place to hit i+1

Krashen emphasizes comprehensible input as the engine of acquisition, and he argues that "we acquire vocabulary ... best through comprehensible input."1 Reading is a prime delivery channel for it.

A solo reader controls pace, can reread, and can look up a word at will. Reading therefore lets a learner tune the size of the "+1 dose" more finely than real-time listening or conversation does.

This follows from Krashen's point that the acquirer uses context and extra-linguistic information to bridge the gap.1 Reading maximizes the time available to deploy that context, which is why it is the easiest channel in which to land near i+1 on purpose.

How comprehensible is "comprehensible"? The two thresholds

Two different numbers do two different jobs here, and competing explanations tend to blur them. Keeping them distinct is the key to using the principle well.

The acquisition sweet spot vs. the comprehension threshold

The first number is the unassisted-comprehension threshold. Hu and Nation (2000) manipulated a 633-word narrative to known-word coverages of 80%, 90%, 95%, and 100%. They concluded that learners need to know about 98% of the running words (roughly one unknown word in fifty) for adequate unassisted comprehension; at 95% coverage, most readers did not reach adequate comprehension.3

Nation (2006) extended this with corpus analysis, or large text-sample analysis, of authentic English. About 98% coverage of novels and newspapers requires roughly 8,000–9,000 word families (plus proper nouns), while about 95% coverage requires roughly 3,000–4,000 word families.4 These coverage thresholds for fluency translate into concrete vocabulary targets. Nation later summarizes 98% as the goal for unassisted reading and about 95% as the level at which reading is possible with some support or inferencing.5

The second number is the acquisition zone, and it is a band rather than a single magic figure. Productive "+1" reading happens where the learner knows most words but still meets new items they can absorb.

Read about 95% coverage as "followable, still learning" and about 98% as "fluent, unassisted"; this 95-to-98 comprehension band is the practical target.345 Avoid pinning a single precise "sweet spot" percentage as if it were measured; the band is derived from the coverage studies, not cited to one study claiming a clean number.

For Japanese specifically, Komori, Mikuni and Kondō (2004) tested 61 learners from China, Taiwan, and Korea. They found that a known-word rate (既知語率) of about 96% was needed for learners to answer roughly 80% or more of reading-comprehension questions correctly; known-word rate correlated strongly with comprehension.6

How to read the Japanese 96% figure

The Japanese ~96% figure is the threshold for roughly 80% comprehension accuracy, not a full-unassisted-reading threshold. It sits between the ~95% and ~98% English lines and should be read that way, not as a Japanese equivalent of Hu and Nation's 98%.6

The shape of the spectrum is easier to see laid out as a line from too-easy through the productive band to too-hard.

Why too-easy input fails too

Input "must contain i + 1 to be useful for language acquisition."1 If a text is 100% known (call it i+0), there is no new structure to acquire from it.

Krashen contrasts i+1 with input "at the student's current level," which on his account does not drive acquisition.1 Friction you can absorb is the goal, not zero friction.

Easy reading still has a job. Day and Bamford's extensive-reading program, a high-volume approach to reading easy material, targets text at or just below the reader's comfort level. It is read "for pleasure, information and general understanding rather than 100% comprehension."7 Pure review reading builds speed and habit, but it does not supply the +1 that new acquisition needs.

A practical self-check while reading

The coverage research translates into an on-page test. At about 98% coverage roughly one word in fifty is unknown; at about 95%, about one in twenty.34

In practice, roughly one unknown item every sentence or two keeps you in the followable-but-learning band. If you stop every few words, you are below about 95% coverage and past a productive +1.

今日きょうあたらしい言葉ことばひとつだけてきた。8
"Today only one new word came up."

Day and Bamford give the heuristic for the easy end: a reader should be able to read "with few or no obstacles to understanding" and look words up rarely. A learner who is "looking up many words" has picked a text that is too hard for extensive reading.97

This self-check is really a test of whether you can guess from context. Krashen says the acquirer bridges i+1 using "context ... knowledge of the world ... extra-linguistic information." That is exactly the inference behavior the check looks for.1

Finding i+1 by level

The two-threshold section gives the target; this section translates it into actual Japanese materials at each stage. For a fuller cross-skill map, see finding i+1 input at each Japanese level. The JLPT bands below are a starting estimate, not a guarantee that a given book lands at i+1 for a given reader.

N5-N4: graded readers and Tadoku

Graded readers are level-controlled books designed to sit near i+1 for early learners: vocabulary and grammar are restricted per level and expand gradually. The NPO Tadoku (多言語多読, multilingual extensive reading) Japanese series runs from a Pre-Starter level through Level 5, with the publisher stating "controlled vocabulary and grammar suitable to each level."10

The published Tadoku level data maps headword counts (unique dictionary headwords) and book lengths to the JLPT scale.10

Tadoku levelLabelStated headwordsWords per bookPublisher's JLPT note
Pre-Starterabsolute beginner~2000–200none; pictures + audio
Level 0Starter~350200–400starting beginners
Level 1Beginner~350400–1,500≈ current N5
Level 2Upper-Beginner~5001,500–3,000≈ current N4
Level 3Lower-Intermediate~8002,500–6,000≈ current N3
Level 4Intermediate~1,3005,000–15,000≈ current N3–N2
Level 5Upper-Intermediate or above~2,0008,000–25,000≈ current N2–N1
The Tadoku table uses the old four-tier JLPT numbers

The publisher's own level notes are written against the pre-2010 four-tier JLPT scale (their "Level 1 of JLPT" maps to the current N5, and so on up). The column above translates them to the current five-level N5–N1 system; the original page still prints the old numbers.10

Tadoku's four rules of extensive reading put i+1 into practice: start from an easy level; read without a dictionary; skip over parts you do not understand and keep reading; and if a book stops working, set it aside and pick another.11

Day and Bamford describe the matching teaching approach. Extensive reading uses material that is easy (at or just below the reader's level), read in quantity, for general understanding rather than 100% comprehension.97

N3: the bridge into lightly-edited native material

N3 is the transition from engineered to selected input. The learner starts choosing real-world material and tuning difficulty by choice of text rather than by graded level.

Useful categories at this stage, each a generic category rather than an endorsement, include simplified or furigana-rich news, children's and middle-grade fiction, and furigana manga. Furigana are small kana readings printed beside kanji. Tadoku Levels 3–4 explicitly target this band, with grammar described as less controlled.10

A learner crossing into native text rarely clears about 98% coverage yet, so N3 reading typically sits in the ~95%, followable-with-effort band.35 That is why support tools such as furigana and lookups matter most at this stage.

N2-N1: native material chosen for density

At N2–N1, the useful selector is topic familiarity and known-vocabulary overlap, not a level label. The same reader may find a familiar-topic essay near i+1 and an unfamiliar-domain text far harder, because background knowledge is part of comprehension.1

Coverage climbs slowly here. Reaching about 98% coverage of general native prose requires on the order of 8,000–9,000 word families in the English studies, and each additional 1% of coverage costs progressively more vocabulary because the remaining words are low-frequency.45

Japanese shows the same long-tail dynamic: a high-frequency core covers most tokens, then coverage gains slow, which is why N2 to N1 reading feels like a long plateau. In practice, light novels and topic-matched essays tend to sit closer to i+1 than dense nonfiction or specialist writing does, even for the same reader.

Using vocabulary coverage to pick a text

The level ladder is only a starting estimate; a page sample is the actual measurement. Sample a page (or a few hundred running words), count the unknowns, and estimate coverage. Deliberately raising your known-word base is what shifts more material into the i+1 band over time.

Accept the text if you clear roughly 95% (about one unknown in twenty or better); it is comfortable and unassisted near 98% (about one in fifty).345

Raise your "i" so more text becomes i+1

The coverage check measures a text against your known-word base. Raising that base is what shifts more material into the sweet spot, and J-Compass recommends Amenokori to raise it. Its FSRS-scheduled (spaced-repetition), level-mapped (N5–N1) vocabulary and grammar decks grow the known-word percentage that lands a text near i+1, so a book that reads as i+5 today can become i+1 once the base catches up.

When the material is i+5: what to do with text that is too hard

"i+5" is this article's own informal shorthand for "many steps beyond i," not a term Krashen uses. Krashen's contrast is between i+1 and input that is simply too far beyond the learner; the "+5" is illustrative.

How to tell i+5 from i+1

Three diagnostic signals, grounded in the coverage research and the extensive-reading literature, separate too-hard from productive.

The first is lookup rate. If you need to look up more than roughly every other word, coverage is well below about 95% and far past a productive +1. Day and Bamford treat heavy dictionary dependence as the marker that a text is unsuitable for extensive reading.397

The second is loss of thread. Losing the meaning within a paragraph and rereading without gain indicates comprehension has dropped below the followable band.3

The third is an affective signal, or an emotional warning sign. Krashen's Affective Filter Hypothesis holds that high anxiety or frustration raises a "filter" so that "even if they understand the message, the input will not reach ... the language acquisition device."1

Rising frustration actively blocks uptake

On Krashen's account, frustration from a too-hard text is not merely unpleasant. A raised affective filter blocks the input from reaching acquisition even where the message is understood, so pushing on through mounting frustration works against the goal.1

Productive +1 friction looks different: one unknown per sentence or two, the thread intact, and frustration low.13

The three responses: drop, scaffold, or shelve

When a text reads as i+5, three responses recover a productive level. The choice among them depends on how attached you are to that specific text.

Drop a level to genuine i+1: move to easier graded or selected material until coverage is back near 95% or better.39 This is the Tadoku rule to pick another, easier book when you cannot continue.11

Scaffold the same text so that effective comprehension rises into the workable band. Use furigana, a pop-up or browser dictionary, a parallel translation, or sentence mining. Nation frames about 95% as the level at which reading works with support, versus 98% unassisted, and scaffolding is what closes that gap.5

Shelve it and return after more input. This is the Tadoku rule to set a too-hard book aside and come back later.11 It also fits Krashen's claim that i+1 is "provided automatically" once enough comprehensible input accumulates: the text becomes i+1 later without being forced now.1

Why "pushing through" rarely works

Decoding word by word is not reading for meaning. Krashen's condition for acquisition is that the acquirer be "focussed on the meaning and not the form of the message," which collapses when nearly every word is unknown.1

Below about 95% coverage, comprehension is inadequate and there is little to "go for meaning" with. Guessing from context fails because too many of the surrounding words are also unknown.3

The affective filter compounds the problem: frustration from forcing a too-hard text raises the filter and further blocks uptake.1 The useful reframe is level-matching, not endurance.

Good to know

i+1 is a direction, not a dial you can read off a label

Treating a JLPT level or a single percentage as the whole answer is the most common misstep. The same N3 reader can find a familiar-topic essay at i+1 and an unfamiliar-domain essay several steps too hard, because comprehension draws on "context ... knowledge of the world ... extra-linguistic information," not vocabulary alone.1

The coverage thresholds describe a specific text for a specific reader, not a global level you can wear like a badge.34

The percentages are research estimates, not guarantees

Reading the 95% and 98% figures as a pass/fail gate overstates them. They come from coverage studies on limited populations and vary by text type and reader. Hu and Nation's 98% rests on a regression with 66 university students in New Zealand on a single manipulated narrative;3 Nation's coverage counts differ between novels and newspapers;45 and the Japanese ~96% figure is from 61 learners against a comprehension-accuracy criterion of about 80%.6 Treat all of them as calibration, not a guarantee.

The Input Hypothesis itself is debated. Krashen states it as a hypothesis,1 and critical SLA research argues that its core constructs are hard to define and to falsify and that input alone may not be sufficient. Present i+1 as a useful, widely applied heuristic rather than settled fact.

"Comprehensible input" is not "no effort"

Misreading "comprehensible" as "effortless" leads learners to expect reading to feel free. i+1 still requires active attention, rereading, and inference. Krashen's "understand" means focused on meaning,1 and Day and Bamford expect reading "for general understanding rather than 100% comprehension," which still involves working through unknowns.7 Comprehensible means followable, not free.

See also

References

Footnotes

  1. Krashen, Stephen D. Principles and Practice in Second Language Acquisition. Pergamon Press, 1982 (internet edition 2009). pp. 20–21 (Input Hypothesis, "i + 1"), pp. 30–32 (Affective Filter Hypothesis). https://www.sdkrashen.com/content/books/principles_and_practice.pdf. 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26

  2. Krashen, Stephen D. The Input Hypothesis: Issues and Implications. Longman, 1985. (Book-length statement of the Input Hypothesis and comprehensible input; primary source, no open full text.)

  3. Hu, Marcella, and I. S. P. Nation. "Unknown Vocabulary Density and Reading Comprehension." Reading in a Foreign Language, vol. 13, no. 1, 2000, pp. 403–430. https://nflrc.hawaii.edu/rfl/. 2 3 4 5 6 7 8 9 10 11 12

  4. Nation, I. S. P. "How Large a Vocabulary Is Needed for Reading and Listening?" The Canadian Modern Language Review, vol. 63, no. 1, 2006, pp. 59–82. https://www.lextutor.ca/cover/papers/nation_2006.pdf. 2 3 4 5 6 7

  5. Nation, I. S. P. Learning Vocabulary in Another Language. 2nd ed., Cambridge University Press, 2013. (Coverage thresholds and word-family counts.) 2 3 4 5 6 7

  6. 小森和子・三國純子・近藤安月子「文章理解を促進する語彙知識の量的側面:既知語率の閾値探索の試み」『日本語教育』第120号, 2004, pp. 83–92. (Komori Kazuko, Mikuni Junko, Kondō Atsuko. "Bunshō rikai o sokushin suru goi chishiki no ryōteki sokumen: kichigoritsu no ikichi tansaku no kokoromi" ["Quantitative Aspects of Vocabulary Knowledge that Promote Text Comprehension: An Attempt to Explore the Threshold of the Known-Word Rate"]. Nihongo Kyōiku [Journal of Japanese Language Teaching], no. 120, 2004.) https://cir.nii.ac.jp/crid/1050564287990194304. 2 3

  7. Day, Richard R., and Julian Bamford. "Top Ten Principles for Teaching Extensive Reading." Reading in a Foreign Language, vol. 14, no. 2, 2002, pp. 136–141. https://nflrc.hawaii.edu/rfl/item/61. 2 3 4 5

  8. Constructed minimal example by the author for illustration; not drawn from a corpus. (See Examples block in "A practical self-check while reading.")

  9. Day, Richard R., and Julian Bamford. Extensive Reading in the Second Language Classroom. Cambridge University Press, 1998. (Extensive reading; "i minus 1" easy-material principle.) 2 3 4

  10. NPO多言語多読 (NPO Tadoku). "About Levels" and "Our Graded Readers." にほんごたどく (Tadoku for Japanese). https://tadoku.org/japanese/en/levels-en/ and https://tadoku.org/japanese/en/graded-readers-en/. 2 3 4

  11. NPO多言語多読 (NPO Tadoku). "How to start tadoku" (the four rules of extensive reading). https://tadoku.org/japanese/en/what-is-tadoku-en/. 2 3