Skip to main content

The Immersion Method for Learning Japanese: AJATT, MIA, and Refold Explained

The immersion method that Japanese learners debate is a self-study model. It treats high-volume native-speaker material as the primary engine of acquisition, not as a textbook subject to be studied.12 It is the doctrine behind the AJATT, MIA, and Refold movements, and it descends directly from Stephen Krashen's theory of comprehensible input.31

Overview

This article is a hub. It defines immersion as a method, traces the AJATT to MIA to Refold lineage as history rather than advocacy, and ties the philosophy back to the second-language-acquisition research it grew out of.312

It also states plainly where immersion breaks down: where pure passive exposure underperforms, where the maximalist lifestyle becomes unsustainable, and where native Japanese before a foundation is opaque in a way European-language immersion is not. The four named methods each get a dedicated deep-dive elsewhere; here they appear only as a lineage.

What "Immersion" Means in Language Learning

Immersion as a method, not just "living in Japan"

In second-language acquisition, "immersion" most precisely means an instructional model in which the target language is the medium of input and instruction, not merely a subject of study.4 The Canadian French immersion programs Merrill Swain studied are the canonical academic referent for the term.4

Self-study learner communities use the word more narrowly. In the AJATT, MIA, and Refold tradition, "immersion" means high-volume consumption of native-speaker material (audio, video, and text) as the primary driver of acquisition, explicitly without relocation to the target-language country.12

Geographic relocation is not part of the method. AJATT's own self-description stresses reaching fluency "without having to travel to the country, take classes, use outdated and boring textbooks." It frames immersion as a media and lifestyle practice rather than a place.1

The word carries three different senses

In learner discourse, "immersion" can mean geographic immersion (living in Japan), passive or ambient immersion (target-language audio playing in the background), or active immersion (focused consumption with look-up and study). Communities in the AJATT lineage use the second and third senses; a lay reader usually means the first.52

Input over output, acquisition over study

The core immersion premise restates Krashen's position that comprehensible input is the crucial and necessary ingredient for language acquisition.36 It also adopts his distinction between acquisition (a subconscious process) and learning (conscious knowledge of rules).3

Krashen's acquisition-learning distinction holds that consciously "learned" grammar rules cannot become subconsciously "acquired" competence; in his model, only comprehensible input drives acquisition.3 This is the theoretical backbone of the input-first slogan immersion communities adopt.

This article will qualify that slogan rather than repeat it uncritically. The sections below set the counter-evidence (from Richard Schmidt and from Swain) directly against it.74

The Comprehensible-Input Foundation

How immersion theory descends from Krashen

Krashen formulated the input hypothesis in the late 1970s and gave it its standard book-length treatment in Principles and Practice in Second Language Acquisition (1982) and The Input Hypothesis (1985).36 The immersion movements inherit their central claim from these works.

The mechanism is the i+1 formula. Krashen holds that a learner moves from current competence i to the next level by understanding input that contains structures slightly beyond their level: "we acquire... only when we understand language that contains structure that is 'a little beyond' where we are now."3

The decisive word is "understand." Acquisition depends on input being comprehended, not merely heard. Comprehension, aided by context and extralinguistic cues, is what lets the unknown +1 structures be acquired.36

Krashen adds an affective filter. Anxiety, low motivation, or low self-confidence raise a barrier that blocks input from being acquired even when it is comprehensible. This is why immersion communities argue for enjoyable, self-selected native media.3

Where the theory and the practice diverge

Krashen's model assumes the input is comprehensible. For a true beginner, raw, unedited native Japanese aimed at native adults is almost entirely incomprehensible. Simply switching on native media does not meet the i+1 condition.32

Bridging that gap is the practical problem every immersion method had to solve: making native input comprehensible, or building a path toward it.32 The lineage's history is largely a history of different answers to it.

A standing critique of the input hypothesis

Krashen's input hypothesis has been criticized within the field as difficult or impossible to test. Because i and i+1 are not independently measurable, critics including McLaughlin and Gregg argue that the hypotheses are not falsifiable as stated.8

The strongest practical objection to a passive reading of immersion comes from Schmidt's noticing hypothesis. In its 1990 formulation, learners convert input into "intake" only for features they consciously notice, and in the original strong form "subliminal language learning is impossible."7

Schmidt later softened the claim. His 1994 revision allowed that more noticing yields more learning, without every feature requiring conscious notice. The durable takeaway still holds: attention to form matters, so passive ambient audio alone is a weak acquisition engine.7

That contrast is the load-bearing point of this article. The claim that pure passive immersion underperforms is grounded in Schmidt's hypothesis that intake requires attention, not merely asserted as community folklore.7

The Historical Movements: AJATT to MIA to Refold

The three named movements form a single line of descent. A single input-first doctrine was gradually systematized, then generalized and softened into a staged, multi-language program.192

Treat the dates that follow as historical, with one important asymmetry. The 2006 origin is attested on the AJATT site itself. The later 2018 and 2020 dates come from secondary reporting and are hedged accordingly.192 Founders' real-world identities are out of scope wherever they are not reliably attested.

AJATT: Khatzumoto's "All Japanese All The Time"

AJATT ("All Japanese All The Time") is a self-study method published from 2006 on the site alljapanesealltheti.me, by a creator using the pseudonym "Khatzumoto."1 No reliably citable legal name stands behind that pseudonym. This article therefore treats "Khatzumoto" as a handle and asserts no real surname.

AJATT's framing is total lifestyle saturation in Japanese. Its stated goal is reaching high functional ability through self-directed immersion "without having to travel to the country, take classes, use outdated and boring textbooks, or spend any money."1

The site also offers a founder narrative. The AJATT account states that the creator began Japanese in 2004 and reached high reading and working ability over roughly eighteen months, largely from outside Japan. This is the site's own self-report, presented as motivation rather than verified fact.1

AJATT's mechanics, such as its kanji-first approach and its no-English rule, belong to the dedicated AJATT deep-dive. Here it stands as the maximalist origin point of the lineage.

MIA: the Mass Immersion Approach

The Mass Immersion Approach (MIA) systematized AJATT-style immersion and is associated with the creator behind the "Matt vs Japan" channel. The method was promoted under the MIA brand at massimmersionapproach.com, which now redirects to mattvsjapan.com.9 This article treats "Matt vs Japan" as a public handle and asserts no legal surname behind it.

The MIA brand's exact launch year is reported only loosely across sources. It is commonly placed around 2018, with the brand, website, and community taking shape across the late 2010s; no single authoritative founding date is available from a primary source.9

MIA's contribution was structure. It kept AJATT's input-first core and added a more formal program: heavy native-input immersion, sentence-level spaced-repetition mining (building review cards from sentences met during immersion), and explicit pitch-accent and pronunciation training.9

The pronunciation emphasis is the MIA brand's signature, and its detailed mechanics belong to the dedicated MIA deep-dive. MIA is also generally described as a collaboration whose partnership later divided. Because secondary sources disagree on the people involved, this hub describes the split at the project level only.

Refold: the language-agnostic rebrand

MIA was rebranded and generalized as Refold (refold.la). It was repositioned as a language-agnostic method usable for any target language rather than Japanese specifically.2 This is commonly dated to 2020, but Refold's own pages do not state a founding year, so the date rests on secondary reporting rather than Refold's own copy.2

The sequencing changed. Refold organizes study into explicit numbered stages and "domains": Stage 1 covers the sounds, writing system, a basic vocabulary base, a skim of basic grammar, and the beginning of native-content consumption. Later stages widen into broad immersion.2

The "domains" idea is the distinctive Refold strategy. A learner narrows immersion to a manageable subset of the language, masters it, then expands, which keeps input near the i+1 edge as comprehension grows.2

The engine stayed the same. The input-first, immersion-as-driver core is unchanged from MIA and AJATT. Refold still treats massive comprehensible native input as the primary source of acquisition.2

Around the rebrand, the original partnership divided into two projects: Refold, the methodology platform, and Migaku, a separate tools and software project.2 Sources agree a split occurred but disagree on the individuals' names, so this article keeps the description at the project level.

What Works and What Doesn't

Active vs passive immersion

Passive immersion has a real but modest benefit, mostly for listening. The speech-perception literature shows that sustained exposure reshapes which sound contrasts a listener can discriminate. It also shows that listeners can extract statistical regularities, such as word boundaries, from a stream of speech.10

Read the passive-immersion evidence carefully

This evidence base is drawn primarily from infant first-language acquisition (Kuhl; Saffran and colleagues). It supports the perceptual-tuning mechanism, but it is not direct proof that passive native audio teaches adult second-language learners. It justifies a modest benefit, not a strong endorsement.10

Comprehension is a different matter. Because intake requires attention and noticing, ambient audio the learner is not attending to yields little new grammatical or lexical knowledge on its own. The real gains come from active immersion: focused attention, looking words up, and sentence mining.7

The split is clean enough to state as a rule. Passive immersion builds sound discrimination, which is real but limited; active immersion is where comprehension and vocabulary growth actually happen.710

The "all Japanese all the time" cost

The original AJATT framing is total saturation: all Japanese all the time. It presents that maximalism as the point rather than a drawback.1 For most learners with jobs, school, or family obligations, that level of saturation is demanding to sustain.

This is a trade-off to surface, not a verdict to hand down. There is no sourceable statistic for how many learners abandon the maximalist regimen. The cost is therefore best framed qualitatively: a high adherence burden and a sustainability problem, rather than an invented number.1

The lineage itself reflects the strain. Refold's staged model, with its "do what you enjoy and look things up when you feel like it" framing, is explicitly less all-or-nothing than AJATT's total-immersion stance.2 Later movements walked the maximalism back as they met real learners' constraints.

The Japanese-specific opacity problem

For an English speaker, raw native Japanese is opaque to beginners in a way native Spanish or French is not. Japanese uses three scripts at once: hiragana and katakana (the two syllabaries), plus kanji. The modern common-use kanji set is the 2,136-character Jōyō Kanji table, revised by the Japanese government in 2010.11

A learner cannot even decode written native material until a substantial script and kanji foundation is in place.11 There is also no inherited cognate vocabulary, or shared word stock, to lean on.

Japanese shares essentially no cognate vocabulary with English. Loanwords exist, but they are re-encoded phonologically and written in katakana, such as コンピューター for "computer." They therefore do not give the free lexical leg-up a European-language learner gets from cognates.11

The difficulty is measurable in training time. The U.S. Foreign Service Institute groups Japanese in its hardest tier for English speakers, estimating roughly 88 weeks (about 2,200 class hours) to reach ILR S-3/R-3, approximately CEFR B2/C1.12

The pedagogical consequence follows directly. Raw native Japanese fails Krashen's "comprehensible" precondition for far longer than native Spanish does for an English speaker. A deliberate foundation phase (kana, core grammar, and core vocabulary) is therefore near-mandatory before native input becomes comprehensible input.31112

Immersion and output

Input alone does not reliably build production. Swain's output hypothesis (1985), based on Canadian French immersion students, found that learners with years of rich comprehensible input still produced language well below their comprehension level.4

Swain argued that producing language pushes learners in ways comprehension does not. Output moves them from semantic processing (focusing on meaning) to syntactic processing (focusing on sentence structure). It forces them to notice gaps in their interlanguage and lets them test hypotheses about the language.4

The application to immersion is straightforward. A purely input-only regimen predictably underdevelops speaking, so an output component (speaking and writing practice) is needed to convert comprehension into production.4

Scaling Immersion by Level

Beginner: foundation first, then graded input

Because of the opacity problem, the defensible beginner sequence is foundation first. A learner builds kana, core grammar, and a core vocabulary base, efficiently using spaced repetition. Only then do they transition toward the most comprehensible native-adjacent input.2

This is the function Refold assigns to its Stage 1: sounds, writing system, a basic vocabulary base, a skim of grammar, and the start of content consumption.2 The foundation is the bridge to comprehensible input, not a detour around it.

At the bottom of the ladder, the i+1 target is graded or simplified input. Learner-aimed audio, graded readers, and slowed or curated native-adjacent media sit at the right level of difficulty. True native material is still above i+1 for a beginner.32

Intermediate and beyond: widening the native-input diet

The sourcing problem does not disappear past the foundation. It changes shape. Finding input at i+1 in Japanese remains harder than for major European languages, because there is no cognate scaffold and less learner-graded native content. Intermediate learners must therefore deliberately source comprehensible material.312

Refold's "domains" approach is a concrete way to manage this. Narrowing immersion to one show or genre, mastering its vocabulary and patterns, and then expanding keeps input near i+1 as comprehension grows.2

From there the move is to broaden. As each domain becomes comprehensible, the learner widens domains and registers (different genres, speakers, and formality levels). This continually chases the i+1 edge and is easier to sustain with a curated, rotating content library.32

Good to know

"Immersion" is overloaded

The single most common reader confusion is treating the three senses of "immersion" as one thing. The word can mean living in Japan, passive ambient target-language audio, or active study with lookups and sentence mining. A learner who does only the passive sense while expecting the results of the active sense will conclude that "immersion doesn't work."72

The senses are not interchangeable for a sourceable reason. Schmidt's noticing hypothesis predicts that unattended passive input yields little intake. A passive listening habit therefore cannot stand in for active study.7

Vendor-funnel awareness

Many of the highest-ranking "immersion guide" pages are owned by paid tools and also serve as product funnels. Vendor-published explainers tend to soft-pedal both the comprehensibility problem and the lifestyle cost.

The practical move is to separate the method, which is a public and sourceable idea, from any product that packages it. This is an editorial-honesty note rather than a linguistic claim. It does not require singling out any one competitor.

Why the movements keep rebranding

Read as a sequence, AJATT to MIA to Refold is a softening arc. Each successor walked back its predecessor's maximalism as it met real learners' constraints.192

AJATT's total "all Japanese all the time" lifestyle (around 2006) became MIA's structured program (around 2018). MIA then became Refold's staged, language-agnostic, "do what you enjoy" roadmap (around 2020).192 The 2006 date is attested on the AJATT site; the 2018 and 2020 dates are reported and hedged. The trajectory runs from one charismatic maximalist doctrine toward a gentler, more generalizable system.

See also

References

Footnotes

  1. AJATT (All Japanese All The Time), "About" page. https://alljapanesealltheti.me/about/index.html (primary self-description by the AJATT site; used for the founder's pseudonym, the 2006 framing, and the method's own claims, attributed as self-reported). 2 3 4 5 6 7 8 9 10 11 12 13 14

  2. Refold. "Roadmap" / method pages, including "2A: Domains." https://refold.la/roadmap/stage-2/a/domains/ and the Refold stage pages https://refold.la/simplified/stage-1/a/immersion/ (primary self-description of the Refold staged, language-agnostic method). 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

  3. Krashen, Stephen D. Principles and Practice in Second Language Acquisition. Pergamon Press, 1982. (Full text in the public domain at https://www.sdkrashen.com/content/books/principles_and_practice.pdf) Source of the input hypothesis, the i+1 formulation, the acquisition-learning distinction, and the affective filter hypothesis. 2 3 4 5 6 7 8 9 10 11 12 13 14 15

  4. Swain, Merrill. "Communicative Competence: Some Roles of Comprehensible Input and Comprehensible Output in its Development." In S. Gass & C. Madden (eds.), Input in Second Language Acquisition, Newbury House, 1985, pp. 235–253. The output (comprehensible output) hypothesis, based on Canadian French immersion data. 2 3 4 5 6

  5. Tatsumoto. "What's AJATT?" Tatsumoto Ren / Refold-adjacent community site. https://tatsumoto-ren.github.io/blog/whats-ajatt.html (community/partisan source; used only as a self-description of the AJATT doctrine, attributed as such).

  6. Krashen, Stephen D. The Input Hypothesis: Issues and Implications. Longman, 1985. Develops the input hypothesis and the comprehensible-input claim. 2 3

  7. Schmidt, Richard W. "The Role of Consciousness in Second Language Learning." Applied Linguistics, vol. 11, no. 2, 1990, pp. 129–158. The noticing hypothesis. 2 3 4 5 6 7 8

  8. Wikipedia contributors. "Input hypothesis." Wikipedia. https://en.wikipedia.org/wiki/Input_hypothesis (secondary summary; used only to corroborate the non-falsifiability critique attributed to McLaughlin and Gregg).

  9. massimmersionapproach.com, now redirecting to mattvsjapan.com (the Mass Immersion Approach / Matt vs Japan site; primary self-description of MIA and its association with the "Matt vs Japan" handle). 2 3 4 5 6 7

  10. Kuhl, Patricia K. "Early language acquisition: cracking the speech code." Nature Reviews Neuroscience, vol. 5, 2004, pp. 831–843; and Saffran, J. R., Aslin, R. N., & Newport, E. L. "Statistical Learning by 8-Month-Old Infants." Science, vol. 274, 1996, pp. 1926–1928. Evidence that exposure tunes phonetic discrimination and supports statistical segmentation (infant L1 evidence; cited for the perceptual-learning mechanism only, not as a direct claim about adult L2 immersion). 2 3

  11. 文化庁/文部科学省. 文化審議会答申「改定常用漢字表」, 平成22年(2010年)6月7日答申, 同年11月30日 平成22年内閣告示第2号「常用漢字表」. https://www.mext.go.jp/b_menu/shingi/chousa/shotou/076/shiryo/__icsFiles/afieldfile/2010/09/22/1297504_01.pdf (the 2,136-character revised Jōyō Kanji table). 2 3 4

  12. U.S. Department of State, Foreign Service Institute (FSI). Language difficulty / time-to-proficiency rankings, as compiled at https://www.fsi-language-courses.org/blog/fsi-language-difficulty/ (community compilation of the FSI list). The FSI groups Japanese among its hardest category and estimates 88 weeks / 2,200 class hours to reach ILR S-3/R-3 (≈ CEFR B2/C1). 2 3