In 2008, a team of linguists climbed steep hillsides in Arunachal Pradesh, India's remote northeastern state, going door-to-door among bamboo houses on stilts. They were looking for two languages — Aka and Miji — both poorly documented, both endangered. What they found instead was something that shouldn't have been possible: a third language, entirely unknown to science, spoken by people living alongside the communities they'd come to study.
The language was Koro. And its discovery wasn't just a scientific surprise. It was an indictment.
The Permit Zone That Became a Linguistic Black Hole
Arunachal Pradesh requires a special permit to enter. That bureaucratic fact has had profound consequences for linguistics. ScienceDaily reported that because of this access restriction, few linguists had worked in the region, and no one had drawn up a reliable list of languages spoken there — their locations, their structures, their speaker counts. The state was, in the words of the expedition team, a "black hole" on the linguistic map.
This matters because Arunachal Pradesh sits inside one of the densest linguistic zones on earth. Northeast India hosts over two hundred distinct native languages, according to documentation of the region's endangered language crisis, spanning Tibeto-Burman and Austroasiatic families, shaped by thousands of years of communities separated by thick jungle, mountain ranges, and roaring rivers. The region was always known to be linguistically extraordinary. What wasn't known was how much of it remained uncharted.
Koro belongs to the Tibeto-Burman family — a group of roughly 400 languages including Tibetan and Burmese. Approximately 150 Tibeto-Burman languages are spoken in India alone. And yet the expedition team, led by National Geographic Fellows Gregory Anderson and K. David Harrison, could find no language closely related to Koro. It was distinct enough from its neighbors that it registered as a separate branch, not a dialect, not a variant — a language, with its own grammar, its own vocabulary, its own way of organizing the world.
What "Discovery" Actually Means Here
The word discovery is doing complicated work in this story. Koro's speakers knew it existed. Their neighbors knew it existed. What didn't exist was any record of it in the scientific literature, any entry in standard language surveys, any documentation that would have allowed a linguist sitting in a university library to know to go looking.
This is the mechanism of linguistic invisibility: a language can be spoken by hundreds of people, embedded in a living community, and still be functionally absent from the systems that determine which languages get studied, protected, or even acknowledged. The expedition team hadn't targeted Koro — they stumbled into it while recording Aka and Miji vocabularies, gradually noticing that some speakers were producing sounds and structures that didn't fit either language. The discovery was, in Anderson's own framing, accidental.
A ResearchGate paper on Indian language extinction notes that the formal identification of Koro — sometimes called Koro-Aka in the literature — came in 2010, when Anderson and Harrison published their findings as part of the Enduring Voices Project. By then, the language was already critically endangered: roughly 800 speakers, few of them under twenty years old, nothing written down.
That timeline is worth sitting with. The language was identified and simultaneously recognized as nearly gone. There was no window of comfortable documentation. The discovery and the eulogy arrived together.
The Documentation Race That Scholarship Is Losing
What Koro reveals about the broader situation in underdocumented language families is uncomfortable. A recent survey of the world's least understood language groupings makes the structural problem explicit: for many families, the current pace of documentation work is slower than the pace of language loss. The bottleneck isn't just access or funding — it's that the academic infrastructure required to do comparative work depends on individual languages being documented first, and that documentation depends on fieldwork that often can't happen fast enough.
Koro is a case study in what falls through that gap. It survived long enough to be found. The question the field can't answer with confidence is how many languages like it didn't — languages that slipped out of living memory before anyone with a recorder and a notebook made it through the permit process.
K. David Harrison, writing about the discovery, framed Koro's contribution as far exceeding its statistical weight among the world's languages. "Koro brings an entirely different perspective, history, mythology, technology and grammar to what was known before." That framing — language as complete knowledge system, not just communication tool — is the one that should haunt us. Every undocumented language is a hypothesis about how reality can be organized. Koro survived to be documented. The ones we haven't found yet are disappearing as hypotheses we never knew to test.
The Central Institute of Indian Languages and various NGOs are now working to document and revitalize endangered languages across Northeast India. Whether that effort reaches the region's remaining linguistic unknowns before they vanish is the question worth watching — not in the abstract, but in the specific: which communities in Arunachal Pradesh still haven't had a linguist knock on the door.
