λ Laesi
Languages

60+ languages, a dozen families.

Laesi's language list is the whole point. We prioritize coverage in the families mass-market reader apps ignore — and cover the mainstream ones too. Click any language for reading samples, a suggested library, and exactly what kind of lemmatization it has today.

Full morphology · Surface-form lookup · Coming soon

Nordic & North Germanic

The living and medieval languages of Scandinavia, Iceland, and the Faroes.
Icelandic Surface
Íslenska · 350,000

Icelandic preserves Old Norse grammar more faithfully than any other living language. Laesi ships it today with Wiktionary lookups; BÍN-grade lemmatization is on the roadmap.

Wiktionary (BÍN planned) →
Faroese Full
Føroyskt · 70,000

Spoken by 70,000 people on 18 islands. No major reader app supports it. Laesi does — with a GiellaLT morphological analyzer and Wiktionary lookups.

GiellaLT →
Norwegian Bokmål Full
Bokmål · written standard · ~5M users

About 85% of Norwegian books, newspapers, and media are written in Bokmål. Full spaCy lemmatization on day one.

spaCy →
Norwegian Nynorsk Full
Nynorsk · written standard · ~600k users

Built from Norway's rural dialects by Ivar Aasen in the 1850s. If you read Tarjei Vesaas or Jon Fosse, you want Nynorsk.

spaCy (Bokmål model) →
Danish Full
Dansk · 6 million

Danish grammar is simple; Danish pronunciation is famously not. Laesi focuses on what you study in text — vocabulary, idioms, phrasal verbs.

spaCy →
Swedish Full
Svenska · 10 million

Swedish has the largest literary output of the Nordic languages. Laesi handles its compound nouns, particle verbs, and two genders through spaCy.

spaCy →
Old Norse Full
Norrœnt mál · medieval language (9th–14th c.)

The medieval language of Iceland, Norway, and the Viking diaspora. Almost no reading app supports it. Laesi analyzes it with GiellaLT today; CLTK, a better fit for Old Norse, is on the roadmap.

GiellaLT (CLTK planned) →
Elfdalian Surface
Övdalska · 3,000

A conservative North Germanic variety of Älvdalen, Sweden, that kept features lost everywhere else. Surface-form lookup today.

Wiktionary →

West Germanic

German and its relatives, from Standard German to the regional and minority varieties.
German Full
Deutsch · 95 million

Full spaCy lemmatization with case, gender, and separable-verb handling. The base model also powers Swiss German and Bavarian.

spaCy →
Dutch Full
Nederlands · 24 million

Full spaCy lemmatization. Handles Dutch compounds, separable verbs, and diminutives.

spaCy →
Swiss German Surface
Schwiizerdütsch · 5 million

Lemmas are stored as Standard German forms — there's no standard Swiss German orthography — so lemmatization is partial but useful.

spaCy (Standard German) →
Bavarian Surface
Boarisch · 12 million

Lemmas are stored as Standard German forms. Partial lemmatization, full reading and lookup support.

spaCy (Standard German) →
Low German Surface
Plattdüütsch · 2.5 million

Low German as written in Germany (Sass orthography). Surface-form lookup against Wiktionary's Low German section.

Wiktionary →
Dutch Low Saxon Surface
Nedersaksisch · 1.5 million

Low German as written in the Netherlands (NSS). A separate language entry from German Low German.

Wiktionary →
Luxembourgish Surface
Lëtzebuergesch · 400,000

Surface-form lookup today. A national language with a small but growing written corpus.

Wiktionary →
West Frisian Surface
Frysk · 470,000

With the other Frisian languages, the closest living relative of English apart from Scots. Surface-form lookup against Wiktionary.

Wiktionary →
North Frisian Surface
Nordfriisk · 10,000

A cluster of endangered Frisian dialects on the German North Sea coast. Surface-form lookup.

Wiktionary →
Scots Surface
Scots · 1.5 million

The Germanic sister language of English, spoken in Lowland Scotland and Ulster. Surface-form lookup.

Wiktionary →

Slavic

East, West, and South Slavic — with full lemmatization for the major standards and script-aware handling.
Polish Full
Polski · 40 million

Full spaCy lemmatization across Polish's seven cases and complex consonant alternations.

spaCy →
Russian Full
Русский · 150 million

Full spaCy lemmatization with case, aspect, and Cyrillic handling.

spaCy →
Ukrainian Full
Українська · 40 million

Full spaCy lemmatization for Ukrainian's seven cases and verbal aspect.

spaCy →
Bulgarian Full
Български · 8 million

Full spaCy lemmatization, with Bulgarian's postposed definite article handled correctly.

spaCy →
Slovenian Full
Slovenščina · 2.5 million

Full spaCy lemmatization, including the rare dual number.

spaCy →
Macedonian Full
Македонски · 1.6 million

Full spaCy lemmatization for the South Slavic language of North Macedonia.

spaCy →
Croatian Full
Hrvatski · 5 million

Full spaCy lemmatization. The Croatian model also backs Serbian (Latin).

spaCy →
Serbian (Latin) Full
Srpski · 9 million

Lemmatized via the Croatian model with a Cyrillic-leak guard. A dedicated Serbian model is planned.

spaCy (Croatian model) →
Serbian (Cyrillic) Surface
Српски · 9 million

A separate language entry from Serbian (Latin). Surface-form lookup; CLASSLA lemmatization planned.

Wiktionary →
Czech Surface
Čeština · 10 million

Surface-form lookup today. A Stanza-based lemmatizer is planned.

Wiktionary →
Slovak Surface
Slovenčina · 5 million

Surface-form lookup today. A Stanza-based lemmatizer is planned.

Wiktionary →
Belarusian Surface
Беларуская · 5 million

Surface-form lookup today. A Stanza-based lemmatizer is planned.

Wiktionary →
Bosnian (Latin) Surface
Bosanski · 2.5 million

Surface-form lookup. A separate entry from Bosnian (Cyrillic); CLASSLA lemmatization planned.

Wiktionary →
Bosnian (Cyrillic) Surface
Босански · 2.5 million

Surface-form lookup. A separate entry from Bosnian (Latin).

Wiktionary →

Uralic & Finno-Ugric

Finnic, Sámi, and their relatives — agglutinative, morphology-heavy, and underserved.

Don't see your language?

Add it yourself — Laesi's custom-language tool handles minority dialects, conlangs, and niche historical varieties. Or email us: if there's a GiellaLT, CLTK, spaCy, or Stanza analyzer we can wire in, we'll prioritize it.