Download NT2Lex
Feel free to download and use NT2Lex for your own research or teaching.
If you are using NT2Lex, please cite:
Tack, A., François, T., Desmet, P. & Fairon, C. (2018). NT2Lex: A CEFR-Graded Lexical Resource for Dutch as a Foreign Language Linked to Open Dutch WordNet. In Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications (pp. 137-146).
NT2Lex with Frog - CGN parts of speech
Dutch in receptive context · CEFR levels: A1 A2 B1 B2 C1
Basic version with canonical word forms and parts of speech only
We recommend this version for a general complexity analysis. Moreover, because it is the only version compatible with all other CEFRLex resources (for English, French, Swedish, ...), it is better suited to cross-lingual analysis.
receptive lexicon
includes word frequencies observed in textbook reading activities and simplified readers
CEFR levels
A1 · A2 · B1 · B2 · C1
lexical entries
lemma (simple/multi-word) · part of speech
15,227 items
labels
CGN tagset (Van Eynde, 2004) · simplified (cgn_to_nt2lex.yaml)
processing pipeline
Frog Tagger (Van den Bosch et al., 2007)
license
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
NT2Lex with Frog - CGN+ODWN parts of speech
Dutch in receptive context · CEFR levels: A1 A2 B1 B2 C1
Extended version with added word senses
We recommend this version for a more full-fledged and fine-grained analysis. However, since this version is currently the only lexical resource that includes word-sense disambiguated entries, it is less compatible with the other CEFRLex resources.
receptive lexicon
includes word frequencies observed in textbook reading activities and simplified readers
CEFR levels
A1 · A2 · B1 · B2 · C1
lexical entries
lemma (simple/multi-word) · part of speech · sense number · synset
17,743 items
labels
CGN tagset (Van Eynde, 2004) · simplified (cgn_to_nt2lex.yaml)
Open Dutch WordNet (Postma et al., 2016)
processing pipeline
Frog Tagger (Van den Bosch et al., 2007)
DutchSemCor WSD (SVM)
license
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
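As a rough illustration of how the entries described above (lemma, part of speech, and frequencies per CEFR level) might be used, the following Python sketch finds the first level at which each entry is observed. The file layout is an assumption: the tab-separated format, the column names (`lemma`, `pos`, `A1` ... `C1`), the tag labels, and all frequency values in the sample are invented for illustration and are not taken from the NT2Lex files. Check the downloaded file's actual header before adapting it.

```python
import csv
import io

# Hypothetical sample data: column names, tags, and frequencies are made up
# for illustration and do not come from the actual NT2Lex download.
SAMPLE = (
    "lemma\tpos\tA1\tA2\tB1\tB2\tC1\n"
    "huis\tN\t120.5\t98.0\t75.2\t60.1\t40.0\n"
    "fiets\tN\t0\t35.4\t20.0\t12.3\t8.8\n"
    "desalniettemin\tBW\t0\t0\t0\t2.1\t5.6\n"
)

# CEFR levels covered by the resource, in ascending order.
LEVELS = ["A1", "A2", "B1", "B2", "C1"]


def first_level(row):
    """Return the lowest CEFR level with a non-zero frequency, or None."""
    for level in LEVELS:
        if float(row[level]) > 0:
            return level
    return None


def load_entries(handle):
    """Map each (lemma, part of speech) pair to its first observed level."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {(row["lemma"], row["pos"]): first_level(row) for row in reader}


entries = load_entries(io.StringIO(SAMPLE))
for (lemma, pos), level in entries.items():
    print(f"{lemma} ({pos}): first observed at {level}")
```

To run this on a real file, replace `io.StringIO(SAMPLE)` with an opened file handle and adjust the column names to match the file. For the extended version, the dictionary key would presumably also need the sense number or synset to keep entries distinct.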