WALS RoBERTa Sets 37-70.zip (May 2026)
Obligatory possessive inflection (58A) and possessive classification (59A).
Testing whether models such as RoBERTa or XLM-RoBERTa have "learned" the typological rules of specific languages during pre-training.
Position of tense-aspect affixes (69A) and the morphological imperative (70A).

Use Cases for the Dataset
For more information on the specific data points, you can explore the Official WALS Features List or the WALS-Bench dataset on Hugging Face.
Using WALS database features as labels to check whether a model's internal representations (embeddings) cluster according to known linguistic traits, such as whether a language uses definite articles.
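One way to make the clustering check concrete is a nearest-centroid probe with leave-one-out evaluation. The sketch below is illustrative only: the language names, the three-dimensional vectors, and the two label strings are hypothetical stand-ins for mean-pooled model embeddings and a WALS definite-article value, and nothing here reflects the actual contents of the archive.

```python
import math

# Hypothetical toy data: language -> (embedding, WALS-style label).
# Real vectors would come from the model (e.g. mean-pooled hidden states);
# these made-up 3-d values only keep the sketch self-contained.
DATA = {
    "lang_a": ([0.9, 0.1, 0.0], "has_definite_article"),
    "lang_b": ([0.8, 0.2, 0.1], "has_definite_article"),
    "lang_c": ([0.7, 0.0, 0.2], "has_definite_article"),
    "lang_d": ([0.1, 0.9, 0.0], "no_definite_article"),
    "lang_e": ([0.2, 0.8, 0.1], "no_definite_article"),
    "lang_f": ([0.0, 0.7, 0.3], "no_definite_article"),
}


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def leave_one_out_accuracy(data):
    """Hold out each language, build label centroids from the rest,
    and predict the label whose centroid is most similar."""
    correct = 0
    for held_out, (vector, gold) in data.items():
        by_label = {}
        for name, (other_vector, label) in data.items():
            if name != held_out:
                by_label.setdefault(label, []).append(other_vector)
        predicted = max(
            by_label,
            key=lambda label: cosine(vector, centroid(by_label[label])),
        )
        correct += predicted == gold
    return correct / len(data)


accuracy = leave_one_out_accuracy(DATA)
print(f"leave-one-out probe accuracy: {accuracy:.2f}")
```

The toy vectors are separable by construction, so a high score here says nothing about a real model. With real embeddings, accuracy well above a majority-class baseline would suggest that the trait is recoverable from the representations, while accuracy near the baseline would suggest it is not, at least for this simple probe.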