Change the repository type filter
All
Repositories list
31 repositories
GlotLID
PublicLanguage Identification with Support for More Than 2000 Labels -- EMNLP 2023GlotWeb
PublicGlotWeb: Web Indexing for Low-Resource Languages -- under construction.GlotCC
PublicGlotCC Dataset and Pipline -- NeurIPS 2024oscar-io
Publicungoliant
Publicoscar-tools
Publiccisnlp.github.io
PublicMEXA
PublicMultilingual Evaluation of English-Centric LLMs via Cross-Lingual AlignmentLangSAMP
PublicLangSAMP: Language-Script Aware Multilingual Pretraininganalogical_reasoning
PublicTransliteration-PPA
PublicBreaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignmentlohoravens-webpage
PublicMaskLID
PublicMaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024GlotScript
PublicResource and Tool for Writing System Identification -- LREC 2024Taxi1500
PublicTransMI
PublicTransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated DataTransliCo
PublicTransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language ModelsSpatial_Schemas
PublicXAMPLER
PublicGlot500
PublicGlot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023GlotSparse
PublicGlotSparse: Building Corpora in Under-Resourced LanguagesGlotStoryBook
PublicChildren StoryBooks for 180 langauges.mPLM-Sim
PublicColexificationNet
PublicCrosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphsofa
Publicsimalign
Publicparcoure
Publicgraph-align
Publicbias-in-nlp
PublicLiterature overview: gender bias in natural language processing