HPLT/madlad-400-1.0-fin_Latn-llama-2b-100bt
2B
•
Updated
Web as a corpus, Large Language Models, Machine Translation, Language Technologies, Natural Language Processing, Internet Archive, CommonCrawl
OpenLID-v3: Improving the Precision of Closely Related Language Identification -- An Experience Report
DHPLT: large-scale multilingual diachronic corpora and word representations for semantic change modelling