A Multilingual Dataset and Model for Information Extraction from News Web Pages
ISPRAS Crawlers
community
AI & ML interests
Web scraping, web data extraction, information extraction
Organization Card
Research group at the Institute for System Programming of the Russian Academy of Sciences focused on web data collection.
models 5
ispras-crawlers/newsxlm-domlm-ae
Token Classification • 0.3B • Updated
• 22
ispras-crawlers/newsxlm-markuplm-en-ae
Token Classification • 0.1B • Updated
ispras-crawlers/newsxlm-xlmroberta-ae
Token Classification • 0.3B • Updated
ispras-crawlers/newsxlm-markuplm-ae
Token Classification • 0.1B • Updated
• 1
ispras-crawlers/newsxlm-domlm-pretrained
0.3B • Updated
• 1