Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 377 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 629 • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 840 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 38 • 5
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • Updated Jul 19, 2024 • 1.86k • • 4 izumi-lab/deberta-v2-small-japanese Fill-Mask • Updated Jul 19, 2024 • 9 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 127 • 5 izumi-lab/electra-base-japanese-discriminator Updated Dec 9, 2022 • 31 • 2
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 1.1k • 140 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 791 • 32
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 324 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • Updated Dec 9, 2022 • 48 • 2 izumi-lab/electra-small-japanese-fin-discriminator Updated Dec 9, 2022 • 25 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 44
Miscellaneous Text Datasets for Language Models izumi-lab/oscar2301-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 31.4M • 377 • 6 izumi-lab/mc4-ja Viewer • Updated Jul 29, 2023 • 87.4M • 629 • 6 izumi-lab/mc4-ja-filter-ja-normal Viewer • Updated Jul 29, 2023 • 52.6M • 840 • 5 izumi-lab/wikinews-ja-20230728 Viewer • Updated Jul 29, 2023 • 4.28k • 38 • 5
Japanese LoRA-tuned LLMs izumi-lab/stormy-7b-10ep Updated Jun 26, 2023 • 5 izumi-lab/llama-13b-japanese-lora-v0-1ep Updated May 23, 2023 • 11 izumi-lab/llama-7b-japanese-lora-v0-5ep Updated Jun 23, 2023 • 3 Paused 4 LLaMA 13B Japanese LoRA v0 1 epoch 🐨 4
Japanese General Pre-trained Language Models izumi-lab/deberta-v2-base-japanese Fill-Mask • Updated Jul 19, 2024 • 1.86k • • 4 izumi-lab/deberta-v2-small-japanese Fill-Mask • Updated Jul 19, 2024 • 9 izumi-lab/bert-small-japanese Fill-Mask • Updated Dec 9, 2022 • 127 • 5 izumi-lab/electra-base-japanese-discriminator Updated Dec 9, 2022 • 31 • 2
Japanese Financial Pre-trained Language Models izumi-lab/bert-base-japanese-fin-additional 0.1B • Updated Jun 16, 2025 • 324 • 3 izumi-lab/bert-small-japanese-fin Fill-Mask • Updated Dec 9, 2022 • 48 • 2 izumi-lab/electra-small-japanese-fin-discriminator Updated Dec 9, 2022 • 25 izumi-lab/electra-small-japanese-fin-generator Fill-Mask • 13.8M • Updated Oct 21, 2023 • 44
llm-japanese-dataset izumi-lab/llm-japanese-dataset Viewer • Updated Jan 18, 2024 • 9.07M • 1.1k • 140 izumi-lab/llm-japanese-dataset-vanilla Viewer • Updated Feb 17, 2024 • 2.49M • 791 • 32