To counter over-positivity in models.
minipasila
mpasila
AI & ML interests
LLM fine-tuning, TTS models, STT models, Multimodal stuff
Recent Activity
new activity
2 days ago
mpasila/orpheus-nano-Q8_0-GGUF:questions about model architecture new activity
6 days ago
LumiOpen/Llama-Poro-2-8B-Instruct:Suomen kielen osaaminen new activity
10 days ago
utter-project/EuroLLM-9B-Instruct-2512:Will there be new models after these? Organizations
None yet
Japanese2English datasets
For translating Japanese to English.
Finnish Instruct Datasets
Finnish dataset collection. It includes some datasets that are not just Finnish data but contains Finnish data.
Magnum used datasets
So I can more easily keep up with all the different datasets they've used, since apparently each model has different datasets used..
-
anthracite-org/Stheno-Data-Filtered
Viewer • Updated • 31.1k • 20 • 14 -
anthracite-org/kalo-opus-instruct-22k-no-refusal
Viewer • Updated • 22.3k • 316 • 33 -
anthracite-org/nopm_claude_writing_fixed
Viewer • Updated • 6.35k • 340 • 16 -
NewEden-Forge/Gryphe-3.5-16k-Subset
Viewer • Updated • 16k • 10 • 1
not very positive datasets
To counter over-positivity in models.
Finnish fine-tunes
All my Finnish fine-tuned models.
Japanese2English datasets
For translating Japanese to English.
ExLlamaV2 quantizations
All my EXL2 quants here.
Finnish Instruct Datasets
Finnish dataset collection. It includes some datasets that are not just Finnish data but contains Finnish data.
Pre-training dataset prep
Some datasets I should probably use.
Magnum used datasets
So I can more easily keep up with all the different datasets they've used, since apparently each model has different datasets used..
-
anthracite-org/Stheno-Data-Filtered
Viewer • Updated • 31.1k • 20 • 14 -
anthracite-org/kalo-opus-instruct-22k-no-refusal
Viewer • Updated • 22.3k • 316 • 33 -
anthracite-org/nopm_claude_writing_fixed
Viewer • Updated • 6.35k • 340 • 16 -
NewEden-Forge/Gryphe-3.5-16k-Subset
Viewer • Updated • 16k • 10 • 1