Interesting Datasets
A collection of datasets that I have come across.
LDJnr/Capybara • Updated Jun 7, 2024 • 16k • 1.51k • 248
teknium/openhermes • Updated Sep 7, 2023 • 243k • 1.1k • 219
VMware/open-instruct • Updated Jul 12, 2023 • 143k • 123 • 44
euclaise/WritingPrompts_preferences • Updated Dec 25, 2023 • 265k • 62 • 10

Requires Filtering
Datasets that have to be filtered before use.
Squish42/bluemoon-fandom-1-1-rp-cleaned • Updated Jul 9, 2023 • 138 • 78

Augmentable
A collection of datasets that should be augmented further with GPT-4.
allenai/ai2_arc • Updated Dec 21, 2023 • 7.79k • 319k • 313
codeparrot/apps • Updated Oct 20, 2022 • 15.7k • 200
facebook/belebele • Updated Aug 12, 2024 • 110k • 9.87k • 122
google/boolq • Updated Jan 22, 2024 • 12.7k • 30.5k • 98