tomg-group-umd/Gemstone-2048x27
Text Generation • 2B • Updated • 107
AI security & privacy, algorithmic bias, foundations of ML
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Gemstones: A Model Suite for Multi-Faceted Scaling Laws