7 122 42

Frank Sommers PRO

fsommers

fsommers

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted an article 3 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

upvoted a paper 15 days ago

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

View all activity

Organizations

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 169

upvoted an article 3 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

6 days ago

•

224

upvoted 2 papers 15 days ago

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published 18 days ago • 22

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

Paper • 2511.16528 • Published 17 days ago • 16

upvoted a collection 26 days ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 488

upvoted a paper about 1 month ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96

upvoted 2 papers about 2 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 103

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

upvoted 2 articles 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

253

Article

ModernVBERT: Towards Smaller Visual Document Retrievers

Oct 3

•

upvoted 2 collections 2 months ago

ModernVBERT

Collection

Resources for ModernVBERT • 5 items • Updated Oct 3 • 11

ColModernVBERT

Collection

Resources for ColModernVBERT – the document retrieval–optimized variant of ModernVBERT • 5 items • Updated Oct 3 • 7

upvoted a paper 2 months ago

ModernVBERT: Towards Smaller Visual Document Retrievers

Paper • 2510.01149 • Published Oct 1 • 30

upvoted 2 articles 3 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark 🏅

Jul 1, 2024

•

Article

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Sep 10

•

108

upvoted a paper 3 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9 • 83

upvoted 3 articles 3 months ago

Article

mmBERT: ModernBERT goes Multilingual

Sep 9

•

128

Article

Theoretical Limitations of Embedding Models and Their Applications in Turkish: An In-Depth Look

Sep 4

•

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

May 23

•

170

upvoted a paper 3 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 208

Frank Sommers PRO

AI & ML interests

Recent Activity

Organizations

fsommers's activity

Transformers v5: Simple model definitions powering the AI ecosystem

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

ModernVBERT: Towards Smaller Visual Document Retrievers

Our Transformers Code Agent beats the GAIA benchmark 🏅

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

mmBERT: ModernBERT goes Multilingual

Theoretical Limitations of Embedding Models and Their Applications in Turkish: An In-Depth Look

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code