nithin

nithin12342

AI & ML interests

Aspiring Software Developer | 🌐 Full-Stack Explorer | 🤖 AI Engineer-in-Progress Passionate about building scalable apps, automating workflows, and craft

Recent Activity

upvoted an article about 15 hours ago

Transformers.js v4 Preview: Now Available on NPM!

updated a collection 3 days ago

My notification

updated a collection 5 days ago

My notification

View all activity

Organizations

upvoted an article about 15 hours ago

Article

Transformers.js v4 Preview: Now Available on NPM!

2 days ago

•

upvoted a paper 5 days ago

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Paper • 2601.20354 • Published 14 days ago • 110

upvoted an article 5 days ago

Article

Training Design for Text-to-Image Models: Lessons from Ablations

7 days ago

•

upvoted 4 papers 6 days ago

upvoted 8 papers 7 days ago

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published 13 days ago • 7

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published 8 days ago • 124

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published 8 days ago • 31

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 12 days ago • 33

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published 8 days ago • 41

FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

Paper • 2602.02092 • Published 8 days ago • 18

FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents

Paper • 2602.01566 • Published 9 days ago • 45

ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

Paper • 2601.23184 • Published 11 days ago • 35

upvoted a paper 8 days ago

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

Paper • 2601.22904 • Published 11 days ago • 15

upvoted 3 papers 10 days ago

SERA: Soft-Verified Efficient Repository Agents

Paper • 2601.20789 • Published 13 days ago • 11

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 13 days ago • 42

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published 15 days ago • 7

upvoted a paper 11 days ago

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Paper • 2601.21406 • Published 13 days ago • 4

nithin

AI & ML interests

Recent Activity

Organizations

nithin12342's activity

Transformers.js v4 Preview: Now Available on NPM!

Training Design for Text-to-Image Models: Lessons from Ablations