In a Training Loop 🔄

ouasdg

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

Xenova/the-tokenizer-playground

liked a model 15 days ago

speechbrain/spkrec-ecapa-voxceleb

upvoted a paper 15 days ago

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

upvoted a paper 15 days ago

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published 20 days ago • 23

upvoted a paper 20 days ago

End-to-End Video Character Replacement without Structural Guidance

Paper • 2601.08587 • Published 23 days ago • 8

upvoted 2 papers about 1 month ago

Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion

Paper • 2512.23709 • Published Dec 29, 2025 • 49

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published Dec 19, 2025 • 97

upvoted 3 collections about 2 months ago

Openly licensed large image datasets

Openly licensed dataset with allowed commercial usage • 3 items • Updated Jul 1, 2024 • 1

sam-audio

11 items • Updated Dec 16, 2025 • 126

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Jul 10, 2025 • 63

upvoted 2 papers 4 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 51

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 187

upvoted a paper 8 months ago

Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes

Paper • 2506.00227 • Published May 30, 2025 • 12

upvoted a collection 11 months ago

NVILA-Speech-Audio-Setups

2 items • Updated about 15 hours ago • 5