CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production Paper • 2603.01973 • Published 4 days ago • 6
Adaptive Nonlinear Vector Autoregression: Robust Forecasting for Noisy Chaotic Time Series Paper • 2507.08738 • Published Jul 11, 2025 • 1
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 39
Prompt reinforcing for long-term planning of large language models Paper • 2510.05921 • Published Oct 7, 2025
Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation Paper • 2507.01594 • Published Jul 2, 2025
Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling Paper • 2412.12955 • Published Dec 17, 2024
A Confidence-based Acquisition Model for Self-supervised Active Learning and Label Correction Paper • 2310.08944 • Published Oct 13, 2023
Post-Training Large Language Models via Reinforcement Learning from Self-Feedback Paper • 2507.21931 • Published Jul 29, 2025
Less is More: Local Intrinsic Dimensions of Contextual Language Models Paper • 2506.01034 • Published Jun 1, 2025
Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction Paper • 2408.03706 • Published Aug 7, 2024
Dialogue Term Extraction using Transfer Learning and Topological Data Analysis Paper • 2208.10448 • Published Aug 22, 2022
EmbeddingGemma: Powerful and Lightweight Text Representations Paper • 2509.20354 • Published Sep 24, 2025 • 48
DiffusionNFT: Online Diffusion Reinforcement with Forward Process Paper • 2509.16117 • Published Sep 19, 2025 • 22
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26, 2025 • 77
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4, 2025 • 22