NitroGen: An Open Foundation Model for Generalist Gaming Agents
Paper • 2601.02427 • Published
Note: Vision-action model.
- Trained on gameplay footage that shows the controller overlay together with its inputs; not reinforcement learning.
- Controller extraction uses SIFT to detect the controller layout (zero-shot, compared against YOLO).
- Synthetic data depicting controllers with pressed buttons is generated to supervise input detection.
- SeqFormer is trained on these generated images to learn to detect the pressed buttons.
- A diffusion model based on https://arxiv.org/abs/2503.14734 generates the inputs, using game images as conditioning variables for the denoising.
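The SIFT-based controller detection step could be sketched as standard feature matching plus a homography, as in OpenCV. This is a minimal illustration, not the paper's pipeline: the template image, the 0.75 ratio test, and the RANSAC threshold are all assumptions.

```python
import cv2
import numpy as np

def locate_controller(frame, template, min_matches=10):
    """Locate a controller-overlay template inside a frame via SIFT matching.

    Returns the template's four corner positions in the frame, or None.
    (Illustrative sketch; parameters are assumptions, not the paper's values.)
    """
    sift = cv2.SIFT_create()
    kp_t, des_t = sift.detectAndCompute(template, None)
    kp_f, des_f = sift.detectAndCompute(frame, None)
    if des_t is None or des_f is None:
        return None

    # Lowe's ratio test over 2-NN matches to keep distinctive correspondences.
    matches = cv2.BFMatcher().knnMatch(des_t, des_f, k=2)
    good = [p[0] for p in matches if len(p) == 2 and p[0].distance < 0.75 * p[1].distance]
    if len(good) < min_matches:
        return None

    src = np.float32([kp_t[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp_f[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    if H is None:
        return None

    # Map the template's corners into frame coordinates.
    h, w = template.shape[:2]
    corners = np.float32([[0, 0], [w, 0], [w, h], [0, h]]).reshape(-1, 1, 2)
    return cv2.perspectiveTransform(corners, H).reshape(-1, 2)
```

Because it matches a known template rather than learning classes, this approach needs no labeled training set, which is presumably why the note calls it zero-shot relative to YOLO.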
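The last step, generating inputs with an image-conditioned diffusion model, can be illustrated with a toy DDPM-style reverse loop. Everything here is a stand-in: the linear noise schedule, the 8-dimensional "action" vector, and the `denoiser` (a linear map over the noisy sample concatenated with a frame embedding) are hypothetical placeholders for the model in the referenced paper.

```python
import numpy as np

# Toy linear beta schedule (assumed values, not from the paper).
T = 50
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

def denoiser(x_t, t, frame_embedding, W):
    """Stand-in epsilon-predictor: a linear map over [x_t, conditioning].

    In the real model this would be a network conditioned on game frames.
    """
    return W @ np.concatenate([x_t, frame_embedding])

def sample_action(frame_embedding, W, dim=8, seed=0):
    """Run the DDPM reverse process to sample an action vector,
    conditioning every denoising step on the frame embedding."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(dim)
    for t in reversed(range(T)):
        eps = denoiser(x, t, frame_embedding, W)
        a, ab = alphas[t], alpha_bars[t]
        # Standard DDPM posterior mean update.
        x = (x - (1.0 - a) / np.sqrt(1.0 - ab) * eps) / np.sqrt(a)
        if t > 0:
            x = x + np.sqrt(betas[t]) * rng.standard_normal(dim)
    return x
```

The key point the note makes is that the image enters as a conditioning variable of the denoiser at every step, so the sampled inputs depend on what is on screen.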