text -> image Multi-LoRA Composition for Image Generation Paper • 2402.16843 • Published Feb 26, 2024 • 31
为网格生成UV纹理贴图 Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Paper • 2312.13913 • Published Dec 21, 2023 • 24
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Paper • 2312.13913 • Published Dec 21, 2023 • 24
robot RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Paper • 2311.01455 • Published Nov 2, 2023 • 30
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Paper • 2311.01455 • Published Nov 2, 2023 • 30
benchmark GAIA: a benchmark for General AI Assistants Paper • 2311.12983 • Published Nov 21, 2023 • 243
RLHF Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 28
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 28
MLLMs Merlin:Empowering Multimodal LLMs with Foresight Minds Paper • 2312.00589 • Published Nov 30, 2023 • 27
Merlin:Empowering Multimodal LLMs with Foresight Minds Paper • 2312.00589 • Published Nov 30, 2023 • 27
not read Rethinking Patch Dependence for Masked Autoencoders Paper • 2401.14391 • Published Jan 25, 2024 • 26
image -> 3D ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 8
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 8
VR/AR VR-NeRF: High-Fidelity Virtualized Walkable Spaces Paper • 2311.02542 • Published Nov 5, 2023 • 19
Text-to-Video FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Paper • 2311.13073 • Published Nov 22, 2023 • 58 Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 88
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Paper • 2311.13073 • Published Nov 22, 2023 • 58
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 88
CV Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
text -> image Multi-LoRA Composition for Image Generation Paper • 2402.16843 • Published Feb 26, 2024 • 31
image -> 3D ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 8
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 8
为网格生成UV纹理贴图 Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Paper • 2312.13913 • Published Dec 21, 2023 • 24
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Paper • 2312.13913 • Published Dec 21, 2023 • 24
robot RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Paper • 2311.01455 • Published Nov 2, 2023 • 30
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Paper • 2311.01455 • Published Nov 2, 2023 • 30
VR/AR VR-NeRF: High-Fidelity Virtualized Walkable Spaces Paper • 2311.02542 • Published Nov 5, 2023 • 19
benchmark GAIA: a benchmark for General AI Assistants Paper • 2311.12983 • Published Nov 21, 2023 • 243
Text-to-Video FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Paper • 2311.13073 • Published Nov 22, 2023 • 58 Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 88
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Paper • 2311.13073 • Published Nov 22, 2023 • 58
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 88
RLHF Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 28
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 28
CV Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
MLLMs Merlin:Empowering Multimodal LLMs with Foresight Minds Paper • 2312.00589 • Published Nov 30, 2023 • 27
Merlin:Empowering Multimodal LLMs with Foresight Minds Paper • 2312.00589 • Published Nov 30, 2023 • 27
not read Rethinking Patch Dependence for Masked Autoencoders Paper • 2401.14391 • Published Jan 25, 2024 • 26