Browse and export your curated research paper collection
Unknown authors 11/27/2025 huggingface
machine learningMulti-agent systems (MAS) extend large language models (LLMs) from independent single-model reasoning to coordinative system-level intelligence. While existing LLM agents depend on text-based mediation for reasoning and communication, we take a step forward by enabling models to collaborate directly...
Unknown authors 11/27/2025 huggingface
machine learning"Thinking with images" has emerged as an effective paradigm for advancing visual reasoning, extending beyond text-only chains of thought by injecting visual evidence into intermediate reasoning steps. However, existing methods fall short of human-like visual thinking, as their flexibility is fundame...
Unknown authors 11/27/2025 huggingface
machine learningWe investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whether training on easier or harder data leads to better results, and whether those gains come on easie...
Unknown authors 11/27/2025 huggingface
computer visionWorld models serve as core simulators for fields such as agentic AI, embodied AI, and gaming, capable of generating long, physically realistic, and interactive high-quality videos. Moreover, scaling these models could unlock emergent capabilities in visual perception, understanding, and reasoning, p...
Unknown authors 11/27/2025 huggingface
generative modelsWe propose Terminal Velocity Matching (TVM), a generalization of flow matching that enables high-fidelity one- and few-step generative modeling. TVM models the transition between any two diffusion timesteps and regularizes its behavior at its terminal time rather than at the initial time. We prove t...
Unknown authors 11/27/2025 huggingface
computer visionDespite 3D Gaussian Splatting (3DGS) excelling in most configurations, it lacks generalization across novel viewpoints in a few-shot scenario because it overfits to the sparse observations. We revisit 3DGS optimization from a machine learning perspective, framing novel view synthesis as a generaliza...
Unknown authors 11/27/2025 huggingface
roboticsGrounding natural-language instructions into continuous control for quadruped robots remains a fundamental challenge in vision language action. Existing methods struggle to bridge high-level semantic reasoning and low-level actuation, leading to unstable grounding and weak generalization in the real...
Unknown authors 11/27/2025 huggingface
machine learningThis paper presents research on nvidia, nemotron, parse. The full abstract is not available at this time. Please visit the paper's website for complete details about the methodology, results, and contributions.
Preparing your export...