Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing
Efficient training-free multi-token prediction via embedding-space probing, improving LLaMA3 acceptance length by 12%.
Raghavv Goel, Mukul Gagrani, Mingu Lee et al.
Efficient training-free multi-token prediction via embedding-space probing, improving LLaMA3 acceptance length by 12%.
Raghavv Goel, Mukul Gagrani, Mingu Lee et al.
RAMP uses reinforcement learning for adaptive mixed-precision quantization, achieving 6% size and 1-3% quality improvements for on-device LLM inference.
Arpit Singh Gautam, Saurabh Jha
Adaptive Domain Models leverage Bayesian distillation and warm rotation for efficient training in geometric and neuromorphic AI.
Houston Haynes
LLMs as semantic interfaces and ethical mediators in neuro-digital ecosystems, introducing Neuro-Linguistic Integration.
Alexander V. Shenderuk-Zhidkov, Alexander E. Hramov
A synthesizable RTL architecture for predictive coding networks, supporting local prediction-error dynamics, executed directly in hardware.
Timothy Oh
Utilizing a Quadratic Surrogate Attractor to enhance Particle Swarm Optimization's global convergence and robustness.
Maurizio Clemente, Marcello Canova
Optimization-embedded active multi-fidelity surrogate learning for airfoil shape optimization improves cruise efficiency by 41.05% and take-off lift by 20.75%.
Isaac Robledo, Alberto Vilariño, Arnau Miró et al.
Attractor-Keyed Memory merges selection and memory access, reducing latency and energy in sparse routing architectures.
Natalia G. Berloff
Video models exhibit reasoning via Chain-of-Steps mechanism during diffusion denoising steps.
Ruisi Wang, Zhongang Cai, Fanyi Pu et al.
MessyKitchens achieves high-precision monocular 3D scene reconstruction using the MOD algorithm, significantly enhancing the physical plausibility of inter-object contacts.
Junaid Ahmed Ansari, Ran Ding, Fabio Pizzati et al.
Efficient reasoning in small LLMs using LoRA adapters and RL, significantly reducing response length.
Yelysei Bondarenko, Thomas Hehn, Rob Hesselink et al.
SegviGen repurposes 3D generative models for part segmentation, achieving a 40% improvement in interactive segmentation using only 0.32% labeled data.
Lin Li, Haoran Feng, Zehuan Huang et al.
ManiTwin generates 100K high-quality 3D digital assets from a single image for large-scale robotic manipulation data generation.
Kaixuan Wang, Tianxing Chen, Jiawei Liu et al.
BrickSim is a physics-based simulator for real-time simulation of brick assemblies, achieving 100% accuracy.
Haowei Wen, Ruixuan Liu, Weiyi Piao et al.
M^3 integrates multi-view foundation models with monocular Gaussian splatting SLAM, reducing ATE RMSE by 64.3%.
Kerui Ren, Guanghao Li, Changjian Jiang et al.
LEAFE framework internalizes recovery agency from reflective experience, enhancing Pass@k performance in long-horizon tasks.
Rui Ge, Yichao Fu, Yuyang Qian et al.
Achieved superior drone interception using PPO-based competitive reinforcement learning with high catch rates.
Timothée Gavin, Simon Lacroix, Murat Bronz
PUMA model improves success rate by 6.3% in dynamic environments using historical optical flow and world queries.
Heng Fang, Shangru Li, Shuhan Wang et al.
Mixture-of-Depths Attention (MoDA) improves downstream task performance by 2.11% on a 1.5B-parameter model with only a 3.7% increase in FLOPs.
Lianghui Zhu, Yuxin Fang, Bencheng Liao et al.
HorizonMath evaluates AI progress in mathematical discovery using an automated verification framework, with GPT 5.4 Pro achieving breakthroughs on two problems.
Erik Y. Wang, Sumeet Motwani, James V. Roggeveen et al.