Two University of Iowa engineers have won funding from the National Science Foundation to develop a theory that would improve ...
Quantum key distribution (QKD) is an emerging communication technology that utilizes quantum mechanics principles to ensure ...
Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025) ...
Abstract: Learning-based image compression algorithms typically focus on designing encoding and decoding networks and improving the accuracy of entropy model estimation to enhance the rate-distortion ...
We are excited to release the CapRL 2.0 series: CapRL-Qwen3VL-2B and CapRL-Qwen3VL-4B. These models feature fewer parameters while delivering even more powerful captioning performance. Notably, ...
Today's AI agents don't meet the definition of true agents. Key missing elements are reinforcement learning and complex memory. It will take at least five years to get AI agents where they need to be.
Abstract: Adaptive control policies for robots often require balancing generalization from large offline datasets with efficient adaptation to specific deployment conditions. In this paper, we propose ...
We propose TraceRL, a trajectory-aware reinforcement learning method for diffusion language models, which demonstrates the best performance among RL approaches for DLMs. We also introduce a ...