verl is a flexible, efficient and production-ready RL training library for large language models (LLMs). verl is the open-source version of HybridFlow: A Flexible and Efficient RLHF Framework paper.
Overview: Reinforcement learning in 2025 is more practical than ever, with Python libraries evolving to support real-world simulations, robotics, and deci ...
Abstract: A mixed traffic environment of manual driving and automatic driving will become the norm in future intelligent transportation systems. The deep reinforcement learning (DRL) method has shown ...
This is the official repository of the paper "TabM: Advancing Tabular Deep Learning With Parameter-Efficient Ensembling". It consists of two parts: One dot represents a performance score on one ...
AI agents are reshaping software development, from writing code to carrying out complex instructions. Yet LLM-based agents are prone to errors and often perform poorly on complicated, multi-step tasks ...
Abstract: Magnetic helical microrobots have great potential in biomedical applications due to their ability to access confined and enclosed environments via remote manipulation by magnetic fields.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results