ManeuverNet: A Soft Actor-Critic Framework for Precise Maneuvering of Double-Ackermann-Steering Robots with Optimized Reward Functions

Feb 16, 2026
Via arXiv

ROSA: Roundabout Optimized Speed Advisory with Multi-Agent Trajectory Prediction in Multimodal Traffic

Feb 16, 2026
Via arXiv

Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning

Feb 15, 2026
Via arXiv

Predicting New Concept-Object Associations in Astronomy by Mining the Literature

Feb 15, 2026
Via arXiv

Conformal Signal Temporal Logic for Robust Reinforcement Learning Control: A Case Study

Feb 15, 2026
Via arXiv

Floe: Federated Specialization for Real-Time LLM-SLM Inference

Feb 15, 2026
Via arXiv

Machine Learning as a Tool (MLAT): A Framework for Integrating Statistical ML Models as Callable Tools within LLM Agent Workflows

Feb 15, 2026
Via arXiv

MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking

Feb 15, 2026
Via arXiv

Fast Compute for ML Optimization

Feb 15, 2026
Via arXiv

A Rational Analysis of the Effects of Sycophantic AI

Feb 15, 2026
Via arXiv