Picture for Alvaro Velasquez

Alvaro Velasquez

Consensus-based Decentralized Multi-agent Reinforcement Learning for Random Access Network Optimization

Add code
Aug 09, 2025
Viaarxiv icon

Foundation Models for Logistics: Toward Certifiable, Conversational Planning Interfaces

Add code
Jul 15, 2025
Viaarxiv icon

Efficient Neuro-Symbolic Retrieval-Augmented Generation through Adaptive Query Routing

Add code
Jun 15, 2025
Viaarxiv icon

TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision

Add code
Jun 11, 2025
Viaarxiv icon

Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut

Add code
May 19, 2025
Viaarxiv icon

From Abstraction to Reality: DARPA's Vision for Robust Sim-to-Real Autonomy

Add code
Mar 14, 2025
Viaarxiv icon

A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models

Add code
Feb 18, 2025
Figure 1 for A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models
Figure 2 for A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models
Figure 3 for A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models
Figure 4 for A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models
Viaarxiv icon

ANSR-DT: An Adaptive Neuro-Symbolic Learning and Reasoning Framework for Digital Twins

Add code
Jan 15, 2025
Viaarxiv icon

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Add code
Nov 27, 2024
Figure 1 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 2 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 3 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 4 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Viaarxiv icon