Picture for Yaqi Xie

Yaqi Xie

DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management

Add code
Mar 20, 2026
Viaarxiv icon

Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory

Add code
Mar 16, 2026
Viaarxiv icon

pySpatial: Generating 3D Visual Programs for Zero-Shot Spatial Reasoning

Add code
Mar 01, 2026
Viaarxiv icon

Unifying Deep Predicate Invention with Pre-trained Foundation Models

Add code
Dec 19, 2025
Viaarxiv icon

Adaptively Coordinating with Novel Partners via Learned Latent Strategies

Add code
Nov 16, 2025
Viaarxiv icon

Thought Communication in Multiagent Collaboration

Add code
Oct 23, 2025
Viaarxiv icon

Multi-level Advantage Credit Assignment for Cooperative Multi-Agent Reinforcement Learning

Add code
Aug 09, 2025
Viaarxiv icon

ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models

Add code
Jul 01, 2025
Viaarxiv icon

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

Add code
May 23, 2025
Viaarxiv icon