Picture for Baolong Bi

Baolong Bi

FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning

Add code
Feb 26, 2026
Viaarxiv icon

PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding

Add code
Feb 24, 2026
Viaarxiv icon

AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning

Add code
Feb 14, 2026
Viaarxiv icon

HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-BENCH

Add code
Jan 28, 2026
Viaarxiv icon

Gated Differentiable Working Memory for Long-Context Language Modeling

Add code
Jan 19, 2026
Viaarxiv icon

Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning

Add code
Nov 18, 2025
Figure 1 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 2 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 3 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Figure 4 for Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Viaarxiv icon

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

Add code
Oct 14, 2025
Viaarxiv icon

A Survey of Vibe Coding with Large Language Models

Add code
Oct 14, 2025
Viaarxiv icon

A Survey of Context Engineering for Large Language Models

Add code
Jul 17, 2025
Viaarxiv icon

Rethinking All Evidence: Enhancing Trustworthy Retrieval-Augmented Generation via Conflict-Driven Summarization

Add code
Jul 02, 2025
Viaarxiv icon