Picture for Jiahao Qiu

Jiahao Qiu

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

Add code
May 21, 2025
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Viaarxiv icon

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Add code
Apr 13, 2025
Viaarxiv icon

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Add code
Mar 31, 2025
Viaarxiv icon

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Add code
Mar 27, 2025
Viaarxiv icon

Temporal Consistency for LLM Reasoning Process Error Identification

Add code
Mar 18, 2025
Viaarxiv icon

Fast Best-of-N Decoding via Speculative Rejection

Add code
Oct 26, 2024
Viaarxiv icon

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Add code
Oct 18, 2024
Figure 1 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 2 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 3 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 4 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Viaarxiv icon