Bowen Jin

COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization

Oct 08, 2025
Via arXiv

GRACE: Generative Representation Learning via Contrastive Policy Optimization

Oct 06, 2025
Via arXiv

Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

May 25, 2025
Via arXiv

Hybrid Latent Reasoning via Reinforcement Learning

May 24, 2025
Via arXiv

An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents

May 21, 2025
Via arXiv

LLM-Based Compact Reranking with Document Features for Scientific Retrieval

May 19, 2025
Via arXiv

mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model

May 18, 2025
Via arXiv

Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents

May 17, 2025
Via arXiv

RM-R1: Reward Modeling as Reasoning

May 05, 2025
Via arXiv

OTC: Optimal Tool Calls via Reinforcement Learning

Apr 21, 2025
Via arXiv