Picture for Shi Feng

Shi Feng

DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference

Add code
Jan 26, 2026
Viaarxiv icon

Self-Improvement as Coherence Optimization: A Theoretical Account

Add code
Jan 20, 2026
Viaarxiv icon

ES4R: Speech Encoding Based on Prepositive Affective Modeling for Empathetic Response Generation

Add code
Jan 16, 2026
Viaarxiv icon

High-Rank Structured Modulation for Parameter-Efficient Fine-Tuning

Add code
Jan 12, 2026
Viaarxiv icon

SAD: A Large-Scale Strategic Argumentative Dialogue Dataset

Add code
Jan 12, 2026
Viaarxiv icon

PlaM: Training-Free Plateau-Guided Model Merging for Better Visual Grounding in MLLMs

Add code
Jan 12, 2026
Viaarxiv icon

CIRAG: Construction-Integration Retrieval and Adaptive Generation for Multi-hop Question Answering

Add code
Jan 11, 2026
Viaarxiv icon

SAFE-QAQ: End-to-End Slow-Thinking Audio-Text Fraud Detection via Reinforcement Learning

Add code
Jan 04, 2026
Viaarxiv icon

Emergent Persuasion: Will LLMs Persuade Without Being Prompted?

Add code
Dec 20, 2025
Viaarxiv icon

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

Add code
Sep 26, 2025
Viaarxiv icon