Yujie Zhang

ASAP: An Azimuth-Priority Strip-Based Search Approach to Planar Microphone Array DOA Estimation in 3D

Apr 28, 2026
Via arXiv

Wired for Overconfidence: A Mechanistic Perspective on Inflated Verbalized Confidence in LLMs

Apr 01, 2026
Via arXiv

EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling

Mar 30, 2026
Via arXiv

MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling

Feb 11, 2026
Via arXiv

Angular Sensing by Highly Reconfigurable Pixel Antennas with Joint Radiating Aperture and Feeding Ports Reconfiguration

Jan 19, 2026
Via arXiv

RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations

Dec 30, 2025
Via arXiv

Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection

Dec 18, 2025
Via arXiv

Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis

Dec 16, 2025
Via arXiv

Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark

Oct 01, 2025
Via arXiv

LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

May 26, 2025
Via arXiv