Picture for Ji Zhang

Ji Zhang

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

Add code
May 27, 2025
Viaarxiv icon

Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration

Add code
May 27, 2025
Viaarxiv icon

LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

Add code
May 26, 2025
Figure 1 for LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation
Figure 2 for LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation
Figure 3 for LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation
Figure 4 for LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation
Viaarxiv icon

QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization

Add code
May 23, 2025
Viaarxiv icon

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

VLM-R$^3$: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Add code
May 22, 2025
Viaarxiv icon

Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation

Add code
May 21, 2025
Viaarxiv icon

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

Add code
May 20, 2025
Viaarxiv icon

Policy Contrastive Decoding for Robotic Foundation Models

Add code
May 19, 2025
Viaarxiv icon

SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization

Add code
May 16, 2025
Viaarxiv icon