Picture for Long Xing

Long Xing

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Add code
Jun 08, 2026
Viaarxiv icon

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

Add code
May 19, 2026
Viaarxiv icon

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Add code
May 11, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Add code
Mar 12, 2026
Viaarxiv icon

LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors

Add code
Nov 10, 2025
Figure 1 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 2 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 3 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 4 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Viaarxiv icon

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Add code
Oct 31, 2025
Viaarxiv icon

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Add code
Jun 24, 2025
Figure 1 for ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
Figure 2 for ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
Figure 3 for ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
Figure 4 for ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
Viaarxiv icon

MiniMax-01: Scaling Foundation Models with Lightning Attention

Add code
Jan 14, 2025
Viaarxiv icon