Yuchen Zhou

RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning

Oct 09, 2025

ORIC: Benchmarking Object Recognition in Incongruous Context for Large Vision-Language Models

Sep 19, 2025

Careful Queries, Credible Results: Teaching RAG Models Advanced Web Search Tools with Reinforcement Learning

Aug 11, 2025

Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning

Aug 01, 2025

DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation

Jul 31, 2025

Self-Supervised Contrastive Graph Clustering Network via Structural Information Fusion

Aug 08, 2024

Point-SAM: Promptable 3D Segmentation Model for Point Clouds

Jun 25, 2024

Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition

May 16, 2024

Advancing Multimodal Medical Capabilities of Gemini

May 06, 2024

CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision

Feb 26, 2024