Yuchen Zhou

Careful Queries, Credible Results: Teaching RAG Models Advanced Web Search Tools with Reinforcement Learning

Aug 11, 2025

Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning

Aug 01, 2025

DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation

Jul 31, 2025

Self-Supervised Contrastive Graph Clustering Network via Structural Information Fusion

Aug 08, 2024

Point-SAM: Promptable 3D Segmentation Model for Point Clouds

Jun 25, 2024

Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition

May 16, 2024

Advancing Multimodal Medical Capabilities of Gemini

May 06, 2024

CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision

Feb 26, 2024

PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation

Dec 05, 2023

How Far Have We Gone in Vulnerability Detection Using Large Language Models

Nov 21, 2023