Picture for Yuran Wang

Yuran Wang

Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World

Add code
May 13, 2025
Viaarxiv icon

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Add code
Apr 26, 2025
Viaarxiv icon

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Add code
Mar 19, 2025
Viaarxiv icon

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

Add code
Mar 12, 2025
Viaarxiv icon

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Add code
Jan 26, 2025
Figure 1 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 2 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 3 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 4 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Viaarxiv icon

Baichuan-Omni-1.5 Technical Report

Add code
Jan 26, 2025
Viaarxiv icon

Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching

Add code
Nov 14, 2024
Figure 1 for Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
Figure 2 for Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
Figure 3 for Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
Figure 4 for Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
Viaarxiv icon

ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Add code
Oct 29, 2024
Viaarxiv icon

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Add code
Oct 15, 2024
Figure 1 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 2 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 3 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 4 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Viaarxiv icon

Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation

Add code
Sep 30, 2024
Viaarxiv icon