Picture for Tao Yu

Tao Yu

Non-Overlapping Placement of Macro Cells based on Reinforcement Learning in Chip Design

Add code
Jul 26, 2024
Viaarxiv icon

CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation

Add code
Jul 25, 2024
Figure 1 for CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Figure 2 for CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Figure 3 for CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Figure 4 for CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Viaarxiv icon

Rethinking Domain Adaptation and Generalization in the Era of CLIP

Add code
Jul 21, 2024
Figure 1 for Rethinking Domain Adaptation and Generalization in the Era of CLIP
Figure 2 for Rethinking Domain Adaptation and Generalization in the Era of CLIP
Figure 3 for Rethinking Domain Adaptation and Generalization in the Era of CLIP
Figure 4 for Rethinking Domain Adaptation and Generalization in the Era of CLIP
Viaarxiv icon

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Add code
Jul 16, 2024
Figure 1 for BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Figure 2 for BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Figure 3 for BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Figure 4 for BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Viaarxiv icon

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Add code
Jul 15, 2024
Figure 1 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 2 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 3 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 4 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Viaarxiv icon

HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models

Add code
Jun 03, 2024
Viaarxiv icon

Collage: Light-Weight Low-Precision Strategy for LLM Training

Add code
May 06, 2024
Figure 1 for Collage: Light-Weight Low-Precision Strategy for LLM Training
Figure 2 for Collage: Light-Weight Low-Precision Strategy for LLM Training
Figure 3 for Collage: Light-Weight Low-Precision Strategy for LLM Training
Figure 4 for Collage: Light-Weight Low-Precision Strategy for LLM Training
Viaarxiv icon

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Add code
Apr 11, 2024
Figure 1 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 2 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 3 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 4 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Viaarxiv icon

MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors

Add code
Mar 30, 2024
Figure 1 for MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
Figure 2 for MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
Figure 3 for MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
Figure 4 for MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
Viaarxiv icon

Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience

Add code
Mar 15, 2024
Figure 1 for Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience
Figure 2 for Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience
Figure 3 for Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience
Figure 4 for Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience
Viaarxiv icon