Picture for Yuqi Wang

Yuqi Wang

Monocular Occupancy Prediction for Scalable Indoor Scenes

Add code
Jul 16, 2024
Viaarxiv icon

Enhancing End-to-End Autonomous Driving with Latent World Model

Add code
Jun 12, 2024
Viaarxiv icon

Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images

Add code
Jun 11, 2024
Viaarxiv icon

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

Add code
May 06, 2024
Viaarxiv icon

Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Add code
May 03, 2024
Viaarxiv icon

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Add code
Apr 18, 2024
Viaarxiv icon

Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering

Add code
Apr 16, 2024
Figure 1 for Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Figure 2 for Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Figure 3 for Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Figure 4 for Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Viaarxiv icon

DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

Add code
Apr 14, 2024
Viaarxiv icon

Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science

Add code
Apr 07, 2024
Figure 1 for Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science
Figure 2 for Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science
Figure 3 for Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science
Figure 4 for Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science
Viaarxiv icon

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models

Add code
Mar 04, 2024
Figure 1 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 2 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 3 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Figure 4 for Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Viaarxiv icon