Picture for Kaicheng Yu

Kaicheng Yu

AutoLab, Westlake University

BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science

Add code
Jun 29, 2024
Viaarxiv icon

M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark

Add code
Jun 08, 2024
Viaarxiv icon

Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation

Add code
Jun 03, 2024
Figure 1 for Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Figure 2 for Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Figure 3 for Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Figure 4 for Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Viaarxiv icon

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis

Add code
Feb 27, 2024
Viaarxiv icon

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Add code
Dec 12, 2023
Figure 1 for OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Figure 2 for OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Figure 3 for OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Figure 4 for OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Viaarxiv icon

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

Add code
Sep 28, 2023
Figure 1 for BEVHeight++: Toward Robust Visual Centric 3D Object Detection
Figure 2 for BEVHeight++: Toward Robust Visual Centric 3D Object Detection
Figure 3 for BEVHeight++: Toward Robust Visual Centric 3D Object Detection
Figure 4 for BEVHeight++: Toward Robust Visual Centric 3D Object Detection
Viaarxiv icon

FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection

Add code
Sep 11, 2023
Figure 1 for FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection
Figure 2 for FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection
Figure 3 for FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection
Figure 4 for FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection
Viaarxiv icon

Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training

Add code
Aug 18, 2023
Figure 1 for Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Figure 2 for Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Figure 3 for Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Figure 4 for Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Viaarxiv icon

FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving

Add code
Aug 14, 2023
Figure 1 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 2 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 3 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 4 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Viaarxiv icon

BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout

Add code
Aug 07, 2023
Figure 1 for BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Figure 2 for BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Figure 3 for BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Figure 4 for BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Viaarxiv icon