Wei Li

Tsinghua University, Beijing, China

Autonomous Driving in Unstructured Environments: How Far Have We Come?

Oct 10, 2024

Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization

Oct 09, 2024

Video Instruction Tuning With Synthetic Data

Oct 03, 2024

MinerU: An Open-Source Solution for Precise Document Content Extraction

Sep 27, 2024

A Generalized Tensor Formulation for Hyperspectral Image Super-Resolution Under General Spatial Blurring

Sep 27, 2024

Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification

Sep 17, 2024

BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking

Aug 22, 2024

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Aug 20, 2024

Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning

Aug 18, 2024

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

Aug 16, 2024