Picture for Wei Li

Wei Li

Tsinghua University, Beijing, China

ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios

Add code
May 07, 2024
Figure 1 for ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Figure 2 for ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Figure 3 for ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Figure 4 for ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Viaarxiv icon

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Add code
Apr 29, 2024
Figure 1 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 2 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 3 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 4 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Viaarxiv icon

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models

Add code
Apr 29, 2024
Viaarxiv icon

BezierFormer: A Unified Architecture for 2D and 3D Lane Detection

Add code
Apr 25, 2024
Figure 1 for BezierFormer: A Unified Architecture for 2D and 3D Lane Detection
Figure 2 for BezierFormer: A Unified Architecture for 2D and 3D Lane Detection
Figure 3 for BezierFormer: A Unified Architecture for 2D and 3D Lane Detection
Figure 4 for BezierFormer: A Unified Architecture for 2D and 3D Lane Detection
Viaarxiv icon

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

Add code
Apr 16, 2024
Figure 1 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 2 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 3 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 4 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Viaarxiv icon

LIPT: Latency-aware Image Processing Transformer

Add code
Apr 09, 2024
Viaarxiv icon

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Add code
Apr 09, 2024
Figure 1 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 2 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 3 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Figure 4 for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Viaarxiv icon

Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution

Add code
Apr 03, 2024
Figure 1 for Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution
Figure 2 for Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution
Figure 3 for Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution
Figure 4 for Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution
Viaarxiv icon

Distilling Semantic Priors from SAM to Efficient Image Restoration Models

Add code
Apr 02, 2024
Figure 1 for Distilling Semantic Priors from SAM to Efficient Image Restoration Models
Figure 2 for Distilling Semantic Priors from SAM to Efficient Image Restoration Models
Figure 3 for Distilling Semantic Priors from SAM to Efficient Image Restoration Models
Figure 4 for Distilling Semantic Priors from SAM to Efficient Image Restoration Models
Viaarxiv icon

Make Continual Learning Stronger via C-Flat

Add code
Apr 01, 2024
Viaarxiv icon