Picture for Bin Li

Bin Li

Member, IEEE

Robot Skin with Touch and Bend Sensing using Electrical Impedance Tomography

Add code
Mar 17, 2025
Viaarxiv icon

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Add code
Mar 13, 2025
Figure 1 for Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Figure 2 for Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Figure 3 for Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Figure 4 for Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation
Viaarxiv icon

VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models

Add code
Mar 08, 2025
Figure 1 for VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models
Figure 2 for VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models
Figure 3 for VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models
Figure 4 for VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models
Viaarxiv icon

EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models

Add code
Mar 06, 2025
Viaarxiv icon

Small but Mighty: Enhancing Time Series Forecasting with Lightweight LLMs

Add code
Mar 05, 2025
Viaarxiv icon

DLF: Extreme Image Compression with Dual-generative Latent Fusion

Add code
Mar 03, 2025
Figure 1 for DLF: Extreme Image Compression with Dual-generative Latent Fusion
Figure 2 for DLF: Extreme Image Compression with Dual-generative Latent Fusion
Figure 3 for DLF: Extreme Image Compression with Dual-generative Latent Fusion
Figure 4 for DLF: Extreme Image Compression with Dual-generative Latent Fusion
Viaarxiv icon

Towards Practical Real-Time Neural Video Compression

Add code
Feb 28, 2025
Viaarxiv icon

ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models

Add code
Feb 27, 2025
Figure 1 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 2 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 3 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Figure 4 for ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models
Viaarxiv icon

Accurate and Scalable Graph Neural Networks via Message Invariance

Add code
Feb 27, 2025
Figure 1 for Accurate and Scalable Graph Neural Networks via Message Invariance
Figure 2 for Accurate and Scalable Graph Neural Networks via Message Invariance
Figure 3 for Accurate and Scalable Graph Neural Networks via Message Invariance
Figure 4 for Accurate and Scalable Graph Neural Networks via Message Invariance
Viaarxiv icon

Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System

Add code
Feb 27, 2025
Figure 1 for Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System
Figure 2 for Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System
Figure 3 for Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System
Figure 4 for Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System
Viaarxiv icon