Picture for Wei Chen

Wei Chen

Soochow University

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

Add code
Oct 08, 2024
Viaarxiv icon

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration

Add code
Oct 06, 2024
Figure 1 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 2 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 3 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 4 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Viaarxiv icon

Structural-Entropy-Based Sample Selection for Efficient and Effective Learning

Add code
Oct 03, 2024
Figure 1 for Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Figure 2 for Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Figure 3 for Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Figure 4 for Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Viaarxiv icon

Model-Based GNN Enabled Energy-Efficient Beamforming for Ultra-Dense Wireless Networks

Add code
Oct 03, 2024
Figure 1 for Model-Based GNN Enabled Energy-Efficient Beamforming for Ultra-Dense Wireless Networks
Figure 2 for Model-Based GNN Enabled Energy-Efficient Beamforming for Ultra-Dense Wireless Networks
Figure 3 for Model-Based GNN Enabled Energy-Efficient Beamforming for Ultra-Dense Wireless Networks
Figure 4 for Model-Based GNN Enabled Energy-Efficient Beamforming for Ultra-Dense Wireless Networks
Viaarxiv icon

GNN-Enabled Optimization of Placement and Transmission Design for UAV Communications

Add code
Oct 03, 2024
Figure 1 for GNN-Enabled Optimization of Placement and Transmission Design for UAV Communications
Figure 2 for GNN-Enabled Optimization of Placement and Transmission Design for UAV Communications
Figure 3 for GNN-Enabled Optimization of Placement and Transmission Design for UAV Communications
Figure 4 for GNN-Enabled Optimization of Placement and Transmission Design for UAV Communications
Viaarxiv icon

Generative Retrieval Meets Multi-Graded Relevance

Add code
Sep 27, 2024
Figure 1 for Generative Retrieval Meets Multi-Graded Relevance
Figure 2 for Generative Retrieval Meets Multi-Graded Relevance
Figure 3 for Generative Retrieval Meets Multi-Graded Relevance
Figure 4 for Generative Retrieval Meets Multi-Graded Relevance
Viaarxiv icon

Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR

Add code
Sep 18, 2024
Figure 1 for Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR
Figure 2 for Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR
Figure 3 for Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR
Figure 4 for Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR
Viaarxiv icon

RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice Conversion

Add code
Sep 10, 2024
Figure 1 for RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice Conversion
Figure 2 for RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice Conversion
Figure 3 for RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice Conversion
Figure 4 for RobustSVC: HuBERT-based Melody Extractor and Adversarial Learning for Robust Singing Voice Conversion
Viaarxiv icon

Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Add code
Sep 03, 2024
Figure 1 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 2 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 3 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Figure 4 for Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Viaarxiv icon