Picture for Long Chen

Long Chen

University of Kaiserslautern-Landau, MODE Collaboration

TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems

Add code
Jun 25, 2024
Figure 1 for TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems
Figure 2 for TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems
Figure 3 for TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems
Figure 4 for TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems
Viaarxiv icon

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Add code
Jun 15, 2024
Figure 1 for CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Figure 2 for CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Figure 3 for CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Figure 4 for CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Viaarxiv icon

MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding

Add code
Jun 15, 2024
Figure 1 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 2 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 3 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Figure 4 for MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Viaarxiv icon

CarLLaVA: Vision language models for camera-only closed-loop driving

Add code
Jun 14, 2024
Figure 1 for CarLLaVA: Vision language models for camera-only closed-loop driving
Figure 2 for CarLLaVA: Vision language models for camera-only closed-loop driving
Figure 3 for CarLLaVA: Vision language models for camera-only closed-loop driving
Figure 4 for CarLLaVA: Vision language models for camera-only closed-loop driving
Viaarxiv icon

DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning

Add code
Jun 13, 2024
Viaarxiv icon

Instruct Large Language Models to Drive like Humans

Add code
Jun 11, 2024
Figure 1 for Instruct Large Language Models to Drive like Humans
Figure 2 for Instruct Large Language Models to Drive like Humans
Figure 3 for Instruct Large Language Models to Drive like Humans
Figure 4 for Instruct Large Language Models to Drive like Humans
Viaarxiv icon

Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning

Add code
Jun 03, 2024
Figure 1 for Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning
Figure 2 for Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning
Figure 3 for Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning
Figure 4 for Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning
Viaarxiv icon

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

Add code
May 29, 2024
Viaarxiv icon

$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation

Add code
May 27, 2024
Figure 1 for $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Figure 2 for $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Figure 3 for $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Figure 4 for $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Viaarxiv icon

3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving

Add code
May 24, 2024
Figure 1 for 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
Figure 2 for 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
Figure 3 for 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
Figure 4 for 3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
Viaarxiv icon