Picture for Junjie Wang

Junjie Wang

Sid

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Add code
Jun 20, 2024
Figure 1 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 2 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 3 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 4 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Viaarxiv icon

HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Add code
Jun 17, 2024
Viaarxiv icon

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Add code
Jun 14, 2024
Viaarxiv icon

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Add code
Jun 11, 2024
Figure 1 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 2 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 3 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 4 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Viaarxiv icon

MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning

Add code
May 29, 2024
Figure 1 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 2 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 3 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 4 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Viaarxiv icon

OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision

Add code
May 28, 2024
Viaarxiv icon

VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing

Add code
Mar 05, 2024
Figure 1 for VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
Figure 2 for VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
Figure 3 for VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
Figure 4 for VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
Viaarxiv icon

Adversarial Testing for Visual Grounding via Image-Aware Property Reduction

Add code
Mar 02, 2024
Figure 1 for Adversarial Testing for Visual Grounding via Image-Aware Property Reduction
Figure 2 for Adversarial Testing for Visual Grounding via Image-Aware Property Reduction
Figure 3 for Adversarial Testing for Visual Grounding via Image-Aware Property Reduction
Figure 4 for Adversarial Testing for Visual Grounding via Image-Aware Property Reduction
Viaarxiv icon

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Add code
Feb 28, 2024
Viaarxiv icon

Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing

Add code
Feb 28, 2024
Viaarxiv icon