Picture for Junjie Wang

Junjie Wang

Jack

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Add code
Jun 28, 2024
Viaarxiv icon

TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

Add code
Jun 27, 2024
Viaarxiv icon

GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality

Add code
Jun 26, 2024
Viaarxiv icon

Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement

Add code
Jun 24, 2024
Viaarxiv icon

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

Add code
Jun 20, 2024
Figure 1 for Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Figure 2 for Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Figure 3 for Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Figure 4 for Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Viaarxiv icon

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Add code
Jun 20, 2024
Figure 1 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 2 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 3 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 4 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Viaarxiv icon

HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Add code
Jun 17, 2024
Figure 1 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 2 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 3 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 4 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Viaarxiv icon

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Add code
Jun 14, 2024
Viaarxiv icon

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Add code
Jun 11, 2024
Figure 1 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 2 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 3 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Figure 4 for Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
Viaarxiv icon

MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning

Add code
May 29, 2024
Viaarxiv icon