Picture for Longyue Wang

Longyue Wang

Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation

Add code
Aug 19, 2024
Figure 1 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 2 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 3 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 4 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Viaarxiv icon

LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

Add code
Jun 26, 2024
Figure 1 for LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Figure 2 for LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Figure 3 for LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Figure 4 for LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Viaarxiv icon

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Add code
Jun 17, 2024
Figure 1 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 2 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 3 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 4 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Viaarxiv icon

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

Add code
May 20, 2024
Viaarxiv icon

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Add code
May 18, 2024
Figure 1 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 2 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 3 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Figure 4 for Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Viaarxiv icon

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Add code
May 08, 2024
Viaarxiv icon

On the Information Redundancy in Non-Autoregressive Translation

Add code
May 04, 2024
Viaarxiv icon

Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model

Add code
Mar 20, 2024
Viaarxiv icon

Anchor-based Large Language Models

Add code
Feb 16, 2024
Viaarxiv icon

Benchmarking LLMs via Uncertainty Quantification

Add code
Jan 23, 2024
Figure 1 for Benchmarking LLMs via Uncertainty Quantification
Figure 2 for Benchmarking LLMs via Uncertainty Quantification
Figure 3 for Benchmarking LLMs via Uncertainty Quantification
Figure 4 for Benchmarking LLMs via Uncertainty Quantification
Viaarxiv icon