Picture for Yujiu Yang

Yujiu Yang

LoCa: Logit Calibration for Knowledge Distillation

Add code
Sep 07, 2024
Viaarxiv icon

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Add code
Sep 06, 2024
Viaarxiv icon

An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation

Add code
Jul 29, 2024
Viaarxiv icon

CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model

Add code
Jul 23, 2024
Viaarxiv icon

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Figure 1 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 2 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 3 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 4 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Viaarxiv icon

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Add code
Jun 28, 2024
Figure 1 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 2 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 3 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Figure 4 for ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
Viaarxiv icon

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

Add code
Jun 21, 2024
Figure 1 for Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance
Figure 2 for Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance
Figure 3 for Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance
Figure 4 for Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance
Viaarxiv icon

HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Add code
Jun 17, 2024
Figure 1 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 2 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 3 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Figure 4 for HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Viaarxiv icon

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Add code
Jun 14, 2024
Figure 1 for ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Figure 2 for ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Figure 3 for ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Figure 4 for ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Viaarxiv icon

MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models

Add code
Jun 11, 2024
Viaarxiv icon