Picture for Bo Li

Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation

Add code
Dec 10, 2024
Figure 1 for MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation
Figure 2 for MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation
Figure 3 for MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation
Figure 4 for MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation
Viaarxiv icon

Data Free Backdoor Attacks

Add code
Dec 09, 2024
Figure 1 for Data Free Backdoor Attacks
Figure 2 for Data Free Backdoor Attacks
Figure 3 for Data Free Backdoor Attacks
Figure 4 for Data Free Backdoor Attacks
Viaarxiv icon

SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations

Add code
Dec 09, 2024
Viaarxiv icon

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Add code
Dec 06, 2024
Figure 1 for MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Figure 2 for MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Figure 3 for MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Figure 4 for MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Viaarxiv icon

Composition of Experts: A Modular Compound AI System Leveraging Large Language Models

Add code
Dec 02, 2024
Figure 1 for Composition of Experts: A Modular Compound AI System Leveraging Large Language Models
Figure 2 for Composition of Experts: A Modular Compound AI System Leveraging Large Language Models
Figure 3 for Composition of Experts: A Modular Compound AI System Leveraging Large Language Models
Figure 4 for Composition of Experts: A Modular Compound AI System Leveraging Large Language Models
Viaarxiv icon

Learning Adaptive Lighting via Channel-Aware Guidance

Add code
Dec 02, 2024
Figure 1 for Learning Adaptive Lighting via Channel-Aware Guidance
Figure 2 for Learning Adaptive Lighting via Channel-Aware Guidance
Figure 3 for Learning Adaptive Lighting via Channel-Aware Guidance
Figure 4 for Learning Adaptive Lighting via Channel-Aware Guidance
Viaarxiv icon

Practical Performative Policy Learning with Strategic Agents

Add code
Dec 02, 2024
Figure 1 for Practical Performative Policy Learning with Strategic Agents
Figure 2 for Practical Performative Policy Learning with Strategic Agents
Figure 3 for Practical Performative Policy Learning with Strategic Agents
Figure 4 for Practical Performative Policy Learning with Strategic Agents
Viaarxiv icon

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation

Add code
Dec 02, 2024
Viaarxiv icon

DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams

Add code
Nov 26, 2024
Figure 1 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 2 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 3 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 4 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Viaarxiv icon

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Add code
Nov 22, 2024
Figure 1 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 2 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 3 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Figure 4 for Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Viaarxiv icon