Picture for Bowen Shi

Bowen Shi

Mosaic: Unlocking Long-Context Inference for Diffusion LLMs via Global Memory Planning and Dynamic Peak Taming

Add code
Jan 10, 2026
Viaarxiv icon

SAM Audio: Segment Anything in Audio

Add code
Dec 19, 2025
Figure 1 for SAM Audio: Segment Anything in Audio
Figure 2 for SAM Audio: Segment Anything in Audio
Figure 3 for SAM Audio: Segment Anything in Audio
Figure 4 for SAM Audio: Segment Anything in Audio
Viaarxiv icon

NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models

Add code
Sep 04, 2025
Viaarxiv icon

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Add code
Jul 28, 2025
Viaarxiv icon

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Add code
Feb 07, 2025
Figure 1 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 2 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 3 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 4 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Viaarxiv icon

Molecular Graph Contrastive Learning with Line Graph

Add code
Jan 15, 2025
Viaarxiv icon

Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation

Add code
Nov 07, 2024
Viaarxiv icon

MDCure: A Scalable Pipeline for Multi-Document Instruction-Following

Add code
Oct 30, 2024
Figure 1 for MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Figure 2 for MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Figure 3 for MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Figure 4 for MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Viaarxiv icon

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Add code
Oct 27, 2024
Figure 1 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 2 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 3 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Figure 4 for MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon