Picture for Jinlan Fu

Jinlan Fu

LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding

Add code
May 22, 2025
Viaarxiv icon

Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency

Add code
May 20, 2025
Viaarxiv icon

Rethinking Visual Layer Selection in Multimodal LLMs

Add code
Apr 30, 2025
Viaarxiv icon

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Add code
Apr 17, 2025
Viaarxiv icon

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Add code
Mar 13, 2025
Viaarxiv icon

Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

Add code
Mar 08, 2025
Viaarxiv icon

CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Add code
Jan 28, 2025
Viaarxiv icon

FlipAttack: Jailbreak LLMs via Flipping

Add code
Oct 02, 2024
Figure 1 for FlipAttack: Jailbreak LLMs via Flipping
Figure 2 for FlipAttack: Jailbreak LLMs via Flipping
Figure 3 for FlipAttack: Jailbreak LLMs via Flipping
Figure 4 for FlipAttack: Jailbreak LLMs via Flipping
Viaarxiv icon

Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism

Add code
Jul 24, 2024
Figure 1 for Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism
Figure 2 for Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism
Figure 3 for Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism
Figure 4 for Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism
Viaarxiv icon

Cross-Modality Safety Alignment

Add code
Jun 21, 2024
Viaarxiv icon