Yu Wang

School of Control and Computer Engineering, North China Electric Power University

Personalized Multimodal Large Language Models: A Survey

Dec 03, 2024
Via arXiv

Jailbreak Large Vision-Language Models Through Multi-Modal Linkage

Dec 03, 2024
Via arXiv

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models

Nov 27, 2024
Via arXiv

LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization

Nov 26, 2024
Via arXiv

ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction

Nov 26, 2024
Via arXiv

Glo-In-One-v2: Holistic Identification of Glomerular Cells, Tissues, and Lesions in Human and Mouse Histopathology

Nov 25, 2024
Via arXiv

Continual SFT Matches Multimodal RLHF with Negative Supervision

Nov 22, 2024
Via arXiv

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Nov 20, 2024
Via arXiv

Towards Accurate and Efficient Sub-8-Bit Integer Training

Nov 17, 2024
Via arXiv

WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking

Nov 14, 2024
Via arXiv