Picture for Zhaoxin Fan

Zhaoxin Fan

Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks

Add code
Apr 24, 2025
Viaarxiv icon

AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline

Add code
Apr 01, 2025
Viaarxiv icon

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Add code
Mar 28, 2025
Viaarxiv icon

STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM

Add code
Mar 27, 2025
Viaarxiv icon

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

Add code
Mar 12, 2025
Viaarxiv icon

ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis

Add code
Mar 09, 2025
Viaarxiv icon

DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue

Add code
Feb 19, 2025
Viaarxiv icon

TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding

Add code
Jan 26, 2025
Viaarxiv icon

EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

Add code
Dec 29, 2024
Viaarxiv icon

MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing

Add code
Dec 28, 2024
Viaarxiv icon