Picture for Xijie Huang

Xijie Huang

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

Add code
Jan 06, 2026
Viaarxiv icon

Flying in Clutter on Monocular RGB by Learning in 3D Radiance Fields with Domain Adaptation

Add code
Dec 19, 2025
Viaarxiv icon

A 28nm 0.22 μJ/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semanticsegmentation

Add code
Dec 19, 2025
Viaarxiv icon

Reactive Aerobatic Flight via Reinforcement Learning

Add code
May 30, 2025
Viaarxiv icon

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Add code
Feb 24, 2025
Viaarxiv icon

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

Add code
Aug 01, 2024
Figure 1 for SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Figure 2 for SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Figure 3 for SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Figure 4 for SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Viaarxiv icon

Synth-Empathy: Towards High-Quality Synthetic Empathy Data

Add code
Jul 31, 2024
Viaarxiv icon

RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization

Add code
Jul 10, 2024
Figure 1 for RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Figure 2 for RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Figure 3 for RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Figure 4 for RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Viaarxiv icon

Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

Add code
May 26, 2024
Figure 1 for Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Figure 2 for Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Figure 3 for Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Figure 4 for Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Viaarxiv icon