Hao Li

Jack

UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision

Dec 24, 2024
Via arXiv

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

Dec 24, 2024
Via arXiv

DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak

Dec 23, 2024
Via arXiv

CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction

Dec 23, 2024
Via arXiv

Object Style Diffusion for Generalized Object Detection in Urban Scene

Dec 18, 2024
Via arXiv

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency

Dec 17, 2024
Via arXiv

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting

Dec 17, 2024
Via arXiv

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Dec 16, 2024
Via arXiv

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Dec 12, 2024
Via arXiv

Political Actor Agent: Simulating Legislative System for Roll Call Votes Prediction with Large Language Models

Dec 10, 2024
Via arXiv