Picture for Yuan Yao

Yuan Yao

Department of Mathematics, Hong Kong University of Science and Technology

Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation

Add code
Jan 09, 2025
Figure 1 for Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
Figure 2 for Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
Figure 3 for Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
Figure 4 for Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation
Viaarxiv icon

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Neuro-Symbolic Data Generation for Math Reasoning

Add code
Dec 06, 2024
Figure 1 for Neuro-Symbolic Data Generation for Math Reasoning
Figure 2 for Neuro-Symbolic Data Generation for Math Reasoning
Figure 3 for Neuro-Symbolic Data Generation for Math Reasoning
Figure 4 for Neuro-Symbolic Data Generation for Math Reasoning
Viaarxiv icon

Large Language Models show both individual and collective creativity comparable to humans

Add code
Dec 04, 2024
Figure 1 for Large Language Models show both individual and collective creativity comparable to humans
Figure 2 for Large Language Models show both individual and collective creativity comparable to humans
Figure 3 for Large Language Models show both individual and collective creativity comparable to humans
Figure 4 for Large Language Models show both individual and collective creativity comparable to humans
Viaarxiv icon

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

Add code
Nov 11, 2024
Figure 1 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 2 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 3 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Figure 4 for ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Viaarxiv icon

UniGAD: Unifying Multi-level Graph Anomaly Detection

Add code
Nov 10, 2024
Figure 1 for UniGAD: Unifying Multi-level Graph Anomaly Detection
Figure 2 for UniGAD: Unifying Multi-level Graph Anomaly Detection
Figure 3 for UniGAD: Unifying Multi-level Graph Anomaly Detection
Figure 4 for UniGAD: Unifying Multi-level Graph Anomaly Detection
Viaarxiv icon

Autoregressive Models in Vision: A Survey

Add code
Nov 08, 2024
Figure 1 for Autoregressive Models in Vision: A Survey
Figure 2 for Autoregressive Models in Vision: A Survey
Figure 3 for Autoregressive Models in Vision: A Survey
Figure 4 for Autoregressive Models in Vision: A Survey
Viaarxiv icon

Neuro-symbolic Learning Yielding Logical Constraints

Add code
Oct 28, 2024
Figure 1 for Neuro-symbolic Learning Yielding Logical Constraints
Figure 2 for Neuro-symbolic Learning Yielding Logical Constraints
Figure 3 for Neuro-symbolic Learning Yielding Logical Constraints
Figure 4 for Neuro-symbolic Learning Yielding Logical Constraints
Viaarxiv icon

Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models

Add code
Oct 23, 2024
Viaarxiv icon

Elucidating the design space of language models for image generation

Add code
Oct 21, 2024
Figure 1 for Elucidating the design space of language models for image generation
Figure 2 for Elucidating the design space of language models for image generation
Figure 3 for Elucidating the design space of language models for image generation
Figure 4 for Elucidating the design space of language models for image generation
Viaarxiv icon