Picture for Ping Luo

Ping Luo

CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians

Add code
Oct 28, 2024
Figure 1 for CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Figure 2 for CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Figure 3 for CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Figure 4 for CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Viaarxiv icon

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Add code
Oct 17, 2024
Figure 1 for Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Figure 2 for Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Figure 3 for Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Figure 4 for Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Viaarxiv icon

Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos

Add code
Oct 15, 2024
Figure 1 for Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
Figure 2 for Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
Figure 3 for Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
Figure 4 for Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
Viaarxiv icon

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Add code
Oct 11, 2024
Figure 1 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 2 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 3 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 4 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Viaarxiv icon

DCP: Learning Accelerator Dataflow for Neural Network via Propagation

Add code
Oct 09, 2024
Figure 1 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 2 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 3 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 4 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Viaarxiv icon

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Add code
Oct 07, 2024
Figure 1 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 2 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 3 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 4 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Viaarxiv icon

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Add code
Oct 07, 2024
Viaarxiv icon

HRVMamba: High-Resolution Visual State Space Model for Dense Prediction

Add code
Oct 04, 2024
Viaarxiv icon

Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking

Add code
Sep 24, 2024
Viaarxiv icon

Prior Knowledge Distillation Network for Face Super-Resolution

Add code
Sep 22, 2024
Figure 1 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 2 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 3 for Prior Knowledge Distillation Network for Face Super-Resolution
Figure 4 for Prior Knowledge Distillation Network for Face Super-Resolution
Viaarxiv icon