Zhenyu Li

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

Jun 10, 2024

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance

Mar 20, 2024

AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework

Mar 19, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

Feb 27, 2024

AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning

Feb 08, 2024

UniMem: Towards a Unified View of Long-Context Large Language Models

Feb 05, 2024

Mixed Static and Reconfigurable Metasurface Deployment in Indoor Dense Spaces: How Much Reconfigurability is Needed?

Feb 01, 2024

PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Dec 04, 2023

Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model

Nov 01, 2023

Enhancing Subtask Performance of Multi-modal Large Language Model

Aug 31, 2023