Peng Li

DJI Innovations Inc

DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms

Mar 05, 2025
Via arXiv

Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization

Feb 20, 2025
Via arXiv

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Feb 20, 2025
Via arXiv

LLM-USO: Large Language Model-based Universal Sizing Optimizer

Feb 04, 2025
Via arXiv

Perspective Transition of Large Language Models for Solving Subjective Tasks

Jan 16, 2025
Via arXiv

Hierarchical Superpixel Segmentation via Structural Information Theory

Jan 13, 2025
Via arXiv

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Jan 07, 2025
Via arXiv

Towards 3D Acceleration for low-power Mixture-of-Experts and Multi-Head Attention Spiking Transformers

Dec 07, 2024
Via arXiv

Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC

Dec 07, 2024
Via arXiv

Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search

Dec 07, 2024
Via arXiv