Picture for Pengfei Hu

Pengfei Hu

Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios

Add code
Jun 21, 2024
Figure 1 for Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
Figure 2 for Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
Figure 3 for Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
Figure 4 for Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
Viaarxiv icon

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Add code
Jun 13, 2024
Viaarxiv icon

SEMv3: A Fast and Robust Approach to Table Separation Line Detection

Add code
May 20, 2024
Viaarxiv icon

Poisson-Gamma Dynamical Systems with Non-Stationary Transition Dynamics

Add code
Feb 26, 2024
Viaarxiv icon

Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition

Add code
Dec 31, 2023
Viaarxiv icon

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Add code
Nov 17, 2023
Viaarxiv icon

Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives

Add code
Sep 21, 2023
Figure 1 for Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives
Figure 2 for Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives
Figure 3 for Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives
Figure 4 for Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives
Viaarxiv icon

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

Add code
Sep 11, 2023
Figure 1 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 2 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 3 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 4 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Viaarxiv icon

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

Add code
Sep 09, 2023
Figure 1 for Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Figure 2 for Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Figure 3 for Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Figure 4 for Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Viaarxiv icon

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Add code
Jul 30, 2023
Figure 1 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 2 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 3 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 4 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Viaarxiv icon