Picture for Hao Shi

Hao Shi

GeoVLA: Empowering 3D Representations in Vision-Language-Action Models

Add code
Aug 12, 2025
Viaarxiv icon

NeuralDB: Scaling Knowledge Editing in LLMs to 100,000 Facts with Neural KV Database

Add code
Jul 24, 2025
Viaarxiv icon

Grounding Beyond Detection: Enhancing Contextual Understanding in Embodied 3D Grounding

Add code
Jun 05, 2025
Viaarxiv icon

Hadaptive-Net: Efficient Vision Models via Adaptive Cross-Hadamard Synergy

Add code
May 28, 2025
Viaarxiv icon

Bridging Speech Emotion Recognition and Personality: Dataset and Temporal Interaction Condition Network

Add code
May 20, 2025
Viaarxiv icon

Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement

Add code
May 20, 2025
Viaarxiv icon

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding

Add code
May 08, 2025
Viaarxiv icon

DConAD: A Differencing-based Contrastive Representation Learning Framework for Time Series Anomaly Detection

Add code
Apr 19, 2025
Viaarxiv icon

EgoEvGesture: Gesture Recognition Based on Egocentric Event Camera

Add code
Mar 16, 2025
Figure 1 for EgoEvGesture: Gesture Recognition Based on Egocentric Event Camera
Figure 2 for EgoEvGesture: Gesture Recognition Based on Egocentric Event Camera
Figure 3 for EgoEvGesture: Gesture Recognition Based on Egocentric Event Camera
Figure 4 for EgoEvGesture: Gesture Recognition Based on Egocentric Event Camera
Viaarxiv icon

HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors

Add code
Mar 10, 2025
Viaarxiv icon