Picture for Yifei Hu

Yifei Hu

Anjelica

CLAMP: Crowdsourcing a LArge-scale in-the-wild haptic dataset with an open-source device for Multimodal robot Perception

Add code
May 27, 2025
Viaarxiv icon

AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving

Add code
May 21, 2025
Viaarxiv icon

Goku: Flow Based Video Generative Foundation Models

Add code
Feb 10, 2025
Viaarxiv icon

GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs

Add code
Nov 11, 2024
Viaarxiv icon

EVLM: An Efficient Vision-Language Model for Visual Understanding

Add code
Jul 19, 2024
Figure 1 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 2 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 3 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 4 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Viaarxiv icon

MORPHeus: a Multimodal One-armed Robot-assisted Peeling System with Human Users In-the-loop

Add code
Apr 09, 2024
Viaarxiv icon

Misspelling Correction with Pre-trained Contextual Language Model

Add code
Jan 08, 2021
Figure 1 for Misspelling Correction with Pre-trained Contextual Language Model
Figure 2 for Misspelling Correction with Pre-trained Contextual Language Model
Figure 3 for Misspelling Correction with Pre-trained Contextual Language Model
Figure 4 for Misspelling Correction with Pre-trained Contextual Language Model
Viaarxiv icon