Frames Dataset


DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter

Add code
Feb 05, 2026
Viaarxiv icon

EgoPoseVR: Spatiotemporal Multi-Modal Reasoning for Egocentric Full-Body Pose in Virtual Reality

Add code
Feb 05, 2026
Viaarxiv icon

E.M.Ground: A Temporal Grounding Vid-LLM with Holistic Event Perception and Matching

Add code
Feb 05, 2026
Viaarxiv icon

TSBOW: Traffic Surveillance Benchmark for Occluded Vehicles Under Various Weather Conditions

Add code
Feb 05, 2026
Viaarxiv icon

xList-Hate: A Checklist-Based Framework for Interpretable and Generalizable Hate Speech Detection

Add code
Feb 05, 2026
Viaarxiv icon

Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space

Add code
Feb 05, 2026
Viaarxiv icon

Visual Implicit Geometry Transformer for Autonomous Driving

Add code
Feb 05, 2026
Viaarxiv icon

Poster: Camera Tampering Detection for Outdoor IoT Systems

Add code
Feb 05, 2026
Viaarxiv icon

PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction

Add code
Feb 05, 2026
Viaarxiv icon

Exploring the Temporal Consistency for Point-Level Weakly-Supervised Temporal Action Localization

Add code
Feb 05, 2026
Viaarxiv icon