Yi Wang

NUS

EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization

Jun 17, 2025
Via arXiv

A Gravity-informed Spatiotemporal Transformer for Human Activity Intensity Prediction

Jun 16, 2025
Via arXiv

OneRec Technical Report

Jun 16, 2025
Via arXiv

Fuzzy Propositional Formulas under the Stable Model Semantics

Jun 15, 2025
Via arXiv

IndoorWorld: Integrating Physical Task Solving and Social Simulation in A Heterogeneous Multi-Agent Environment

Jun 14, 2025
Via arXiv

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Jun 12, 2025
Via arXiv

3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection

Jun 11, 2025
Via arXiv

Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective

Jun 11, 2025
Via arXiv

Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning

Jun 09, 2025
Via arXiv

VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Jun 06, 2025
Via arXiv