Tat-Seng Chua

DanceOPD: On-Policy Generative Field Distillation

Jun 25, 2026

CARE: Competence-Aware Reward Shaping for Adaptive Reasoning Length in Video-MLLMs

Jun 18, 2026

HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining

Jun 18, 2026

Enhancing Decision-Making with Large Language Models through Multi-Agent Fictitious Play

Jun 17, 2026

Reasoning as Intersection: Consensus-Frame Alignment for Visual Focus in Video-MLLMs

Jun 16, 2026

An Extensive Benchmark for Single-round and Multi-round Instruction-based Image Editing

Jun 14, 2026

A Robust Point Cloud Analysis Framework Inspired By Primary Visual Cortex

Jun 12, 2026

CFALR: Collaborative Filtering-Augmented Large Language Model for Personalized Fashion Outfit Recommendation

Jun 11, 2026

MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning

Jun 10, 2026

Turing Patterns for Multimedia: Reaction-Diffusion Multi-Modal Fusion for Language-Guided Video Moment Retrieval

Jun 01, 2026