Picture for Tianren Ma

Tianren Ma

AceTone: Bridging Words and Colors for Conditional Image Grading

Add code
Apr 01, 2026
Viaarxiv icon

ReDDiT: Rehashing Noise for Discrete Visual Generation

Add code
May 26, 2025
Viaarxiv icon

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Add code
Jun 17, 2024
Viaarxiv icon

Artemis: Towards Referential Understanding in Complex Videos

Add code
Jun 01, 2024
Figure 1 for Artemis: Towards Referential Understanding in Complex Videos
Figure 2 for Artemis: Towards Referential Understanding in Complex Videos
Figure 3 for Artemis: Towards Referential Understanding in Complex Videos
Figure 4 for Artemis: Towards Referential Understanding in Complex Videos
Viaarxiv icon

ChatterBox: Multi-round Multimodal Referring and Grounding

Add code
Jan 24, 2024
Viaarxiv icon