Picture for Ser-Nam Lim

Ser-Nam Lim

Facebook Research, New York, NY, USA

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Add code
Dec 03, 2024
Viaarxiv icon

OmniCreator: Self-Supervised Unified Generation with Universal Editing

Add code
Dec 03, 2024
Viaarxiv icon

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Add code
Nov 30, 2024
Viaarxiv icon

DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness

Add code
Nov 29, 2024
Viaarxiv icon

Towards Chunk-Wise Generation for Long Videos

Add code
Nov 27, 2024
Viaarxiv icon

Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop

Add code
Nov 26, 2024
Viaarxiv icon

Fast Encoding and Decoding for Implicit Video Representation

Add code
Sep 28, 2024
Viaarxiv icon

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

Add code
Sep 16, 2024
Viaarxiv icon

DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks

Add code
Sep 10, 2024
Viaarxiv icon

AirSketch: Generative Motion to Sketch

Add code
Jul 12, 2024
Viaarxiv icon