Picture for Zhenfang Chen

Zhenfang Chen

Scene-agnostic Pose Regression for Visual Localization

Add code
Mar 25, 2025
Figure 1 for Scene-agnostic Pose Regression for Visual Localization
Figure 2 for Scene-agnostic Pose Regression for Visual Localization
Figure 3 for Scene-agnostic Pose Regression for Visual Localization
Figure 4 for Scene-agnostic Pose Regression for Visual Localization
Viaarxiv icon

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Add code
Feb 17, 2025
Viaarxiv icon

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Add code
Feb 04, 2025
Figure 1 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 2 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 3 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 4 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Viaarxiv icon

Compositional Physical Reasoning of Objects and Events from Videos

Add code
Aug 02, 2024
Viaarxiv icon

FlexAttention for Efficient High-Resolution Vision-Language Models

Add code
Jul 29, 2024
Figure 1 for FlexAttention for Efficient High-Resolution Vision-Language Models
Figure 2 for FlexAttention for Efficient High-Resolution Vision-Language Models
Figure 3 for FlexAttention for Efficient High-Resolution Vision-Language Models
Figure 4 for FlexAttention for Efficient High-Resolution Vision-Language Models
Viaarxiv icon

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

Add code
May 17, 2024
Figure 1 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 2 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 3 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Figure 4 for SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Viaarxiv icon

STAR: A Benchmark for Situated Reasoning in Real-World Videos

Add code
May 15, 2024
Figure 1 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 2 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 3 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 4 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Viaarxiv icon

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos

Add code
Feb 09, 2024
Figure 1 for ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Figure 2 for ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Figure 3 for ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Figure 4 for ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Viaarxiv icon

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Add code
Jan 30, 2024
Figure 1 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 2 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 3 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 4 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Viaarxiv icon

GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs

Add code
Nov 08, 2023
Figure 1 for GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Figure 2 for GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Figure 3 for GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Figure 4 for GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Viaarxiv icon