Picture for Xin Lai

Xin Lai

Rethinking Recurrent Neural Networks for Time Series Forecasting: A Reinforced Recurrent Encoder with Prediction-Oriented Proximal Policy Optimization

Add code
Jan 07, 2026
Viaarxiv icon

VideoZoomer: Reinforcement-Learned Temporal Focusing for Long Video Reasoning

Add code
Dec 26, 2025
Viaarxiv icon

Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off

Add code
Nov 12, 2025
Viaarxiv icon

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Add code
Sep 09, 2025
Figure 1 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 2 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 3 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 4 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Viaarxiv icon

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Add code
Jul 17, 2025
Viaarxiv icon

CausalVE: Face Video Privacy Encryption via Causal Video Prediction

Add code
Sep 28, 2024
Figure 1 for CausalVE: Face Video Privacy Encryption via Causal Video Prediction
Figure 2 for CausalVE: Face Video Privacy Encryption via Causal Video Prediction
Figure 3 for CausalVE: Face Video Privacy Encryption via Causal Video Prediction
Figure 4 for CausalVE: Face Video Privacy Encryption via Causal Video Prediction
Viaarxiv icon

LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling

Add code
Sep 13, 2024
Figure 1 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Figure 2 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Figure 3 for LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling
Viaarxiv icon

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Add code
Jun 26, 2024
Figure 1 for Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Figure 2 for Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Figure 3 for Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Figure 4 for Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Viaarxiv icon

Improved Genetic Algorithm Based on Greedy and Simulated Annealing Ideas for Vascular Robot Ordering Strategy

Add code
Mar 28, 2024
Viaarxiv icon

An Improved Baseline for Reasoning Segmentation with Large Language Model

Add code
Jan 03, 2024
Figure 1 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 2 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 3 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 4 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Viaarxiv icon