Picture for Junqi Gao

Junqi Gao

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Add code
Apr 01, 2025
Viaarxiv icon

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Add code
Feb 10, 2025
Viaarxiv icon

Fast and Slow Gradient Approximation for Binary Neural Network Optimization

Add code
Dec 16, 2024
Figure 1 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 2 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 3 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 4 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Viaarxiv icon

Less is More: Efficient Model Merging with Binary Task Switch

Add code
Nov 24, 2024
Viaarxiv icon

An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Add code
Nov 11, 2024
Viaarxiv icon

SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning

Add code
Aug 04, 2024
Viaarxiv icon

Enhancing Adversarial Transferability via Information Bottleneck Constraints

Add code
Jun 08, 2024
Figure 1 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 2 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 3 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 4 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Viaarxiv icon

Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability

Add code
Jun 08, 2024
Viaarxiv icon

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing

Add code
Jun 08, 2024
Viaarxiv icon

SMR: State Memory Replay for Long Sequence Modeling

Add code
May 27, 2024
Viaarxiv icon