Picture for Bo Wang

Bo Wang

Tencent, WeChat Pay

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Add code
Jan 23, 2026
Viaarxiv icon

Beyond the Dirac Delta: Mitigating Diversity Collapse in Reinforcement Fine-Tuning for Versatile Image Generation

Add code
Jan 18, 2026
Viaarxiv icon

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Add code
Jan 16, 2026
Viaarxiv icon

From Performance to Practice: Knowledge-Distilled Segmentator for On-Premises Clinical Workflows

Add code
Jan 14, 2026
Viaarxiv icon

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models

Add code
Jan 13, 2026
Viaarxiv icon

Multi-hop Reasoning via Early Knowledge Alignment

Add code
Dec 23, 2025
Figure 1 for Multi-hop Reasoning via Early Knowledge Alignment
Figure 2 for Multi-hop Reasoning via Early Knowledge Alignment
Figure 3 for Multi-hop Reasoning via Early Knowledge Alignment
Figure 4 for Multi-hop Reasoning via Early Knowledge Alignment
Viaarxiv icon

Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection

Add code
Dec 19, 2025
Figure 1 for Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection
Figure 2 for Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection
Figure 3 for Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection
Figure 4 for Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection
Viaarxiv icon

Benchmarking and Adapting On-Device Large Language Models for Clinical Decision Support

Add code
Dec 18, 2025
Viaarxiv icon

TF-MCL: Time-frequency Fusion and Multi-domain Cross-Loss for Self-supervised Depression Detection

Add code
Dec 14, 2025
Viaarxiv icon

A 96pJ/Frame/Pixel and 61pJ/Event Anti-UAV System with Hybrid Object Tracking Modes

Add code
Dec 12, 2025
Viaarxiv icon