Picture for Lin Qu

Lin Qu

Factored Causal Representation Learning for Robust Reward Modeling in RLHF

Add code
Jan 29, 2026
Viaarxiv icon

Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement

Add code
Jan 08, 2026
Viaarxiv icon

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Add code
Dec 31, 2025
Viaarxiv icon

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure

Add code
Dec 27, 2025
Viaarxiv icon

Reliable and Private Utility Signaling for Data Markets

Add code
Nov 11, 2025
Viaarxiv icon

DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement

Add code
Aug 20, 2025
Figure 1 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 2 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 3 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Figure 4 for DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement
Viaarxiv icon

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Add code
Jun 06, 2025
Viaarxiv icon

InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling

Add code
May 27, 2025
Figure 1 for InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling
Figure 2 for InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling
Figure 3 for InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling
Figure 4 for InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling
Viaarxiv icon

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Add code
Jul 23, 2024
Figure 1 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 2 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 3 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Figure 4 for DDK: Distilling Domain Knowledge for Efficient Large Language Models
Viaarxiv icon

SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules

Add code
Jul 02, 2024
Figure 1 for SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules
Figure 2 for SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules
Figure 3 for SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules
Figure 4 for SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules
Viaarxiv icon