Picture for Qiying Yu

Qiying Yu

Virtual Width Networks

Add code
Nov 17, 2025
Viaarxiv icon

ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation

Add code
Aug 24, 2025
Viaarxiv icon

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Add code
Jul 03, 2025
Figure 1 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 2 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 3 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Figure 4 for MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Viaarxiv icon

Truncated Proximal Policy Optimization

Add code
Jun 18, 2025
Figure 1 for Truncated Proximal Policy Optimization
Figure 2 for Truncated Proximal Policy Optimization
Figure 3 for Truncated Proximal Policy Optimization
Figure 4 for Truncated Proximal Policy Optimization
Viaarxiv icon

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Add code
May 26, 2025
Figure 1 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 2 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 3 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Figure 4 for Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Viaarxiv icon

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Add code
Apr 08, 2025
Figure 1 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 2 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 3 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Viaarxiv icon

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Figure 1 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 2 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 3 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 4 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Viaarxiv icon

Emu3: Next-Token Prediction is All You Need

Add code
Sep 27, 2024
Figure 1 for Emu3: Next-Token Prediction is All You Need
Figure 2 for Emu3: Next-Token Prediction is All You Need
Figure 3 for Emu3: Next-Token Prediction is All You Need
Figure 4 for Emu3: Next-Token Prediction is All You Need
Viaarxiv icon

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Add code
Feb 06, 2024
Figure 1 for EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Figure 2 for EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Figure 3 for EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Figure 4 for EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Add code
Dec 20, 2023
Viaarxiv icon