Kevin Song

Evaluating Model-Free Policy Optimization in Masked-Action Environments via an Exact Blackjack Oracle

Mar 19, 2026
Via arXiv

LLM Agent Swarm for Hypothesis-Driven Drug Discovery

Apr 24, 2025
Via arXiv

Seesaw: High-throughput LLM Inference via Model Re-sharding

Mar 09, 2025
Via arXiv