Youfeng Liu

AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

May 22, 2025, via arXiv

PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Jan 13, 2025, via arXiv

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Dec 23, 2024, via arXiv