Derek F. Wong

From Scenes to Elements: Multi-Granularity Evidence Retrieval for Verifiable Multimodal RAG

May 14, 2026

Chain-of-Procedure: Hierarchical Visual-Language Reasoning for Procedural QA

May 14, 2026

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis Using Caption-Prompt-Judge and LLM-as-a-Judge

Apr 26, 2026

OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning

Apr 20, 2026

Who Wrote This Line? Evaluating the Detection of LLM-Generated Classical Chinese Poetry

Apr 11, 2026

Can ChatGPT Really Understand Modern Chinese Poetry?

Mar 21, 2026

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Mar 13, 2026

One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

Oct 30, 2025

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Oct 23, 2025

ExGRPO: Learning to Reason from Experience

Oct 02, 2025