Hang He

VULCA-Bench: A Multicultural Vision-Language Benchmark for Evaluating Cultural Understanding

Jan 12, 2026
Via arXiv

ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs

Dec 18, 2025
Via arXiv

LocalSearchBench: Benchmarking Agentic Search in Real-World Local Life Services

Dec 08, 2025
Via arXiv

Promoting Efficient Reasoning with Verifiable Stepwise Reward

Aug 14, 2025
Via arXiv

Automated detection of atomicity violations in large-scale systems

Apr 01, 2025
Via arXiv

FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics

Feb 24, 2024
Via arXiv