Hejia Zhang

Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Mar 09, 2026
Via arXiv

LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation

Feb 18, 2026
Via arXiv

ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design

Jan 29, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

Nov 13, 2025
Via arXiv

PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification

Jun 13, 2025
Via arXiv

ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation

May 14, 2025
Via arXiv

MAGE: A Multi-Agent Engine for Automated RTL Code Generation

Dec 10, 2024
Via arXiv

Improving Model Factuality with Fine-grained Critique-based Evaluator

Oct 24, 2024
Via arXiv

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Oct 21, 2024
Via arXiv