Picture for Jing Huang

Jing Huang

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Add code
Jun 05, 2025
Viaarxiv icon

ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation

Add code
May 30, 2025
Viaarxiv icon

GIM: Improved Interpretability for Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Manipulating Elasto-Plastic Objects With 3D Occupancy and Learning-Based Predictive Control

Add code
May 22, 2025
Viaarxiv icon

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

Add code
May 17, 2025
Viaarxiv icon

ScaleTrack: Scaling and back-tracking Automated GUI Agents

Add code
May 01, 2025
Viaarxiv icon

MIB: A Mechanistic Interpretability Benchmark

Add code
Apr 17, 2025
Viaarxiv icon

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Add code
Apr 13, 2025
Viaarxiv icon

Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs

Add code
Mar 27, 2025
Viaarxiv icon

HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks

Add code
Mar 13, 2025
Viaarxiv icon