Picture for Jinyang Guo

Jinyang Guo

Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference

Add code
Apr 05, 2026
Viaarxiv icon

BWTA: Accurate and Efficient Binarized Transformer by Algorithm-Hardware Co-design

Add code
Apr 05, 2026
Viaarxiv icon

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Add code
Jan 26, 2026
Viaarxiv icon

AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing

Add code
Jan 05, 2026
Viaarxiv icon

Context as a Tool: Context Management for Long-Horizon SWE-Agents

Add code
Dec 26, 2025
Viaarxiv icon

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Add code
Nov 19, 2025
Figure 1 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 2 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 3 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 4 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Viaarxiv icon

SLMQuant:Benchmarking Small Language Model Quantization for Practical Deployment

Add code
Nov 17, 2025
Viaarxiv icon

LLMC+: Benchmarking Vision-Language Model Compression with a Plug-and-play Toolkit

Add code
Aug 13, 2025
Viaarxiv icon

AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions

Add code
Jun 17, 2025
Viaarxiv icon

DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction

Add code
Jun 11, 2025
Viaarxiv icon