Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shreshth Rajan

Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Dec 19, 2025

Shreshth Rajan, Raymond Liu

Figure 1 for Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Figure 2 for Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Figure 3 for Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Figure 4 for Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Abstract:Semantic segmentation of outdoor street scenes plays a key role in applications such as autonomous driving, mobile robotics, and assistive technology for visually-impaired pedestrians. For these applications, accurately distinguishing between key surfaces and objects such as roads, sidewalks, vehicles, and pedestrians is essential for maintaining safety and minimizing risks. Semantic segmentation must be robust to different environments, lighting and weather conditions, and sensor noise, while being performed in real-time. We propose a region-level, uncertainty-gated retrieval mechanism that improves segmentation accuracy and calibration under domain shift. Our best method achieves an 11.3% increase in mean intersection-over-union while reducing retrieval cost by 87.5%, retrieving for only 12.5% of regions compared to 100% for always-on baseline.

Via

Access Paper or Ask Questions

Arithmetic-Intensity-Aware Quantization

Dec 17, 2025

Taig Singh, Shreshth Rajan, Nikhil Jain

Figure 1 for Arithmetic-Intensity-Aware Quantization

Figure 2 for Arithmetic-Intensity-Aware Quantization

Figure 3 for Arithmetic-Intensity-Aware Quantization

Figure 4 for Arithmetic-Intensity-Aware Quantization

Abstract:As modern neural networks become increasingly memory-bound, inference throughput is limited by DRAM bandwidth rather than compute. We present Arithmetic-Intensity-Aware Quantization (AIQ), a mixed precision quantization framework that chooses per-layer bit-widths to maximize arithmetic intensity (AI) while minimizing accuracy loss. AIQ is a post-training quantization method that uses search algorithms over per-layer quantization schemes to minimize a weighted loss over AI and accuracy. On ResNet-20/CIFAR-10, AIQ increases AI by ~50% over an FP32 baseline while keeping test accuracy within ~1 percentage point, and outperforming global uniform quantization schemes. On a memory-bound MobileNetV2 architecture, AIQ configurations give a 1.66x higher throughput than the FP32 baseline while keeping test accuracy within 1 percentage point. We also find that AIQ naturally quantizes larger layers more aggressively.

Via

Access Paper or Ask Questions