Picture for Anuj Kumar

Anuj Kumar

North Carolina State University

WearVox: An Egocentric Multichannel Voice Assistant Benchmark for Wearables

Add code
Dec 25, 2025
Viaarxiv icon

MobileLLM-Pro Technical Report

Add code
Nov 10, 2025
Viaarxiv icon

CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark

Add code
Oct 30, 2025
Figure 1 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 2 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 3 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 4 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Viaarxiv icon

SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 2 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 3 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Figure 4 for SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Viaarxiv icon

Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage

Add code
Oct 02, 2025
Figure 1 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 2 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 3 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Figure 4 for Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
Viaarxiv icon

ConfQA: Answer Only If You Are Confident

Add code
Jun 08, 2025
Figure 1 for ConfQA: Answer Only If You Are Confident
Figure 2 for ConfQA: Answer Only If You Are Confident
Figure 3 for ConfQA: Answer Only If You Are Confident
Figure 4 for ConfQA: Answer Only If You Are Confident
Viaarxiv icon

Proactive Assistant Dialogue Generation from Streaming Egocentric Videos

Add code
Jun 06, 2025
Viaarxiv icon

VisualLens: Personalization through Visual History

Add code
Nov 25, 2024
Figure 1 for VisualLens: Personalization through Visual History
Figure 2 for VisualLens: Personalization through Visual History
Figure 3 for VisualLens: Personalization through Visual History
Figure 4 for VisualLens: Personalization through Visual History
Viaarxiv icon

EgoQR: Efficient QR Code Reading in Egocentric Settings

Add code
Oct 07, 2024
Figure 1 for EgoQR: Efficient QR Code Reading in Egocentric Settings
Figure 2 for EgoQR: Efficient QR Code Reading in Egocentric Settings
Figure 3 for EgoQR: Efficient QR Code Reading in Egocentric Settings
Figure 4 for EgoQR: Efficient QR Code Reading in Egocentric Settings
Viaarxiv icon

Doppelgänger's Watch: A Split Objective Approach to Large Language Models

Add code
Sep 09, 2024
Viaarxiv icon