Distractor Generation


Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Auto-Comp: An Automated Pipeline for Scalable Compositional Probing of Contrastive Vision-Language Models

Add code
Feb 02, 2026
Viaarxiv icon

CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering

Add code
Feb 01, 2026
Viaarxiv icon

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

Add code
Feb 02, 2026
Viaarxiv icon

ArabicDialectHub: A Cross-Dialectal Arabic Learning Resource and Platform

Add code
Jan 30, 2026
Viaarxiv icon

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Add code
Jan 30, 2026
Viaarxiv icon

DeepEra: A Deep Evidence Reranking Agent for Scientific Retrieval-Augmented Generated Question Answering

Add code
Jan 23, 2026
Viaarxiv icon

When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering

Add code
Jan 27, 2026
Viaarxiv icon

Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits

Add code
Jan 23, 2026
Viaarxiv icon

VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models

Add code
Jan 20, 2026
Viaarxiv icon