Picture for Andreas Vlachos

Andreas Vlachos

Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation

Add code
May 29, 2025
Viaarxiv icon

TCP: a Benchmark for Temporal Constraint-Based Planning

Add code
May 26, 2025
Viaarxiv icon

Social Good or Scientific Curiosity? Uncovering the Research Framing Behind NLP Artefacts

Add code
May 24, 2025
Viaarxiv icon

AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web

Add code
May 23, 2025
Viaarxiv icon

Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training Objectives

Add code
Apr 22, 2025
Viaarxiv icon

Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

Add code
Mar 06, 2025
Viaarxiv icon

The Future Outcome Reasoning and Confidence Assessment Benchmark

Add code
Feb 27, 2025
Viaarxiv icon

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Add code
Dec 30, 2024
Figure 1 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 2 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 3 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Figure 4 for Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Viaarxiv icon

Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

Add code
Dec 15, 2024
Figure 1 for Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models
Figure 2 for Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models
Figure 3 for Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models
Figure 4 for Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models
Viaarxiv icon

Decompose and Leverage Preferences from Expert Models for Improving Trustworthiness of MLLMs

Add code
Nov 20, 2024
Viaarxiv icon