Picture for Dileep Kalathil

Dileep Kalathil

Department of Electrical and Computer Engineering, Texas A&M University

Data-Efficient Autoregressive-to-Diffusion Language Models via On-Policy Distillation

Add code
Jun 04, 2026
Viaarxiv icon

Learnability-Informed Fine-Tuning of Diffusion Language Models

Add code
May 21, 2026
Viaarxiv icon

Reinforcement Learning for Diffusion LLMs with Entropy-Guided Step Selection and Stepwise Advantages

Add code
Mar 13, 2026
Viaarxiv icon

Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning

Add code
Feb 10, 2026
Viaarxiv icon

In-Context Learning for Gradient-Free Receiver Adaptation: Principles, Applications, and Theory

Add code
Jun 18, 2025
Viaarxiv icon

Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning

Add code
Jun 07, 2025
Viaarxiv icon

Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models

Add code
May 24, 2025
Figure 1 for Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Figure 2 for Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Figure 3 for Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Figure 4 for Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Viaarxiv icon

Distributionally Robust Direct Preference Optimization

Add code
Feb 04, 2025
Figure 1 for Distributionally Robust Direct Preference Optimization
Figure 2 for Distributionally Robust Direct Preference Optimization
Figure 3 for Distributionally Robust Direct Preference Optimization
Figure 4 for Distributionally Robust Direct Preference Optimization
Viaarxiv icon

Risk-Averse Finetuning of Large Language Models

Add code
Jan 12, 2025
Figure 1 for Risk-Averse Finetuning of Large Language Models
Figure 2 for Risk-Averse Finetuning of Large Language Models
Figure 3 for Risk-Averse Finetuning of Large Language Models
Figure 4 for Risk-Averse Finetuning of Large Language Models
Viaarxiv icon

PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems

Add code
Dec 09, 2024
Figure 1 for PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems
Figure 2 for PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems
Figure 3 for PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems
Figure 4 for PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems
Viaarxiv icon