Srinivas Shakkottai

Department of Electrical and Computer Engineering, Texas A&M University

PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training

Jul 26, 2025
Via arXiv

Risk-Averse Finetuning of Large Language Models

Jan 12, 2025
Via arXiv

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

Oct 07, 2024
Via arXiv

CONGO: Compressive Online Gradient Optimization with Application to Microservices Management

Jul 08, 2024
Via arXiv

Structured Reinforcement Learning for Media Streaming at the Wireless Edge

Apr 10, 2024
Via arXiv

Transformers are Efficient In-Context Estimators for Wireless Communication

Nov 01, 2023
Via arXiv

LLMZip: Lossless Text Compression using Large Language Models

Jun 26, 2023
Via arXiv

Federated Ensemble-Directed Offline Reinforcement Learning

May 04, 2023
Via arXiv

Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

Nov 02, 2022
Via arXiv

Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Sep 26, 2022
Via arXiv