Alert button
Picture for Hiteshi Sharma

Hiteshi Sharma

Alert button

Language Models can be Logical Solvers

Nov 10, 2023
Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen

Figure 1 for Language Models can be Logical Solvers
Figure 2 for Language Models can be Logical Solvers
Figure 3 for Language Models can be Logical Solvers
Figure 4 for Language Models can be Logical Solvers
Viaarxiv icon

ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning

Sep 27, 2023
Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad

Figure 1 for ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning
Figure 2 for ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning
Figure 3 for ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning
Figure 4 for ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning
Viaarxiv icon

Evaluating Cognitive Maps and Planning in Large Language Models with CogEval

Sep 25, 2023
Ida Momennejad, Hosein Hasanbeig, Felipe Vieira, Hiteshi Sharma, Robert Osazuwa Ness, Nebojsa Jojic, Hamid Palangi, Jonathan Larson

Viaarxiv icon

Fine-Tuning Language Models with Advantage-Induced Policy Alignment

Jun 06, 2023
Banghua Zhu, Hiteshi Sharma, Felipe Vieira Frujeri, Shi Dong, Chenguang Zhu, Michael I. Jordan, Jiantao Jiao

Figure 1 for Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Figure 2 for Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Figure 3 for Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Figure 4 for Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Viaarxiv icon

Randomized Policy Learning for Continuous State and Action MDPs

Jun 08, 2020
Hiteshi Sharma, Rahul Jain

Figure 1 for Randomized Policy Learning for Continuous State and Action MDPs
Figure 2 for Randomized Policy Learning for Continuous State and Action MDPs
Viaarxiv icon

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

Oct 15, 2019
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain

Figure 1 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Figure 2 for Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Viaarxiv icon