Picture for Ramakanth Pasunuru

Ramakanth Pasunuru

Efficient Tool Use with Chain-of-Abstraction Reasoning

Add code
Jan 30, 2024
Figure 1 for Efficient Tool Use with Chain-of-Abstraction Reasoning
Figure 2 for Efficient Tool Use with Chain-of-Abstraction Reasoning
Figure 3 for Efficient Tool Use with Chain-of-Abstraction Reasoning
Figure 4 for Efficient Tool Use with Chain-of-Abstraction Reasoning
Viaarxiv icon

PathFinder: Guided Search over Multi-Step Reasoning Paths

Add code
Dec 12, 2023
Figure 1 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 2 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 3 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Figure 4 for PathFinder: Guided Search over Multi-Step Reasoning Paths
Viaarxiv icon

Crystal: Introspective Reasoners Reinforced with Self-Feedback

Add code
Oct 18, 2023
Figure 1 for Crystal: Introspective Reasoners Reinforced with Self-Feedback
Figure 2 for Crystal: Introspective Reasoners Reinforced with Self-Feedback
Figure 3 for Crystal: Introspective Reasoners Reinforced with Self-Feedback
Figure 4 for Crystal: Introspective Reasoners Reinforced with Self-Feedback
Viaarxiv icon

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

Add code
Oct 08, 2023
Figure 1 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 2 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 3 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Figure 4 for Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Viaarxiv icon

Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding

Add code
Sep 26, 2023
Figure 1 for Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding
Figure 2 for Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding
Figure 3 for Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding
Figure 4 for Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding
Viaarxiv icon

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Add code
Sep 05, 2023
Figure 1 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 2 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 3 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 4 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Viaarxiv icon

Shepherd: A Critic for Language Model Generation

Add code
Aug 08, 2023
Figure 1 for Shepherd: A Critic for Language Model Generation
Figure 2 for Shepherd: A Critic for Language Model Generation
Figure 3 for Shepherd: A Critic for Language Model Generation
Figure 4 for Shepherd: A Critic for Language Model Generation
Viaarxiv icon

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Add code
Dec 28, 2022
Figure 1 for OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Figure 2 for OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Figure 3 for OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Figure 4 for OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Viaarxiv icon

Training Trajectories of Language Models Across Scales

Add code
Dec 19, 2022
Figure 1 for Training Trajectories of Language Models Across Scales
Figure 2 for Training Trajectories of Language Models Across Scales
Figure 3 for Training Trajectories of Language Models Across Scales
Figure 4 for Training Trajectories of Language Models Across Scales
Viaarxiv icon

MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation

Add code
Dec 16, 2022
Figure 1 for MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Figure 2 for MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Figure 3 for MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Figure 4 for MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Viaarxiv icon