Ramón Fernandez Astudillo

Optimal Policy Minimum Bayesian Risk

May 22, 2025

Latent Principle Discovery for Language Model Self-Improvement

May 22, 2025

Multi-Document Grounded Multi-Turn Synthetic Dialog Generation

Sep 17, 2024

Self-Refinement of Language Models from External Proxy Metrics Feedback

Feb 27, 2024

Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations

Feb 20, 2024

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

Feb 04, 2024

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Oct 21, 2023

AMR Parsing with Instruction Fine-tuned Pre-trained Language Models

Apr 24, 2023

DocAMR: Multi-Sentence AMR Representation and Evaluation

Dec 15, 2021

Structure-aware Fine-tuning of Sequence-to-sequence Transformers for Transition-based AMR Parsing

Oct 29, 2021