Alert button
Picture for Chien-Sheng Wu

Chien-Sheng Wu

Alert button

LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond

Add code
Bookmark button
Alert button
May 23, 2023
Philippe Laban, Wojciech Kryściński, Divyansh Agarwal, Alexander R. Fabbri, Caiming Xiong, Shafiq Joty, Chien-Sheng Wu

Figure 1 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 2 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 3 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 4 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Viaarxiv icon

Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation

Add code
Bookmark button
Alert button
Mar 07, 2023
Yixin Liu, Alexander R. Fabbri, Yilun Zhao, Pengfei Liu, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev

Figure 1 for Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Figure 2 for Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Figure 3 for Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Figure 4 for Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Viaarxiv icon

Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions

Add code
Bookmark button
Alert button
Feb 17, 2023
Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Xiang 'Anthony' Chen, Caiming Xiong

Figure 1 for Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Figure 2 for Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Figure 3 for Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Figure 4 for Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Viaarxiv icon

Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization

Add code
Bookmark button
Alert button
Dec 20, 2022
Artidoro Pagnoni, Alexander R. Fabbri, Wojciech Kryściński, Chien-Sheng Wu

Figure 1 for Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Figure 2 for Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Figure 3 for Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Figure 4 for Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Viaarxiv icon

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation

Add code
Bookmark button
Alert button
Dec 15, 2022
Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev

Figure 1 for Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Figure 2 for Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Figure 3 for Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Figure 4 for Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Viaarxiv icon

Improving Factual Consistency in Summarization with Compression-Based Post-Editing

Add code
Bookmark button
Alert button
Nov 11, 2022
Alexander R. Fabbri, Prafulla Kumar Choubey, Jesse Vig, Chien-Sheng Wu, Caiming Xiong

Figure 1 for Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Figure 2 for Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Figure 3 for Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Figure 4 for Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Viaarxiv icon

Discord Questions: A Computational Approach To Diversity Analysis in News Coverage

Add code
Bookmark button
Alert button
Nov 09, 2022
Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Xiang 'Anthony' Chen, Caiming Xiong

Figure 1 for Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Figure 2 for Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Figure 3 for Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Figure 4 for Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Viaarxiv icon

Conformal Predictor for Improving Zero-shot Text Classification Efficiency

Add code
Bookmark button
Alert button
Oct 23, 2022
Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen Rajani

Figure 1 for Conformal Predictor for Improving Zero-shot Text Classification Efficiency
Figure 2 for Conformal Predictor for Improving Zero-shot Text Classification Efficiency
Figure 3 for Conformal Predictor for Improving Zero-shot Text Classification Efficiency
Figure 4 for Conformal Predictor for Improving Zero-shot Text Classification Efficiency
Viaarxiv icon

Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning

Add code
Bookmark button
Alert button
Oct 23, 2022
Xiangyu Peng, Chen Xing, Prafulla Kumar Choubey, Chien-Sheng Wu, Caiming Xiong

Figure 1 for Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Figure 2 for Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Figure 3 for Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Figure 4 for Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Viaarxiv icon

Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets

Add code
Bookmark button
Alert button
May 13, 2022
Philippe Laban, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong

Figure 1 for Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Figure 2 for Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Figure 3 for Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Figure 4 for Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Viaarxiv icon