Kaiqiang Song

Complex Logical Instruction Generation

Aug 12, 2025

Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Aug 06, 2025

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Oct 09, 2024

Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

Jul 01, 2024

WPO: Enhancing RLHF with Weighted Preference Optimization

Jun 17, 2024

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Jun 17, 2024

Polarity Calibration for Opinion Summarization

Apr 02, 2024

Can Large Language Models do Analytical Reasoning?

Mar 06, 2024

SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs

Feb 15, 2024

SPECTRUM: Speaker-Enhanced Pre-Training for Long Dialogue Summarization

Jan 31, 2024