Picture for Xiangliang Zhang

Xiangliang Zhang

KAUST, Saudi Arabia

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

Add code
May 29, 2024
Figure 1 for MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Figure 2 for MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Figure 3 for MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Figure 4 for MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Viaarxiv icon

Cross-Context Backdoor Attacks against Graph Prompt Learning

Add code
May 28, 2024
Figure 1 for Cross-Context Backdoor Attacks against Graph Prompt Learning
Figure 2 for Cross-Context Backdoor Attacks against Graph Prompt Learning
Figure 3 for Cross-Context Backdoor Attacks against Graph Prompt Learning
Figure 4 for Cross-Context Backdoor Attacks against Graph Prompt Learning
Viaarxiv icon

Zero-Shot Relational Learning for Multimodal Knowledge Graphs

Add code
Apr 09, 2024
Viaarxiv icon

Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark

Add code
Feb 22, 2024
Figure 1 for Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Figure 2 for Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Figure 3 for Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Figure 4 for Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Viaarxiv icon

Defending Jailbreak Prompts via In-Context Adversarial Game

Add code
Feb 20, 2024
Figure 1 for Defending Jailbreak Prompts via In-Context Adversarial Game
Figure 2 for Defending Jailbreak Prompts via In-Context Adversarial Game
Figure 3 for Defending Jailbreak Prompts via In-Context Adversarial Game
Figure 4 for Defending Jailbreak Prompts via In-Context Adversarial Game
Viaarxiv icon

UGMAE: A Unified Framework for Graph Masked Autoencoders

Add code
Feb 12, 2024
Figure 1 for UGMAE: A Unified Framework for Graph Masked Autoencoders
Figure 2 for UGMAE: A Unified Framework for Graph Masked Autoencoders
Figure 3 for UGMAE: A Unified Framework for Graph Masked Autoencoders
Figure 4 for UGMAE: A Unified Framework for Graph Masked Autoencoders
Viaarxiv icon

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

Add code
Feb 06, 2024
Figure 1 for SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark
Figure 2 for SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark
Figure 3 for SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark
Figure 4 for SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark
Viaarxiv icon

Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective

Add code
Feb 06, 2024
Figure 1 for Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective
Figure 2 for Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective
Figure 3 for Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective
Figure 4 for Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective
Viaarxiv icon

Manipulating Predictions over Discrete Inputs in Machine Teaching

Add code
Jan 31, 2024
Viaarxiv icon

TrustLLM: Trustworthiness in Large Language Models

Add code
Jan 25, 2024
Figure 1 for TrustLLM: Trustworthiness in Large Language Models
Figure 2 for TrustLLM: Trustworthiness in Large Language Models
Figure 3 for TrustLLM: Trustworthiness in Large Language Models
Figure 4 for TrustLLM: Trustworthiness in Large Language Models
Viaarxiv icon