Xiangliang Zhang

KAUST, Saudi Arabia

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

May 29, 2024

Cross-Context Backdoor Attacks against Graph Prompt Learning

May 28, 2024

Zero-Shot Relational Learning for Multimodal Knowledge Graphs

Apr 09, 2024

Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark

Feb 22, 2024

Defending Jailbreak Prompts via In-Context Adversarial Game

Feb 20, 2024

UGMAE: A Unified Framework for Graph Masked Autoencoders

Feb 12, 2024

Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective

Feb 06, 2024

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

Feb 06, 2024

Manipulating Predictions over Discrete Inputs in Machine Teaching

Jan 31, 2024

TrustLLM: Trustworthiness in Large Language Models

Jan 25, 2024