Picture for Haibo Tong

Haibo Tong

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Add code
Feb 15, 2026
Viaarxiv icon

CogToM: A Comprehensive Theory of Mind Benchmark inspired by Human Cognition for Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

Scaling Agent Learning via Experience Synthesis

Add code
Nov 10, 2025
Figure 1 for Scaling Agent Learning via Experience Synthesis
Figure 2 for Scaling Agent Learning via Experience Synthesis
Figure 3 for Scaling Agent Learning via Experience Synthesis
Figure 4 for Scaling Agent Learning via Experience Synthesis
Viaarxiv icon

PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks

Add code
May 22, 2025
Figure 1 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 2 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 3 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Figure 4 for PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
Viaarxiv icon

Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society

Add code
Apr 24, 2025
Viaarxiv icon

MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation

Add code
Feb 03, 2025
Figure 1 for MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Figure 2 for MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Figure 3 for MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Figure 4 for MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Viaarxiv icon

Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind

Add code
Jan 07, 2025
Figure 1 for Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind
Figure 2 for Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind
Figure 3 for Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind
Figure 4 for Autonomous Alignment with Human Value on Altruism through Considerate Self-imagination and Theory of Mind
Viaarxiv icon

Evolving Efficient Genetic Encoding for Deep Spiking Neural Networks

Add code
Nov 11, 2024
Figure 1 for Evolving Efficient Genetic Encoding for Deep Spiking Neural Networks
Figure 2 for Evolving Efficient Genetic Encoding for Deep Spiking Neural Networks
Figure 3 for Evolving Efficient Genetic Encoding for Deep Spiking Neural Networks
Figure 4 for Evolving Efficient Genetic Encoding for Deep Spiking Neural Networks
Viaarxiv icon

Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms

Add code
Oct 29, 2024
Figure 1 for Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Figure 2 for Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Figure 3 for Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Figure 4 for Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Viaarxiv icon

Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method

Add code
Oct 22, 2024
Figure 1 for Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Figure 2 for Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Figure 3 for Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Figure 4 for Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Viaarxiv icon