Picture for Yujia Zheng

Yujia Zheng

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

Add code
May 26, 2025
Viaarxiv icon

Type Information-Assisted Self-Supervised Knowledge Graph Denoising

Add code
Mar 13, 2025
Viaarxiv icon

Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation

Add code
Feb 06, 2025
Viaarxiv icon

Causal Representation Learning from Multimodal Biological Observations

Add code
Nov 10, 2024
Figure 1 for Causal Representation Learning from Multimodal Biological Observations
Figure 2 for Causal Representation Learning from Multimodal Biological Observations
Figure 3 for Causal Representation Learning from Multimodal Biological Observations
Figure 4 for Causal Representation Learning from Multimodal Biological Observations
Viaarxiv icon

Identifying Selections for Unsupervised Subtask Discovery

Add code
Oct 28, 2024
Figure 1 for Identifying Selections for Unsupervised Subtask Discovery
Figure 2 for Identifying Selections for Unsupervised Subtask Discovery
Figure 3 for Identifying Selections for Unsupervised Subtask Discovery
Figure 4 for Identifying Selections for Unsupervised Subtask Discovery
Viaarxiv icon

Causality for Large Language Models

Add code
Oct 20, 2024
Viaarxiv icon

SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks

Add code
Oct 02, 2024
Figure 1 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 2 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 3 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 4 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Viaarxiv icon

Causal Temporal Representation Learning with Nonstationary Sparse Transition

Add code
Sep 05, 2024
Figure 1 for Causal Temporal Representation Learning with Nonstationary Sparse Transition
Figure 2 for Causal Temporal Representation Learning with Nonstationary Sparse Transition
Figure 3 for Causal Temporal Representation Learning with Nonstationary Sparse Transition
Figure 4 for Causal Temporal Representation Learning with Nonstationary Sparse Transition
Viaarxiv icon

On the Identifiability of Sparse ICA without Assuming Non-Gaussianity

Add code
Aug 19, 2024
Viaarxiv icon

Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight

Add code
Jul 11, 2024
Viaarxiv icon