Picture for Zifan Wang

Zifan Wang

Mechanistically Interpreting a Transformer-based 2-SAT Solver: An Axiomatic Approach

Add code
Jul 18, 2024
Viaarxiv icon

Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation

Add code
Jun 20, 2024
Viaarxiv icon

Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation

Add code
Jun 14, 2024
Figure 1 for Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Figure 2 for Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Figure 3 for Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Figure 4 for Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Viaarxiv icon

Sales Whisperer: A Human-Inconspicuous Attack on LLM Brand Recommendations

Add code
Jun 07, 2024
Figure 1 for Sales Whisperer: A Human-Inconspicuous Attack on LLM Brand Recommendations
Figure 2 for Sales Whisperer: A Human-Inconspicuous Attack on LLM Brand Recommendations
Figure 3 for Sales Whisperer: A Human-Inconspicuous Attack on LLM Brand Recommendations
Figure 4 for Sales Whisperer: A Human-Inconspicuous Attack on LLM Brand Recommendations
Viaarxiv icon

VeriSplit: Secure and Practical Offloading of Machine Learning Inferences across IoT Devices

Add code
Jun 02, 2024
Viaarxiv icon

Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space

Add code
May 26, 2024
Viaarxiv icon

Risk-averse Learning with Non-Stationary Distributions

Add code
Apr 03, 2024
Viaarxiv icon

Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot

Add code
Mar 28, 2024
Figure 1 for Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot
Figure 2 for Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot
Figure 3 for Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot
Figure 4 for Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot
Viaarxiv icon

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Add code
Mar 06, 2024
Figure 1 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 2 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 3 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 4 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Viaarxiv icon

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Add code
Feb 06, 2024
Figure 1 for HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Figure 2 for HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Figure 3 for HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Figure 4 for HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Viaarxiv icon