Picture for Jun Wang

Jun Wang

IBM T. J. Watson Research Center

Efficient Reinforcement Learning with Large Language Model Priors

Add code
Oct 10, 2024
Figure 1 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 2 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 3 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 4 for Efficient Reinforcement Learning with Large Language Model Priors
Viaarxiv icon

Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization

Add code
Oct 08, 2024
Viaarxiv icon

TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting

Add code
Oct 07, 2024
Viaarxiv icon

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Add code
Oct 07, 2024
Viaarxiv icon

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Add code
Oct 06, 2024
Viaarxiv icon

Mixture of Attentions For Speculative Decoding

Add code
Oct 04, 2024
Viaarxiv icon

SHAP-CAT: A interpretable multi-modal framework enhancing WSI classification via virtual staining and shapley-value-based multimodal fusion

Add code
Oct 02, 2024
Viaarxiv icon

PathSeeker: Exploring LLM Security Vulnerabilities with a Reinforcement Learning-Based Jailbreak Approach

Add code
Sep 21, 2024
Viaarxiv icon

Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis

Add code
Sep 13, 2024
Viaarxiv icon

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

Add code
Sep 10, 2024
Viaarxiv icon