Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Vishal Maini

Reducing Sentiment Bias in Language Models via Counterfactual Evaluation


Nov 08, 2019
Po-Sen Huang, Huan Zhang, Ray Jiang, Robert Stanforth, Johannes Welbl, Jack Rae, Vishal Maini, Dani Yogatama, Pushmeet Kohli


  Access Paper or Ask Questions

Scalable agent alignment via reward modeling: a research direction


Nov 19, 2018
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg


  Access Paper or Ask Questions