Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nihar B. Shah

On Strategyproof Conference Peer Review

Jun 16, 2018

Yichong Xu, Han Zhao, Xiaofei Shi, Nihar B. Shah

Figure 1 for On Strategyproof Conference Peer Review

Figure 2 for On Strategyproof Conference Peer Review

Figure 3 for On Strategyproof Conference Peer Review

Abstract:We consider peer review in a conference setting where there is typically an overlap between the set of reviewers and the set of authors. This overlap can incentivize strategic reviews to influence the final ranking of one's own papers. In this work, we address this problem through the lens of social choice, and present a theoretical framework for strategyproof and efficient peer review. We first present and analyze an algorithm for reviewer-assignment and aggregation that guarantees strategyproofness and a natural efficiency property called unanimity, when the authorship graph satisfies a simple property. Our algorithm is based on the so-called partitioning method, and can be thought as a generalization of this method to conference peer review settings. We then empirically show that the requisite property on the authorship graph is indeed satisfied in the ICLR-17 submission data, and further demonstrate a simple trick to make the partitioning method more practically appealing for conference peer review. Finally, we complement our positive results with negative theoretical results where we prove that under various ways of strengthening the requirements, it is impossible for any algorithm to be strategyproof and efficient.

Via

Access Paper or Ask Questions

PeerReview4All: Fair and Accurate Reviewer Assignment in Peer Review

Jun 16, 2018

Ivan Stelmakh, Nihar B. Shah, Aarti Singh

Figure 1 for PeerReview4All: Fair and Accurate Reviewer Assignment in Peer Review

Figure 2 for PeerReview4All: Fair and Accurate Reviewer Assignment in Peer Review

Abstract:We consider the problem of automated assignment of papers to reviewers in conference peer review, with a focus on fairness and statistical accuracy. Our fairness objective is to maximize the review quality of the most disadvantaged paper, in contrast to the commonly used objective of maximizing the total quality over all papers. We design an assignment algorithm based on an incremental max-flow procedure that we prove is near-optimally fair. Our statistical accuracy objective is to ensure correct recovery of the papers that should be accepted. We provide a sharp minimax analysis of the accuracy of the peer-review process for a popular objective-score model as well as for a novel subjective-score model that we propose in the paper. Our analysis proves that our proposed assignment algorithm also leads to a near-optimal statistical accuracy. Finally, we design a novel experiment that allows for an objective comparison of various assignment algorithms, and overcomes the inherent difficulty posed by the absence of a ground truth in experiments on peer-review. The results of this experiment corroborate the theoretical guarantees of our algorithm.

Via

Access Paper or Ask Questions

Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

Sep 28, 2016

Nihar B. Shah, Sivaraman Balakrishnan, Adityanand Guntuboyina, Martin J. Wainwright

Figure 1 for Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

Figure 2 for Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

Figure 3 for Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

Abstract:There are various parametric models for analyzing pairwise comparison data, including the Bradley-Terry-Luce (BTL) and Thurstone models, but their reliance on strong parametric assumptions is limiting. In this work, we study a flexible model for pairwise comparisons, under which the probabilities of outcomes are required only to satisfy a natural form of stochastic transitivity. This class includes parametric models including the BTL and Thurstone models as special cases, but is considerably more general. We provide various examples of models in this broader stochastically transitive class for which classical parametric models provide poor fits. Despite this greater flexibility, we show that the matrix of probabilities can be estimated at the same rate as in standard parametric models. On the other hand, unlike in the BTL and Thurstone models, computing the minimax-optimal estimator in the stochastically transitive model is non-trivial, and we explore various computationally tractable alternatives. We show that a simple singular value thresholding algorithm is statistically consistent but does not achieve the minimax rate. We then propose and study algorithms that achieve the minimax rate over interesting sub-classes of the full stochastically transitive class. We complement our theoretical results with thorough numerical simulations.

Via

Access Paper or Ask Questions

Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help

Sep 23, 2016

Reinhard Heckel, Nihar B. Shah, Kannan Ramchandran, Martin J. Wainwright

Figure 1 for Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help

Figure 2 for Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help

Figure 3 for Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help

Figure 4 for Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help

Abstract:We consider sequential or active ranking of a set of n items based on noisy pairwise comparisons. Items are ranked according to the probability that a given item beats a randomly chosen item, and ranking refers to partitioning the items into sets of pre-specified sizes according to their scores. This notion of ranking includes as special cases the identification of the top-k items and the total ordering of the items. We first analyze a sequential ranking algorithm that counts the number of comparisons won, and uses these counts to decide whether to stop, or to compare another pair of items, chosen based on confidence intervals specified by the data collected up to that point. We prove that this algorithm succeeds in recovering the ranking using a number of comparisons that is optimal up to logarithmic factors. This guarantee does not require any structural properties of the underlying pairwise probability matrix, unlike a significant body of past work on pairwise ranking based on parametric models such as the Thurstone or Bradley-Terry-Luce models. It has been a long-standing open question as to whether or not imposing these parametric assumptions allows for improved ranking algorithms. For stochastic comparison models, in which the pairwise probabilities are bounded away from zero, our second contribution is to resolve this issue by proving a lower bound for parametric models. This shows, perhaps surprisingly, that these popular parametric modeling choices offer at most logarithmic gains for stochastic comparisons.

* improved log factor in main result; added discussion on comparison probabilities close to zero; added numerical results

Via

Access Paper or Ask Questions

A Permutation-based Model for Crowd Labeling: Optimal Estimation and Robustness

Jun 30, 2016

Nihar B. Shah, Sivaraman Balakrishnan, Martin J. Wainwright

Figure 1 for A Permutation-based Model for Crowd Labeling: Optimal Estimation and Robustness

Abstract:The aggregation and denoising of crowd labeled data is a task that has gained increased significance with the advent of crowdsourcing platforms and massive datasets. In this paper, we propose a permutation-based model for crowd labeled data that is a significant generalization of the common Dawid-Skene model, and introduce a new error metric by which to compare different estimators. Working in a high-dimensional non-asymptotic framework that allows both the number of workers and tasks to scale, we derive optimal rates of convergence for the permutation-based model. We show that the permutation-based model offers significant robustness in estimation due to its richness, while surprisingly incurring only a small additional statistical penalty as compared to the Dawid-Skene model. Finally, we propose a computationally-efficient method, called the OBI-WAN estimator, that is uniformly optimal over a class intermediate between the permutation-based and the Dawid-Skene models, and is uniformly consistent over the entire permutation-based model class. In contrast, the guarantees for estimators available in prior literature are sub-optimal over the original Dawid-Skene model.

Via

Access Paper or Ask Questions

Simple, Robust and Optimal Ranking from Pairwise Comparisons

Apr 27, 2016

Nihar B. Shah, Martin J. Wainwright

Figure 1 for Simple, Robust and Optimal Ranking from Pairwise Comparisons

Figure 2 for Simple, Robust and Optimal Ranking from Pairwise Comparisons

Abstract:We consider data in the form of pairwise comparisons of n items, with the goal of precisely identifying the top k items for some value of k < n, or alternatively, recovering a ranking of all the items. We analyze the Copeland counting algorithm that ranks the items in order of the number of pairwise comparisons won, and show it has three attractive features: (a) its computational efficiency leads to speed-ups of several orders of magnitude in computation time as compared to prior work; (b) it is robust in that theoretical guarantees impose no conditions on the underlying matrix of pairwise-comparison probabilities, in contrast to some prior work that applies only to the BTL parametric model; and (c) it is an optimal method up to constant factors, meaning that it achieves the information-theoretic limits for recovering the top k-subset. We extend our results to obtain sharp guarantees for approximate recovery under the Hamming distortion metric, and more generally, to any arbitrary error requirement that satisfies a simple and natural monotonicity condition.

* Changes in version 2: In addition to recovery in the exact and Hamming metrics, v2 analyzes a general, abstract recovery criterion based on a notion of "allowed sets"

Via

Access Paper or Ask Questions

Feeling the Bern: Adaptive Estimators for Bernoulli Probabilities of Pairwise Comparisons

Mar 22, 2016

Nihar B. Shah, Sivaraman Balakrishnan, Martin J. Wainwright

Figure 1 for Feeling the Bern: Adaptive Estimators for Bernoulli Probabilities of Pairwise Comparisons

Abstract:We study methods for aggregating pairwise comparison data in order to estimate outcome probabilities for future comparisons among a collection of n items. Working within a flexible framework that imposes only a form of strong stochastic transitivity (SST), we introduce an adaptivity index defined by the indifference sets of the pairwise comparison probabilities. In addition to measuring the usual worst-case risk of an estimator, this adaptivity index also captures the extent to which the estimator adapts to instance-specific difficulty relative to an oracle estimator. We prove three main results that involve this adaptivity index and different algorithms. First, we propose a three-step estimator termed Count-Randomize-Least squares (CRL), and show that it has adaptivity index upper bounded as $\sqrt{n}$ up to logarithmic factors. We then show that that conditional on the hardness of planted clique, no computationally efficient estimator can achieve an adaptivity index smaller than $\sqrt{n}$. Second, we show that a regularized least squares estimator can achieve a poly-logarithmic adaptivity index, thereby demonstrating a $\sqrt{n}$-gap between optimal and computationally achievable adaptivity. Finally, we prove that the standard least squares estimator, which is known to be optimally adaptive in several closely related problems, fails to adapt in the context of estimating pairwise probabilities.

Via

Access Paper or Ask Questions

Parametric Prediction from Parametric Agents

Feb 24, 2016

Yuan Luo, Nihar B. Shah, Jianwei Huang, Jean Walrand

Figure 1 for Parametric Prediction from Parametric Agents

Figure 2 for Parametric Prediction from Parametric Agents

Figure 3 for Parametric Prediction from Parametric Agents

Figure 4 for Parametric Prediction from Parametric Agents

Abstract:We consider a problem of prediction based on opinions elicited from heterogeneous rational agents with private information. Making an accurate prediction with a minimal cost requires a joint design of the incentive mechanism and the prediction algorithm. Such a problem lies at the nexus of statistical learning theory and game theory, and arises in many domains such as consumer surveys and mobile crowdsourcing. In order to elicit heterogeneous agents' private information and incentivize agents with different capabilities to act in the principal's best interest, we design an optimal joint incentive mechanism and prediction algorithm called COPE (COst and Prediction Elicitation), the analysis of which offers several valuable engineering insights. First, when the costs incurred by the agents are linear in the exerted effort, COPE corresponds to a "crowd contending" mechanism, where the principal only employs the agent with the highest capability. Second, when the costs are quadratic, COPE corresponds to a "crowd-sourcing" mechanism that employs multiple agents with different capabilities at the same time. Numerical simulations show that COPE improves the principal's profit and the network profit significantly (larger than 30% in our simulations), comparing to those mechanisms that assume all agents have equal capabilities.

Via

Access Paper or Ask Questions

Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing

Dec 16, 2015

Nihar B. Shah, Dengyong Zhou

Figure 1 for Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing

Figure 2 for Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing

Figure 3 for Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing

Abstract:Crowdsourcing has gained immense popularity in machine learning applications for obtaining large amounts of labeled data. Crowdsourcing is cheap and fast, but suffers from the problem of low-quality data. To address this fundamental challenge in crowdsourcing, we propose a simple payment mechanism to incentivize workers to answer only the questions that they are sure of and skip the rest. We show that surprisingly, under a mild and natural "no-free-lunch" requirement, this mechanism is the one and only incentive-compatible payment mechanism possible. We also show that among all possible incentive-compatible mechanisms (that may or may not satisfy no-free-lunch), our mechanism makes the smallest possible payment to spammers. We further extend our results to a more general setting in which workers are required to provide a quantized confidence for each question. Interestingly, this unique mechanism takes a "multiplicative" form. The simplicity of the mechanism is an added benefit. In preliminary experiments involving over 900 worker-task pairs, we observe a significant drop in the error rates under this unique mechanism for the same or lower monetary expenditure.

Via

Access Paper or Ask Questions

Approval Voting and Incentives in Crowdsourcing

Sep 07, 2015

Nihar B. Shah, Dengyong Zhou, Yuval Peres

Figure 1 for Approval Voting and Incentives in Crowdsourcing

Figure 2 for Approval Voting and Incentives in Crowdsourcing

Figure 3 for Approval Voting and Incentives in Crowdsourcing

Abstract:The growing need for labeled training data has made crowdsourcing an important part of machine learning. The quality of crowdsourced labels is, however, adversely affected by three factors: (1) the workers are not experts; (2) the incentives of the workers are not aligned with those of the requesters; and (3) the interface does not allow workers to convey their knowledge accurately, by forcing them to make a single choice among a set of options. In this paper, we address these issues by introducing approval voting to utilize the expertise of workers who have partial knowledge of the true answer, and coupling it with a ("strictly proper") incentive-compatible compensation mechanism. We show rigorous theoretical guarantees of optimality of our mechanism together with a simple axiomatic characterization. We also conduct preliminary empirical studies on Amazon Mechanical Turk which validate our approach.

Via

Access Paper or Ask Questions