Microsoft Research
Machine Learning
We consider optimizing the coalition structure in Coalitional Skill Games (CSGs), a succinct representation of coalitional games . In CSGs, the value of a coalition depends on the tasks its members can achieve. The tasks require various... more
Modeling and experimental investigation of B equilibrium diffusivity and its activation in Si in the presence of other species, including ab initio calculations, are presented here. The results suggest that incorporating other species... more
In this paper we extend the class of energy functions for which the optimal α-expansion and αβswap moves can be computed in polynomial time. Specifically, we introduce a novel family of higher order clique potentials and show that the... more
We study the problem of Robust Least Squares Regression (RLSR) where several response variables can be adversarially corrupted. More specifically, for a data matrix X ∈ R p×n and an underlying model w * , the response vector is generated... more
- by Prateek Jain
Abstract: This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called {\ em forward regret} that intuitively measures how good an online... more
- by Prateek Jain
We consider the problem of retrieving the database points nearest to a given hyperplane query without exhaustively scanning the database. We propose two hashingbased solutions. Our first approach maps the data to two-bit binary keys that... more
Abstract: We consider the problem of classification using similarity/distance functions over data. Specifically, we propose a framework for defining the goodness of a (dis) similarity function with respect to a given learning task and... more
Abstract: We address the problem of general supervised learning when data can only be accessed through an (indefinite) similarity function between data points. Existing work on learning with indefinite kernels has concentrated solely on... more
- by Prateek Jain
Metric learning algorithms can provide useful distance functions for a variety of domains, and recent work has shown good accuracy for problems where the learner can access all distance constraints at once. However, in many real... more
Minimizing the rank of a matrix subject to affine constraints is a fundamental problem with many important applications in machine learning and statistics. In this paper we propose a simple and fast algorithm SVP (Singular Value... more
In this paper we consider the problem of semi-supervised kernel function learning. We first propose a general regularized framework for learning a kernel matrix, and then demonstrate an equivalence between our proposed kernel matrix... more
Metric and kernel learning are important in several machine learning applications. However, most existing metric learning algorithms are limited to learning metrics over low-dimensional data, while existing kernel learning algorithms are... more
Anaphora resolution for dialogues is a difficult problem because of the several kinds of com-plex anaphoric references generally present in dialogic discourses. It is nevertheless a criti-cal first step in the processing of any such... more
Abstract: In this paper, we consider the problem of preserving privacy in the online learning setting. We study the problem in the online convex programming (OCP) framework---a popular online learning setting with several interesting... more