Plato: A Selective Context Model for Entity Resolution

Nevena Lazic; Amarnag Subramanya; Michael Ringgaard; Fernando Pereira

Vol. 3 (2015)

TACL approved

Published 2015-10-04

Nevena Lazic
Google Research, Google Inc.

Amarnag Subramanya
Google Research, Google Inc.

Michael Ringgaard
Google Research, Google Inc.

Fernando Pereira
Google Research, Google Inc.

Abstract

We present Plato, a probabilistic model for entity resolution that includes a novel approach for handling noisy or uninformative features, and supplements labeled training data derived from Wikipedia with a very large unlabeled text corpus. Training and inference in the proposed model can easily be distributed across many servers, allowing it to scale to over 10^7 entities. We evaluate Plato on three standard datasets for entity resolution. Our approach achieves the best results to date on TAC KBP 2011 and is highly competitive on both the CoNLL 2003 and TAC KBP 2012 datasets.

PDF (presented at ACL 2016)