Cross-Device Identity Resolution using Machine Learning: A Scalable Device Graph Approach

Authors

  • Manasa Priya Koduri
  • Pei Xuan Lim Mediacorp Pte Ltd
  • Zhen Li
  • Sandeep Kumar
  • Muhammad Aamir Saleem
  • Roger Moy

DOI:

https://doi.org/10.32473/flairs.v34i1.128625

Keywords:

Machine Learning, Device Graph

Abstract

This paper outlines how Mediacorp, Singapore's public service broadcaster, addresses its cross-device identity challenges using a scalable device graph approach. Research in this area is relevant to the domain of advertising technology as it enables a holistic view of consumers that can be extended to use cases such as improving advertisement targeting, personalized recommendations and demographic predictions. Past research efforts were limited to high-level descriptions of the steps undertaken to create a one-off, static device graph based on data collected over a circumscribed time frame, thus limiting its use in larger-scale commercial applications. In this paper, we propose a scalable solution that enables continuous, incremental revisions of our device graph. We leverage behavioral data captured by Mediacorp across its sites and platforms to build a richer device graph that is updated weekly. First, we introduce additional features and explore various classifiers to improve pairwise probability scores between devices that are likely to belong to the same user. Then, we apply community clustering algorithms to uncover device communities to establish the final device graph. Extensive experiments showed that our additive approach has consistently delivered >90% precision and recall in real-world applications.

Downloads

Published

2021-04-18

How to Cite

Koduri, M. P., Lim, P. X., Li, Z., Kumar, S., Saleem, M. A., & Moy, R. (2021). Cross-Device Identity Resolution using Machine Learning: A Scalable Device Graph Approach. The International FLAIRS Conference Proceedings, 34. https://doi.org/10.32473/flairs.v34i1.128625

Issue

Section

Main Track Proceedings