N-Body Algorithms for Mobile and Social Data

Description

Today’s fast-growing mobile device and online social media infrastructure is producing an unprecedented volume of data on communication, system and device-level interaction, and user behavior. Examples include cellphone data (including voice calls, SMS and Internet data, cellular signal strength) and activity on online social media (itself often sent from mobile devices). This presents both opportunities and challenges. On one hand, the increasing availability, high quality, and relatively easy collection of such data provides a way to better understand online human activity, infer activity patterns, and design new services in urban environments. On the other hand, the combination of big data and powerful inferential techniques pose both computational challenges and privacy concerns. In both cases, there is a need for efficient algorithmic techniques that can support learning from mobile and social data sets, and that enable principled creation of synthetic data at similar scale for modeling, testing and anonymization purposes.

Our project is based on the core insight that many problems in the above space can be framed in terms of pairwise interactions among spatially embedded entities,  traditionally the domain of N-body problems  in the physical sciences. As a consequence, revisiting and adapting N-body algorithms specifically to mobile and social data  analysis and learning  can increase our capacity to (i) work with such data at scale and (ii) do so in a privacy-preserving way.  We investigate novel N-body and parallel algorithms, specifically designed for processing hierarchical, and geospatially embedded, mobile and social data, where the size or access to datasets is prohibitive. In addition to the algorithmic design, we develop software modules (e.g.,  special purpose compilers, crowdsourcing tools and generators of synthetic datasets) that implement these methods.

Application domains include: cellular network monitoring, mobile data privacy, social network modeling, statistical network analysis, web analysis, scalability and performance analysis of mobile devices.


Team

Past Members

  • Emmanouil Alimpertis, PhD in Networked Systems, June 2020; currently with Apple, Cupertino, CA.
  •  Justin Ley, undergraduate student; currently with Google.
  • Loring Tomas, PhD student in Sociology.

Publications

Software & DataSets


Acknowledgements: This material is based upon work supported by the National Science  Foundation under Grant No. 1939237: EAGER: N-Body Algorithms for Mobile and Social Data, Sept.1, 2019-Aug. 31, 2021 (or extended to Aug. 2022).

Disclaimer: Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Point of Contact: please contact the PI, A. Markopoulou.


Last Updated: Aug. 12th, 2021