Praneeth Karimireddy
Assistant Professor of Computer Science and Electrical and Computer Engineering
Education
- Doctoral Degree, École Polytechnique Fédérale de Lausanne
- Master's Degree, Indian Institute of Technology-Delhi
- Bachelor's Degree, Indian Institute of Technology-Delhi
Biography
Sai Praneeth Karimireddy is an Assistant Professor in the Thomas Lord Department of Computer Science at USC. Before this, he was a postdoc at UC Berkeley with Michael Jordan and did his PhD among the picturesque mountains of EPFL with Martin Jaggi. He also co-leads the Data Quality and Federated Learning for Health working group with Doctors Without Borders, the Cancer Registry of Norway) and by industries such as Meta, Google, Owkin, and OpenAI. It has also been recognized by numerous awards such as the Patrick Denantes Memorial Prize for the best thesis in computer science, the Chorafas Foundation Award for exceptional applied research, and multiple best paper awards.Research Summary
My research studies data challenges in machine learning. Data is the most important factor determining the quality of an ML system. However, we understand very little about what makes data good or bad. Further, the most valuable data (e.g. health records) are either extremely expensive to collect and inaccesible. I use theory from optimization, statistics, and economics to answer these issues and build data infrastructure. Topics I am currently thinking about:- Large-scale Private & Federated optimization. Medical data is subject to strict privacy regulations. How can we privately train ML models on data distributed across multiple hospitals without the data leaving the hospitals? How can we share common information across hospitals while personalizing models to the unique aspects of each one? This uses tools like Federated Learning, Differential Privacy, and optimization for large-scale machine learning.
- Data Valuation and Data Markets. The data commons that current AI relies on is disappearing. In order to build a sustainable data-ecosystem, people need to be compensated for their data. But, how much is a specific data point worth?. This questions requires understanding i) how data affects uncertainity in a ML model, and ii) the relative importance of datapoints
- Trustworhty AI for Health. Healthcare comes with tons of data challenges on top of privacy concerns: the data may be highly heterogenous, and have missing features. Further, because of the high-stakes involved, fairness and equity, reliable uncertainity quantification, and interepretable predictions are all extremely important.
Awards
- 2022 SNSF Mobility Fellowship
- 2022 Patrick Denantes Memorial Trust Patrick Denantes Memorial Best Thesis Prize
- 2022 EPFL Thesis Distinction Award
- 2021 Dimitris N. Chorafas Foundation Chorafas Foundation Prize
Appointments
- Thomas Lord Department of Computer Science
- Ming Hsieh Department of Electrical and Computer Engineering
- SAL 327
- Henry Salvatori Computer Science Center
- 941 Bloom Walk, Los Angeles, CA 90089
- karimire@usc.edu