Trustworthy Machine Learning: From Data to Models

Bo Han; Jiangchao Yao; Tongliang Liu; Bo Li; Sanmi Koyejo; Feng Liu

doi:10.1561/3300000043

Foundations and Trends® in Privacy and Security > Vol 7 > Issue 2-3

Trustworthy Machine Learning: From Data to Models

By Bo Han, Hong Kong Baptist University, Hong Kong and RIKEN, Japan, bhanml@comp.hkbu.edu.hk | Jiangchao Yao, Shanghai Jiao Tong University, China, sunarker@sjtu.edu.cn | Tongliang Liu, University of Sydney, Australia and MBZUAI, UAE and RIKEN, Japan, tongliang.liu@sydney.edu.au | Bo Li, University of Illinois Urbana-Champaign, USA, lbo@illinois.edu | Sanmi Koyejo, Stanford University, USA, sanmi@cs.stanford.edu | Feng Liu, University of Melbourne, Australia, feng.liu1@unimelb.edu.au

Suggested Citation

Bo Han, Jiangchao Yao, Tongliang Liu, Bo Li, Sanmi Koyejo and Feng Liu (2025), "Trustworthy Machine Learning: From Data to Models", Foundations and Trends® in Privacy and Security: Vol. 7: No. 2-3, pp 74-246. http://dx.doi.org/10.1561/3300000043

Publication Date: 29 Apr 2025

Subjects

Deep learning, Robustness, Statistical learning theory, Artificial intelligence methods in security and privacy, Big data analytics and privacy

Journal details

Download article

In this article:

Abstract

The success of machine learning algorithms relies not only on achieving good performance but also on ensuring trustworthiness across diverse applications and scenarios. Trustworthy machine learning seeks to handle critical problems in addressing the issues of robustness, privacy, security, reliability, and other desirable properties. The broad research area has achieved remarkable advancement and brings various emerging topics along with the progress. We present this survey to provide a systematic overview of the research problems under trustworthy machine learning covering the perspectives from data to model. Starting with fundamental data-centric learning, the survey reviews learning with noisy data, long-tailed distribution, out-of-distribution data, and adversarial examples to achieve robustness. Delving into private and secured learning, the survey elaborates on core methodologies differential privacy, different attacking threats, and learning paradigms, to realize privacy protection and enhance security. Finally, it introduces several trendy issues related to the foundation models, including jailbreak prompts, watermarking, and hallucination, as well as causal learning and reasoning. The survey integrates commonly isolated research problems in a unified manner, which provides general problem setups, detailed sub-directions, and further discussion on its challenges or future developments. We hope the comprehensive investigation presented in this survey can serve as a clear introduction for the problem evolution from data to models and also bring new insight for developing trustworthy machine learning.

DOI:10.1561/3300000043

Book details

ISBN: 978-1-63828-549-6

182 pp. $320.00

Buy E-book (.pdf)

Table of contents:

1. Introduction

2. Trustworthy Data-centric Learning

3. Trustworthy Private and Secured Learning

4. Trustworthy Foundation Models

5. Conclusion

References

Trustworthy Machine Learning: From Data to Models

The success of machine learning algorithms relies not only on achieving good performance but also on ensuring trustworthiness across diverse applications and scenarios. Trustworthy machine learning seeks to handle critical problems in addressing the issues of robustness, privacy, security, reliability, and other desirable properties. The broad research area has achieved remarkable advancement and brings various emerging topics along with the progress. This monograph provides a systematic overview of the research problems under trustworthy machine learning, covering the perspectives from data to model. Starting with fundamental data-centric learning, this work reviews learning with noisy data, long-tailed distribution, out-of-distribution data, and adversarial examples to achieve robustness.

Delving into private and secured learning, the monograph elaborates on core methodologies such as differential privacy, different attacking threats, and learning paradigms, to realize privacy protection and enhance security. Finally, it introduces several trendy issues related to the foundation models, including jailbreak prompts, watermarking, and hallucination, as well as causal learning and reasoning. This work integrates commonly isolated research problems in a unified manner, which provides general problem setups, detailed sub-directions, and further discussion on its challenges or future developments. The comprehensive investigation presented in this work can serve as a clear introduction for the problem evolution from data to models, and also bring new insight for developing trustworthy machine learning.

Trustworthy Machine Learning: From Data to Models

Free Preview:

Share

Journal details

Abstract

Book details

Trustworthy Machine Learning: From Data to Models