Trustworthy AI

Our trust in technology relies on understanding how it works. We need to understand why AI makes the decisions it does. We're developing tools to make AI more explainable, fair, robust, private, and transparent.

Explore our topics Explore our topics

Overview

Artificial intelligence systems have become increasingly prevalent in everyday life and enterprise settings, and they’re now often being used to support human decision-making. These systems have grown increasingly complex and efficient, and AI holds the promise of uncovering valuable insights across a wide range of applications. But broad adoption of AI systems will require humans to trust their output.

When people understand how technology works, and we can assess that it’s safe and reliable, we’re far more inclined to trust it. Many AI systems to date have been black boxes, where data is fed in and results come out. To trust a decision made by an algorithm, we need to know that it is fair, that it’s reliable and can be accounted for, and that it will cause no harm. We need assurances that AI cannot be tampered with and that the system itself is secure. We need to be able to look inside AI systems, to understand the rationale behind the algorithmic outcome, and even ask it questions as to how it came to its decision.

At IBM Research, we’re working on a range of approaches to ensure that AI systems built in the future are fair, robust, explainable, account, and align with the values of the society they’re designed for. We’re ensuring that in the future, AI applications are as fair as they are efficient across their entire lifecycle.

Our work

What is red teaming for generative AI?
Explainer
Kim Martineau
11 Apr 2024
- Adversarial Robustness and Privacy
- AI
- AI Testing
- Fairness, Accountability, Transparency
- Foundation Models
- Natural Language Processing
- Security
- Trustworthy AI
An AI model trained on data that looks real but won’t leak personal information
Research
Kim Martineau
12 Dec 2023
- AI
- AI Privacy
- Data and AI Security
- Finance
- Foundation Models
The latest AI safety method is a throwback to our maritime past
Research
Kim Martineau
16 Nov 2023
- AI
- AI Transparency
- Explainable AI
- Fairness, Accountability, Transparency
- Generative AI
What is AI alignment?
Explainer
Kim Martineau
08 Nov 2023
- AI
- Automated AI
- Fairness, Accountability, Transparency
- Foundation Models
- Natural Language Processing
Find and fix IT glitches before they crash the system
News
Kim Martineau
28 Sep 2023
- AI for Code
- AI for IT
- Explainable AI
- Foundation Models
- Generative AI
An open-source toolkit for debugging AI models of all data types
Technical note
Kevin Eykholt and Taesung Lee
08 Sep 2023
- Adversarial Robustness and Privacy
- AI Testing
- Data and AI Security
See more of our work on Trustworthy AI

Topics

Science for Social Good

IBM Science for Social Good partners IBM Research scientists and engineers with academic fellows, subject matter experts from NGOs, public sector agencies, and social enterprises to tackle emerging societal challenges using science and technology.

Explore the initiative

Publications

CUI@CHI 2024: Building Trust in CUIs-From Design to Deployment
- - Smit Desai
  - Christina Wei
  - et al.
- 2024
- CHI 2024
Unraveling the Key Components of OOD Generalization via Diversification
- - Harold Benoit
  - Liangze Jiang
  - et al.
- 2024
- ICLR 2024
Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models?
- - Yu-Lin Tsai
  - Chia-yi Hsu
  - et al.
- 2024
- ICLR 2024
The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models
- - Yan Liu
  - Yu Liu
  - et al.
- 2024
- ICLR 2024
Uncertainty Quantification via Stable Distribution Propagation
- - Felix Petersen
  - Aashwin Mishra
  - et al.
- 2024
- ICLR 2024
An Investigation of Representation and Allocation Harms in Contrastive Learning
- - Subha Maity
  - Mayank Agarwal
  - et al.
- 2024
- ICLR 2024

View all publications

Building trustworthy AI with Watson

Our research is regularly integrated into Watson solutions to make IBM’s AI for business more transparent, explainable, robust, private, and fair.

Learn more