Trustworthy AI: A Conversation with NIST’s Chuck Romine

National Institute of Standards and Technology

6 min readJan 21, 2020

Artificial Intelligence (AI) promises to grow the economy and improve our lives, but with these benefits, it also brings new risks that society is grappling with. How can we be sure this new technology is not just innovative and helpful, but also trustworthy, unbiased, and resilient in the face of attack? We sat down with NIST Information Technology Lab Director Chuck Romine to learn how measurement science can help provide answers.

How would you define artificial intelligence? How is it different from regular computing?

One of the challenges with defining artificial intelligence is that if you put 10 people in a room, you get 11 different definitions. It’s a moving target. We haven’t converged yet on exactly what the definition is, but I think NIST can play an important role here. What we can’t do, and what we never do, is go off in a room and think deep thoughts and say we have the definition. We engage the community.

That said, we’re using a narrow working definition specifically for the satisfaction of the Executive Order on Maintaining American Leadership in Artificial Intelligence, which makes us responsible for providing guidance to the federal government on how it should engage in the standards arena for AI. We acknowledge that there are multiple definitions out there, but from our perspective, an AI system is one that exhibits reasoning and performs some sort of automated decision-making without the interference of a human.

There’s a lot of talk at NIST about “trustworthy” AI. What is trustworthy AI? Why do we need AI systems to be trustworthy?

AI systems will need to exhibit characteristics like resilience, security and privacy if they’re going to be useful and people can adopt them without fear. That’s what we mean by trustworthy. Our aim is to help ensure these desirable characteristics. We want systems that are capable of either combating cybersecurity attacks, or, perhaps more importantly, at least recognizing when they are being attacked. We need to protect people’s privacy. If systems are going to operate in life-or-death type of environments, whether it’s in medicine or transportation, people need to be able to trust AI will make the right decisions and not jeopardize their health or well-being.

Resilience is important. An artificial intelligence system needs to be able to fail gracefully. For example, let’s say you train an artificial intelligence system to operate in a certain environment. Well, what if the system is taken out of its comfort zone, so to speak? One very real possibility is catastrophic failure. That’s clearly not desirable, especially if you have the AI deployed in systems that operate critical infrastructure or our transportation systems. So, if the AI is outside of the boundaries of its nominal operating environment, can it fail in such a way that it doesn’t cause a disaster, and can it recover from that in a way that allows it to continue to operate? These are the characteristics that we’re looking for in a trustworthy artificial intelligence system.

NIST is supposed to be helping industry before they even know they needed us to. What are we thinking about in this area that is beyond the present state of development of AI?

Industry has a remarkable ability to innovate and to provide new capabilities that people don’t even realize that they need or want. And they’re doing that now in the AI consumer space. What they don’t often do is to combine that push to market with deep thought about how to measure characteristics that are going to be important in the future. And we’re talking about, again, privacy, security and resilience … trustworthiness. Those things are critically important, but many companies that are developing and marketing new AI capabilities and products may not have taken those characteristics into consideration. Ultimately, I think there’s a risk of a consumer backlash where people may start saying these things are too easy to compromise and they’re betraying too much of my personal information, so get them out of my house.

What we can do to help, and the reason that we’ve prioritized trustworthy AI, is we can provide that foundational work that people in the consumer space need to manage those risks overall. And I think that the drumbeat for that will get increasingly louder as AI systems begin to be marketed for more than entertainment. Especially at the point when they start to operate critical infrastructure, we’re going to need a little more assurance.

That’s where NIST can come together with industry to think about those things, and we’ve already had some conversations with industry about what trustworthy AI means and how we can get there.

I’m often asked, how is it even possible to influence a trillion-dollar, multitrillion-dollar industry on a budget of $150 million? And the answer is, if we were sitting in our offices doing our own work independent of industry, we would never be able to. But that’s not what we do. We can work in partnership with industry, and we do that routinely. And they trust us, they’re thrilled when we show up, and they’re eager to work with us.

AI is a scary idea for some people. They’ve seen “I, Robot,” or “The Matrix,” or “The Terminator.” What would you say to help them allay these fears?

I think some of this has been overhyped. At the same time, I think it’s important to acknowledge that risks are there, and that they can be pretty high if they’re not managed ahead of time. For the foreseeable future, however, these systems are going to be too fragile and too dependent on us to worry about them taking over. I think the biggest revolution is not AI taking over, but AI augmenting human intelligence.

We’re seeing examples of that now, for instance, in the area of face recognition. The algorithms for face recognition have improved at an astonishing rate over the last seven years. We’re now at the point where, under controlled circumstances, the best artificial intelligence algorithms perform on par with the best human face recognizers. A fascinating thing we learned recently, and published in a report, is that if you take two trained human face recognizers and put them together, the dual system doesn’t perform appreciably better than either one of them alone. If you take two top-performing algorithms, the combination of the two doesn’t really perform much better than either one of them alone. But if you put the best algorithm together with a trained recognizer, that system performs substantially better than either one of them alone. So, I think, human augmentation by AI is going to be the revolution.

What’s next?

I think one of the things that is going to be necessary for us is pulling out the desirable characteristics like usability, interoperability, resilience, security, privacy and all the things that will require a certain amount of care to build into the systems, and get innovators to start incorporating them. Guidance and standards can help to do that.

Last year, we published our plan for how the federal government should engage in the AI standards development process. I think there’s general agreement that guidance will be needed for interoperability, security, reliability, robustness, these characteristics that we want AI systems to exhibit if they’re going to be trusted.

This post originally appeared on Taking Measure, the official blog of the National Institute of Standards and Technology (NIST) on January 21, 2020.

To make sure you never miss our blog posts or other news from NIST, sign up for our email alerts.

About the Author

Charles Romine is director of the Information Technology Laboratory (ITL). ITL is one of six research laboratories within the National Institute of Standards and Technology (NIST) with an annual budget of $120 million, nearly 400 employees, and about 200 guest researchers from industry, universities and foreign laboratories.

Romine oversees a research program that cultivates trust in information technology and metrology by developing and disseminating standards, measurements, and testing for interoperability, security, usability, and reliability of information systems.