
In the most exclusive AI conferences and journals, AI systems are judged largely on their accuracy: How well do they stack up against human-level translation or vision or speech?
Yes, but: In the messy real world, even the most accurate programs can stumble and break. Considerations that matter little in the lab, like reliability or computing and environmental costs, are huge hurdles for businesses.
Why it matters: Some stumbles that have tarnished high-profile AI systems — like facial recognition that fails more often on darker-skinned faces, or medical AI that gives potentially harmful advice — have resulted from real-world scenarios their builders didn't anticipate.
The big picture: Research often guns for incremental advances, squeezing an extra percentage point of accuracy out of a previously proposed model.
- "We feel like this is the only thing that gets rewarded nowadays in the community," says Roy Schwartz, a researcher at the Allen Institute for Artificial Intelligence. "We want to argue that there are other things to consider."
- In a paper posted this summer, Schwartz and several co-authors proposed that researchers present not just the accuracy of their models but also the computing cost it took to get there — and the resulting environmental toll.
That's one of several hidden considerations that can drive up the cost of creating an AI model that works in the real world. Among the other important factors accuracy doesn't capture:
- The cost of labeling training data, which is still mostly done by humans.
- The reliability of a system or, conversely, its tendency to make critical mistakes when it sees a new situation it hasn't been trained to deal with.
- Vulnerability to adversarial examples, a special type of attack that can cripple certain AI systems, potentially making an autonomous car blind to a fast-approaching stop sign.
- Bias, or whether a model's accuracy varies depending on the kind of person or thing it is evaluating.
- Interpretability of a model's results, or how easy it is to understand why an AI system made a particular prediction.
"The machine learning model is just a tiny piece of the machine learning product," says Andrew Ng, founder of Landing.ai, a startup that helps companies set up AI processes.
- "One of the challenges is that you ship a system and then the world changes in some weird and unpredictable way," says Ng, who previously started Google's and Baidu's AI operations.
- "Anyone can download code from GitHub, but that's not the point," Ng tells Axios. "You need all these other things."
One byproduct of the hidden costs of increasingly accurate AI systems: they can make it hard for a new startup or a cash-strapped university lab to compete.
- "The AI models that are being designed and developed don't always contemplate the application of the model," says Josh Elliot, director of AI at Booz Allen Hamilton.
- Big companies offer pre-packaged AI programs — like one for detecting street signs, say — but users often lack the computing firepower required to tweak them.
- That means the reigning players get to decide which problems are worth trying to solve with AI, Schwartz says, and everyone else has to go along.
What's next: Schwartz, Ng and others propose putting more effort into solving problems efficiently, rather than chasing slightly more accurate models at enormous cost.
Go deeper: How not to replace humans