May 24, 2019

The rise of privacy preserving AI

Illustration: Sarah Grillo/Axios

Data is AI's jet fuel — amassing as much as possible allows tech companies to precisely target ads, or medical AI to differentiate between a benign tumor and a malignant one.

The state of play: No problem for Facebook and its gobs of data — but hard for a small clinic with few patients to learn from. Now, new AI methods are allowing companies to benefit from the collective wisdom of peers and competitors, without giving up sensitive data or trade secrets.

Why it matters: This could help improve health care, among the country’s most stubborn problems, by clearing a key hurdle for medical AI — gathering a big and diverse enough dataset to help doctors diagnose difficult problems or choose better treatments.

  • Confined to each individual company's own data, AI systems don't have access to the staggering range of examples they need to outperform humans.
  • The main recourse has been to send information, assiduously scrubbed of private details, to some central hub to be pooled for study — a slow, laborious process.

What's happening: Privacy-preserving AI techniques like federated learning are powering new systems that can benefit from multiple companies' data — without even having to know what the data is.

  • Google showed off in 2017 how federated learning helped its Android keyboards learn new words, based on lessons gleaned from its enormous user base.
  • More recently, companies have applied the techniques to new industries, allowing sectors with privacy responsibilities to exploit the strength in numbers that other, less regulated industries can marshal.

Perhaps the most obvious application for federated learning is in health care, where strict rules prevent sharing patient data — but the benefit of gathering lots is potentially very high.

  • Owkin, a French startup, has connected more than 30 hospitals and research centers to a system that learns from all of them, in the process rewarding the hospitals that contribute the best data.
  • Each institution's data stays on its own computers, rather than being sent elsewhere for processing.
  • "We can have different hospitals collaborate while being competitive on their research," Anna Huyghues-Despointes, Owkin's director of strategy, tells Axios.

VIA, a Boston-area AI startup, uses federated learning to pool data about the condition of power transformers, such that a utility in Europe can learn from one in Thailand or New Zealand.

  • For power companies, predicting the next catastrophic equipment failure could require data on 1,000 previous problems — but any one company only sees one or two a year, says Colin Gounden, VIA's co-founder.
  • Get a few dozen utilities to team up and those 1,000 examples are within reach. Security concerns prevent them from just doling out information about their transformers, but several have already joined VIA's pilot federated learning system.

What's next: Intel is working on methods that will allow companies to apply an AI model to data without even decrypting it, which would open new doors to cooperation even among the most privacy-conscious companies and industries.

The bottom line: Sharing is just one way to solve one of the biggest problems still ahead of AI — figuring out how to slake computers' unending thirst for data. Researchers are also experimenting with bolstering small datasets with synthetic training data, or creating algorithms that can learn from far fewer examples.

Go deeper

Trump: "This is going to be a very painful two weeks"

President Trump said at a press briefing on Tuesday that the next two weeks in the U.S. will be "very painful" and that he wants "every American to be prepared for the days that lie ahead," before giving way to Deborah Birx to explain the models informing the White House's new guidance on the coronavirus.

Why it matters: It's a somber new tone from the president that comes after his medical advisers showed him data projecting that the virus could kill 100,000–240,000 Americans — even with strict social distancing guidelines in place.

Go deeperArrow19 mins ago - Health

Coronavirus dashboard

Illustration: Sarah Grillo/Axios

  1. Global: Total confirmed cases as of 6 p.m. ET: 850,583 — Total deaths: 41,654 — Total recoveries: 176,714.
  2. U.S.: Leads the world in confirmed cases. Total confirmed cases as of 6 p.m. ET: 184,183 — Total deaths: 3,721 — Total recoveries: 6,043.
  3. Business updates: Should you pay your rent or mortgage during the coronavirus pandemic? Find out if you are protected under the CARES Act.
  4. Public health updates: More than 400 long-term care facilities across the U.S. report patients with coronavirus — Older adults and people with underlying health conditions are more at risk, new data shows.
  5. Federal government latest: The White House and other institutions are observing several models to better understand and prepare cities for when the coronavirus is expected to peak in the U.S.
  6. U.S.S. Theodore Roosevelt: Captain of nuclear aircraft carrier docked in Guam pleaded with the U.S. Navy for more resources after more than 100 members of his crew tested positive.
  7. What should I do? Answers about the virus from Axios expertsWhat to know about social distancingQ&A: Minimizing your coronavirus risk.
  8. Other resources: CDC on how to avoid the virus, what to do if you get it.

Subscribe to Mike Allen's Axios AM to follow our coronavirus coverage each morning from your inbox.

Paying rent in a pandemic

Illustration: Aïda Amer/Axios

For many people who've lost jobs or income because of the coronavirus pandemic, tomorrow presents a stressful decision: Do you pay your rent or mortgage?

Why it matters: The new CARES Act that was signed by President Trump on Friday protects homeowners and renters who are suffering from the response to the coronavirus pandemic — but it's not “a one-size-fits-all policy rulebook,” a congressional aide tells Axios.

Go deeperArrow2 hours ago - Health