Axios AI+

July 08, 2024
I had a great time with the family last week at Legoland (where else?). Hope you had a good Fourth as well.
Today's AI+ is 966 words, a 3.5-minute read.
1 big thing: Bot vs. bot forever
A new kind of turf war is breaking out on the web, with AI bots battling other AI bots to seize or defend stockpiles of the AI era's most valuable commodity: data.
The big picture: AI makers hungry for more data to train their language models are grabbing everything they can, while information owners are increasingly fighting fire with fire by turning to AI-powered tools to protect their intellectual property.
Driving the news: Cloudflare, the infrastructure and security firm used by 1 in 5 websites, introduced a new service last week that protects clients' content from poaching by data-harvesting bots.
- "We hear clearly that customers don't want AI bots visiting their websites, and especially those that do so dishonestly. To help, we've added a brand new one-click to block all AI bots," Cloudflare said in a blog post.
Catch up quick: Since the '90s, websites have used a simple file on their sites —called "robots.txt" — to declare whether their content was available for automated tools to read and copy.
- These files are like "no trespassing" signs that rely on robots to respect them. They have always been informal pacts, and are neither technically nor legally enforceable.
In the AI era, AI makers are using increasingly aggressive scraping tactics, seizing any web data that's "publicly available."
- Microsoft AI CEO Mustafa Suleyman recently told the Aspen Ideas Festival that all "open web" content is "freeware" available for the taking under fair use principles: "Anyone can copy it, recreate with it, reproduce with it," he said.
- Suleyman called content protected by robots.txt a "gray area ... that's going to work its way through the courts."
Content owners and media publishers sharply disagree.
- But they've been torn between their instinct to protect their intellectual property and their eagerness to take money from those AI makers, like OpenAI, that have been opening their checkbooks to secure data rights.
- Some copyright holders, like the New York Times, have challenged AI data use in court.
Between the lines: Cloudflare's "one-click" tool goes further than robots.txt and actively blocks AI bots.
- Ironically, Cloudflare's tool depends on its own machine learning model to "fingerprint" which bots are working for the AI data stockpilers.
- The company says its model is smart enough to adapt fast when AI makers rename their bots or mask their identities.
- "We're proud to say that our global machine learning model has always recognized this activity as a bot, even when operators lie about their user agent," Cloudflare said in its post.
Zoom out: The bot versus bot free-for-all over access to web data is just the opening salvo of what will be an increasingly hot war in every realm, as AI fuels new attack strategies and defense tactics.
- You can already see it in social media content moderation, where AI-managed rule sets are being applied to identify and manage new floods of AI-generated posts.
- The same dynamic is playing itself out in cybersecurity, where malicious hackers are using AI to probe and exploit vulnerabilities while defenders increasingly rely on AI-based pattern detection to flag anomalous behavior.
- In finance, AI algorithms are already trading against one another, pushing edge-case crises to extremes — and we haven't even begun to see the longer-term impact of "smart contract"-driven crypto trading.
- The future of medicine could pit super-bugs unleashed by AI-empowered biohackers against AI-produced antidotes raced to the field by white-hat biohackers.
- Arms-makers and the military are already preparing for robot-filled, drone-drenched battlefields where AI agents call the shots.
The bottom line: There's no escape from this bot vs. bot "technodialectic," as Andrew Leonard prophetically wrote in a 1996 Wired story.
- Leonard's dictum that bots "cause as many problems as they solve" should be front of mind for every AI CEO today.
2. AI floats all VC boats

AI is keeping the VC ecosystem afloat. It has accounted for more than 40% of new private U.S. "unicorns" so far this year, and over 60% of the increase in total venture-backed valuation, per PitchBook.
The big picture: There have been 13 new U.S. AI unicorns — private companies that have reached valuations of over $1 billion — so far this year.
- Elon Musk's xAI on its own added $24 billion in valuation, while all AI unicorn companies combined added $116 billion in aggregate value.
- Overall U.S. unicorn value has grown by $162 billion so far this year.
Follow the money: Over the past few years, AI accounted for about 15-17% of venture dollars invested in North America and Europe. That number increased to about 20% in 2023 and is rising even more this year, according to PitchBook senior emerging technology analyst Brendan Burke.
Yes, but: Even AI wasn't immune to the exuberance seen in 2021.
- "There's an entire universe of 40,000 companies that incorporated machine learning into their applications, and many that were funded during the bull market haven't achieved their goals," says Burke.
- "Those mega-deals from the late bull market haven't sustained their valuations."
Zoom in: Inflection, which last year raised $1.3 billion at a $4 billion valuation, struggled to find a sustainable business model, and its founders joined Microsoft earlier this year while the startup pivots to selling tech to businesses.
- The founders of Adept, which was valued last year at $1 billion, have similarly departed to join Amazon. Adept still hasn't publicly released a product.
The bottom line: VCs' interest in AI is only growing.
3. Training data
- ChatGPT (the older version, based on GPT 3.5) is great at coding, but it falters when tackling problems that emerged after 2021, the cutoff date for its training data. (IEEE Spectrum)
- Security researchers say the safeguards offered by Glaze, a popular tool that artists use to protect their images and styles from AI imitation, are "easily bypassed." (Ars Technica)
4. + This
Of course, the Lego version of my family came along for the vacation too.
Thanks to Megan Morrone and Scott Rosenberg for editing this newsletter and to Caitlin Wolper for copy editing it.
Sign up for Axios AI+





