Reddit content powers AI, Huffman says human authenticity still matters

Huffman cast Reddit as AI’s raw material, while Reddit’s voting culture still punishes machine-made posts. That mix is shaping who gets surfaced in AI answers.

Nina Kowalski · 2 min read
Source: searchenginejournal.com

Reddit CEO Steve Huffman put Reddit at the center of the AI search fight in New York City on May 19. Speaking at Fast Company’s Most Innovative Companies Summit, he said that Reddit content has become like oil and that large language models would not exist as we know them without it. His message was bigger than a boast about traffic: Reddit is no longer just a discussion site but a source that feeds the modern AI stack, from training data to the live retrieval systems now shaping chatbot answers and AI Overviews.

That matters because Reddit sits in two places at once. It licensed data to Google in February 2024, around the time of its IPO filing, in a deal Reuters reported could be worth about $60 million a year. Reddit also disclosed that its data licensing agreements carried a total contract value of $203 million over two to three years. In May 2024, Reddit announced a similar arrangement with OpenAI, and CNBC reported that OpenAI received access to Reddit’s Data API for real-time, structured, unique content. Reddit said that partnership would also help it build new AI-powered features for users and moderators.

The company’s expanding role has drawn scrutiny as well. In March 2024, Reddit disclosed that the Federal Trade Commission had opened a non-public inquiry focused on the sale, licensing, or sharing of user-generated content for AI training. That backdrop makes Huffman’s remarks especially pointed: Reddit is valuable because it is both licensed and crawled, both curated and messy, both human and machine-readable.

The platform’s influence shows up in the numbers. Research cited by Columbia Journalism Review and Profound found that between August 2024 and June 2025, Reddit was the most cited domain by Google AI Overviews and Perplexity, and the second most cited by ChatGPT. Reddit has also started building for that reality itself: in December 2024 it began rolling out Reddit Answers, an AI-powered search feature that summarizes conversations and links to related posts.

Chart: Reddit AI Citation Rank

Huffman has repeatedly framed Reddit as a place for lived experiences, recommendations, human curation and conversation, and that is exactly why the platform is so powerful in AI search. Its voting system can function as a crowd-sourced authenticity filter, and Huffman said Reddit’s community is already pushing back on AI-written content through downvotes. For brands, that is the strategic warning buried inside the hype: if Reddit is becoming part of the information substrate that shapes AI answers, then weak, promotional or inauthentic posts may do more than fail to win attention. They can become the signal AI systems learn from.

This article was produced by Prism’s automated news system from verified source data, official records, and press releases, then run through automated quality and moderation checks before publishing. The system is built and supervised by the people who set the standards it runs under. Read our full AI policy.
