The short version: OKF is a packaging format for knowledge, not a new SEO tag
People have started forwarding us Google’s Open Knowledge Format (OKF) announcement, asking “should we rush to add OKF to our site?” Let’s be blunt up front:
OKF is a file format for packaging knowledge so that both humans and AI agents can read it. What it set out to solve is the problem of organizational knowledge scattered across incompatible systems, where every AI agent builder has to reassemble context from scratch — it is not a tag you paste on a marketing site to get ChatGPT mentioning you more this week.
In Google’s own words: a bundle “will not move your rankings or your AI visibility this week.” Its real value is directional: the day an agent comes looking, your knowledge is already organized and “effortless to read.” So this post isn’t telling you to rebuild your site today — it’s helping you understand where OKF sits in the GEO picture and how it relates to what you’re already doing.
What OKF actually is
In June 2026, Google Cloud’s Data Analytics team (Sam McVeety and Amir Hormati) released OKF v0.1. It formalizes the emerging “LLM-wiki” pattern articulated by Andrej Karpathy: rather than having an AI re-search the same documents every time, you give it a shared Markdown knowledge library that grows more useful over time, with the agent handling the tedious cross-reference bookkeeping.
Technically it’s deliberately thin:
- Just Markdown — opens in any editor, renders on GitHub, is indexable by search tools
- Just files — shippable as a tarball, hostable in git, mountable on a filesystem
- Just YAML frontmatter — a small set of queryable structured fields
One concept = one Markdown file, and the file path is that concept’s identity. Each file declares a few YAML fields at the top (only type is required):
---
type: BigQuery Table
title: Orders
description: One row per completed customer order.
resource: https://console.cloud.google.com/...
tags: [sales, revenue]
timestamp: 2026-05-28T14:30:00Z
---
# Schema
| Column | Type | Description |
|--------|------|-------------|
| `order_id` | STRING | Globally unique order identifier |
| `customer_id` | STRING | FK to [customers](/tables/customers.md) |
Notice the last line — a Markdown link pointing to another concept file. That’s the key thing OKF adds over a pile of loose documents: the relationships between pieces of knowledge are preserved, not just the page-by-page content but how the pages connect.
The problem it solves (the most common misunderstanding)
Glance at the fields and you’ll see it: BigQuery Table, table schemas, console links — OKF’s first home is internal enterprise data and knowledge, giving AI agents inside an organization consistent, trustworthy context instead of each one reassembling its own.
So if your question is “should my marketing site adopt OKF,” the honest answer is: as designed in v0.1, it isn’t aimed at your public-facing website. No mainstream AI engine has announced that it will crawl public OKF bundles to decide whether to cite you. Saying this plainly matters, because the distinction directly affects whether you should invest time now.
So does it relate to GEO and being cited by AI?
Yes — but directionally, not as this quarter’s tactic.
Zoom out and the “machine-readable web” has been stacking up layer by layer, each answering a different question:
What’s genuinely worth noting is that OKF shares the same instinct we keep emphasizing: AI doesn’t just look at how polished a single page is — it looks at whether it can understand you as a consistent, internally connected knowledge entity. OKF bakes the page-to-page relationships into the format itself, which is the same road as our work on the brand knowledge system and entity consistency.
But hold onto this: OKF is v0.1, adoption is still unknown, and the public-web side is even less formed. It’s a direction worth tracking, not a “do it now or you lose” to-do. Anyone using it to scare you into rebuilding your site is dressing up something rare as if it were the foundation of your site’s health — the order is exactly backwards.
Do we already have anything related?
Yes, quite a bit. The layer OKF describes is an extension of GEO fundamentals you’re already doing or should be:
- llms.txt — a plain-text source of facts and a signpost for AI crawlers; see SSR vs SPA: the rendering problem behind AI citation
- Structured data (JSON-LD) — author-declared facts the engines trust; see How structured data affects LLM citation
- Agent readiness — where llms.txt, MCP servers and other emerging standards sit in the 12 GEO dimensions; see How GEO dimensions interact
- Brand knowledge system — turning scattered brand facts into a consistent entity AI can read; see The GEO brand knowledge system
OKF isn’t a new job category out of nowhere — it pushes this “organize it for machines” work one notch further toward being more formal and more portable.
So what should you actually do now?
Don’t: tear down and rebuild your site to produce OKF bundles for a v0.1 standard before any mainstream engine has committed to consuming it. That’s chasing hype, not building site health.
Do: exactly the fundamentals that pay off now and also make you “OKF-ready” later:
- Make sure AI crawlers can actually fetch your content (pure client-rendered sites often hand them an empty div)
- State who your company is, what it does, and your track record consistently across your site and beyond
- Keep internal links and topical relationships clear, so “how pages connect” already holds
- Get structured data right and consistent with the content (mismatched markup costs you)
Get these solid and the day a mainstream engine starts consuming such knowledge bundles, you’re already organized and waiting rather than scrambling. And these four aren’t a one-off project — the AI engines, your content, and your competitors all keep moving, so it’s site-health work you have to keep watching and maintaining. That’s exactly why we run it as continuous monitoring and managed service: doing it yourself along these lines is completely fine, it’s just that watching it by hand every week will burn you out.
OKF is worth knowing exists and knowing which slot it fills. But don’t mistake “knowing a new standard exists” for “having to act on it now” — keep the foundation in shape, and when the new standard arrives you’ll already be standing in the right place.
ok but how is this any different from seo with extra steps? feels like every few years someone renames the same thing and sells it back to us
ok but how is this any different from seo with extra steps? feels like every few years someone renames the same thing
Fair pushback 😅 It's not a replacement — clean site + good content is still the foundation, GEO just sits on top of it. The 'extra steps' are mostly about being the source an answer is built from, not one of ten blue links it ignores. Not enemies, same foundation.
honestly half of this reads like the early seo blogs from 15 yrs ago. 'do good content, be the authority, get mentioned in trustworthy places' ... yeah we know, the hard part was always the how and that's exactly the part everyone skips
finally someone says it. been telling clients for a year that getting mentioned by the ai is not the same as ranking #1 and they look at me like im speaking martian
honest question for the author, does this change month to month? like do you get picked up and then quietly dropped when the model updates, or is it sticky once youre in
Both happen. Tactics-driven gains get wiped when a model updates — that's the churn you're describing. But citations that come from genuinely being a clear, trustworthy source tend to be sticky across updates. So we tell people: don't chase this month's behaviour, build the part that survives the update.
wait so do i need to pay for chatgpt to get my shop to show up in it?? sorry if dumb question im not techy, i just run a small bakery and my niece said i should look into this
as a small business owner i'm tired lol. just got the hang of google reviews and now there's a whole new thing
i mean isnt the answer just write good clear pages that explain things. feels like we reinvented that wheel every few years with a new acronym
this is the third article ive read this week saying basically the same stuff and none of them tell you what to actually DO on monday morning
good read but the part about structured content felt a little thin. would love an actual before/after of a page that started getting picked up
doesnt work for me
the part about schema markup is slightly off. the engines aren't 'reading' json-ld the way you imply, most of them rely on the rendered text + retrieval from an index. structured data helps disambiguate entities but it's not the primary signal. worth clarifying so people don't go spend a week on schema thinking it's the magic switch
wait so is this just seo with extra steps
good stuff. i run a small agency and we've been quietly doing this kind of work for clients for about a year, happy to compare notes with anyone here who's experimenting, no pitch just nerding out
my dog could write better meta descriptions than half the sites that DO get cited, so theres hope for all of us
ok but how do you even measure if youre getting cited? like is there a tool or are people just typing questions into chatgpt all day and screenshotting
Honestly there's no single magic tool — what matters is the method: test across multiple mainstream AI engines, with several real-world phrasings, repeatedly over time. One-shot from one engine is mostly noise. We run it as continuous monitoring because doing it by hand weekly will burn you out, but the principle's the same if you DIY.
saved this. been trying to figure out why we show up on google fine but the AI answers never mention us. makes more sense now
skeptical. half of these "get cited by ai" takes are just people noticing the ai already knew about big brands and pretending they cracked a code
we actually started seeing our brand mentioned in answers after we cleaned up our about page and got a couple of decent writeups, so theres something to this even if its hard to measure. didnt do anything fancy