Experiments into the future of publishing and marketing

Research

How would you rate the LLMs?

Human ratings of LLM answers to topical questions. Questions are generated from the past three days of topical news, and users compare responses side-by-side in blind pairwise comparison. Higher ELO = better performance.

RankModelELOWin RateVotesWLT

1Claude 4 Opus92.491.8–93.012,847———

2GPT-4o89.188.4–89.812,847———

3Gemini 2.5 Pro87.686.9–88.312,847———

4Claude 4 Sonnet85.284.5–85.912,847———

5GPT-4o Mini78.377.5–79.112,847———

Elo is a rating system for calculating the relative skill levels of players (or teams) in zero-sum competitions. →

Labs Live

Where impossible ideas get built

We bring together brands, agencies, publishers and engineers to solve hard problems — from first idea to shipped product.

Pitch ideas

Anyone can throw an idea on the table — a hunch, a frustration, a “what if.”

Form teams

Self-organise around the ideas that excite you. Engineers, commercial, data — all welcome.

Build fast

Focused sprint, usually 1–2 days. Working prototypes, not presentations.

Ship

Proven experiments graduate into products. What doesn't ship still teaches us something.

Start a Labs Live hackathon

Projects

Concepts, prototypes, open source projects and stuff

Not all great ideas make it to products, but a lot can be learned along the way. Here's stuff we are working on that might be a great idea, or could become a great product.

Experimenting with AdCP

Last updated: 2026-03-24

Agentic ad buying protocol

ADVERTISINGPLANNINGACTIVATION

Axon

Last updated: 2026-03-24

MCP server with 62+ tools for AI agents

GITHUBOPENSOURCE

Natural ads

Last updated: 2026-03-24

LLM-powered contextual ad creative

ADVERTISINGCREATIVE

Synapse

Last updated: 2026-03-24

Shared prompts, configs, and AI components

GITHUBOPENSOURCE

Updates

Latest news from the lab

Updates, announcements, and posts for a behind-the-scenes look at what we're building.

Featured

Chronicling: "The Fall of Timothee Chalamet at the Oscars" - in real time:

2026-03-27

During our last hackathon, one team built a model to predict the Golden Globe award winners. The team used Ozone's AI sentiment assessment, as well as our "person" keyword extractions from each of the actors, across millions of articles around the world.

Welcome to Ozone Labs!

2026-03-27

Ozone Labs is our channel for sharing our experiments into what the future of publishing, audience engagement and marketing could be, in a world of LLMs and agents.

Damon Reeve

General

How to use Posts (delete before going live)

2026-03-25

Instructions on how to create posts in this updates section

DELETE