This is a submission for the Google I/O Writing Challenge
๐ฌ The Scene
Google I/O 2026 dropped a wall of announcements in two hours.
๐ฅ Gemini 3.5 Flash
๐ค Antigravity 2.0
๐ก๏ธ Firebase AI Logic
๐ WebMCP
๐จ Stitch
๐ง Jules
๐๏ธ Gemini Omni
The keynote sugar rush was real.
Every recap I've read picks one announcement and explains it. That's useful. But it doesn't answer the question I actually had after the livestream ended:
๐ค Which of these can I use TODAY, in a real project, without it blowing up in my face?
So I spent the last 48 hours building with four of the newest tools from I/O 2026. Not demo projects. Not "hello world." Real integration attempts into actual workflows.
Here's what happened. ๐
๐ ๏ธ The Four Tools I Tested
I picked tools that cover different parts of the stack:
| # | Tool | What It Does |
|---|---|---|
| 1๏ธโฃ | Antigravity CLI 1.0.2 | Successor to Gemini CLI โ agent orchestration |
| 2๏ธโฃ | Gemini 3.5 Flash | New default model via AI Studio API |
| 3๏ธโฃ | Firebase AI Logic | Client-side AI inference with security |
| 4๏ธโฃ | WebMCP | Protocol that makes web apps agent-readable |
I tried each one for a specific task. Not a tutorial. A real thing I'd actually ship. ๐
1๏ธโฃ Antigravity CLI: The 129 Skills Nobody's Talking About
Everyone's writing about Antigravity's multi-model routing (Gemini + Claude + GPT-OSS in one CLI). That's cool. ๐
But the thing that actually changed how I work is /skills.
Antigravity ships with 129 built-in skills. Not autocomplete rules โ actual agent behaviors. Things like:
- ๐
agency-code-reviewerโ reviews staged changes before commit - ๐ค
agency-agentic-search-optimizerโ audits whether AI agents can complete tasks on your site - ๐
agency-codebase-onboarding-engineerโ helps new devs understand unfamiliar repos
๐งช The Test
I tested the skill creation workflow on a real React/TypeScript project. One prompt:
"Create a skill that enforces TypeScript strict mode violations before any PR merge"
โก What Antigravity Actually Did
Step 1: Read tsconfig.json and package.json โ understood the stack โ
Step 2: Scanned src/ for existing type patterns โ
Step 3: Ran git status โ understood current state โ
Step 4: Proposed SKILL.md + checker script + pre-commit hook โ
Step 5: Asked for approval, then built all three โ
Step 6: Created mock violations, ran hook against itself, verified โ
โ The Good
One prompt. Zero config files written by hand. The pre-commit hook is active right now and will block the next TypeScript violation.
โ ๏ธ The Bad
The skill lives globally in ~/.gemini/config/skills/, not in the project directory. That means it's available across ALL projects on this machine. Convenient until you have 60 skills conflicting with each other. ๐ฌ
โ The Ugly
Gemini CLI (open source, 10K+ contributors) shuts down June 18. Antigravity is closed source. Google moved developer tooling into its monetization stack.
That's a tradeoff worth acknowledging. ๐ซ
๐ Verdict
The skill system is genuinely powerful. The closed-source migration is genuinely concerning. Both are true.
โญโญโญโญ (4/5)
2๏ธโฃ Gemini 3.5 Flash: Fast, Cheap, and Missing One Thing
I hit the Gemini API via AI Studio to power a content summarization feature. Straightforward task: feed it 3,000-word articles, get back structured summaries.
โก Speed
Sub-second responses for most inputs. Noticeably faster than Gemini 1.5 Pro for equivalent tasks.
Gemini 1.5 Pro: ~2.3s average
Gemini 3.5 Flash: ~0.8s average โ 3x faster ๐
๐ฏ Quality
Good at extraction and summarization. Struggled with nuance โ when I asked it to identify the "controversial take" in an opinion piece, it often defaulted to the most prominent claim rather than the most provocative one.
๐ฐ Cost
This is where it gets interesting. Gemini 3.5 Flash is priced aggressively for high-volume use. If you're building a tool that processes thousands of documents daily, the economics are real. ๐
๐จ The Thing Nobody's Mentioning
Context window behavior. At 128K tokens, it technically handles long inputs. But I noticed quality degradation past ~60K tokens โ the model started missing details buried in the middle of long documents.
This matches what other developers are reporting but nobody's writing about.
๐ Verdict
Excellent for high-volume, structured extraction tasks. Don't trust it for nuanced analysis of long documents without a retrieval layer.
โญโญโญโญ (4/5)
3๏ธโฃ Firebase AI Logic: The Security Model Is the Story
Firebase AI Logic lets you run Gemini inference directly from the client โ your web app or mobile app talks to Google's API without a backend proxy.
The I/O keynote made this sound like magic. ๐ช
The reality is more nuanced.
๐ก๏ธ What's Genuinely New: The 4-Layer Security Model
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Layer 1: App Check โ โ Verifies requests from YOUR app
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ Layer 2: Firestore Rules โ โ Controls who can call the model
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ Layer 3: Rate Limiting โ โ Per-user throttling
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ Layer 4: Output Filtering โ โ Content safety on responses
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
This matters because client-side AI has always had a trust problem: if the API key is in the browser, anyone can abuse it. Firebase's approach doesn't eliminate that risk, but it adds enough friction that casual abuse becomes non-trivial. ๐
๐คท What's NOT New
The inference itself. You could already call Gemini from a frontend using the AI Studio API. Firebase AI Logic wraps this in Firebase's auth and security ecosystem.
If you're already on Firebase โ clean integration โ
If you're not โ migration cost is real โ
๐ต๏ธ The Catch
Client-side inference means your prompt structure is visible in the browser's network tab. For any application where prompt engineering is part of your competitive advantage, you still want a backend proxy. ๐
๐ Verdict
Great for Firebase-native apps that need AI features without backend complexity. Not a replacement for server-side inference in security-sensitive applications.
โญโญโญ (3/5)
4๏ธโฃ WebMCP: The Announcement That Could Matter Most (But Doesn't Yet)
WebMCP is a protocol that lets web applications expose structured information to AI agents. Think of it as robots.txt but for agent interactions โ it tells AI crawlers what your app can do, not just what pages it has.
๐ค Why This Matters
The entire agentic stack (Gemini agents, Antigravity, Jules, etc.) needs to understand web applications to interact with them. WebMCP is Google's attempt at making that standardized.
๐ Why I'm NOT Excited Yet
I tried implementing WebMCP on a small web app and found:
- ๐ Documentation is sparse โ the I/O session covered it in ~4 minutes
- ๐ง Tooling is minimal โ no CLI scaffold, no validator, no testing framework
- ๐ Adoption is zero โ no major frameworks support it yet
- โ It's a Google proposal, not a standard โ W3C/IETF involvement is TBD
๐ Verdict
Watch this space. Don't build on it yet.
โญโญ (2/5)
๐ The Final Scoreboard
| Tool | Score | Use It If... | Skip It If... |
|---|---|---|---|
| ๐ค Antigravity CLI | โญโญโญโญ | You want agent-powered dev workflows | You need open-source tooling |
| โก Gemini 3.5 Flash | โญโญโญโญ | You're building high-volume AI features | You need nuanced long-doc analysis |
| ๐ก๏ธ Firebase AI Logic | โญโญโญ | You're already on Firebase | You need server-side prompt protection |
| ๐ WebMCP | โญโญ | You can afford to experiment | You need something that works today |
๐ก The One Thing That Changed How I Think
The skill file. Hands down. ๐
Before I/O 2026, my AI workflow was:
Open chat โ Paste context โ Get answer โ Copy result
Open chat โ Paste context โ Get answer โ Copy result
Open chat โ Paste context โ Get answer โ Copy result
...forever ๐ฉ
The skill file inverts that:
Define behavior once (SKILL.md) โ Agent executes autonomously โ Forever โพ๏ธ
That's not a feature improvement. That's a different programming model.
The accessibility reviewer I built is now skill #130 on my machine. It lives at:
~/.gemini/config/skills/soilsense-accessibility-reviewer/SKILL.md
Every future Antigravity session can invoke it. One prompt created it. No orchestration code.
๐ฌ The Gemini 3.5 Flash benchmarks will be obsolete in six months. A skill file that enforces your team's standards on every commit โ that compounds.
๐ฏ What Would You Build?
I'm curious what others are finding. Have you tested any of these tools on real projects? What worked? What broke? ๐ค
Especially interested in:
- ๐ง Anyone running Antigravity CLI on Linux (I tested on Windows)
- ๐ฅ Firebase AI Logic in production (not just demos)
- ๐ WebMCP implementations in the wild
Drop your experience below! ๐
The best I/O coverage comes from people who actually built things, not people who watched keynotes. ๐บโก๏ธ๐จ
Thanks for reading! If this helped you decide which I/O tools to try, drop a โค๏ธ and share your own experience in the comments.






Top comments (0)