Artwork for podcast SEO F**king What - Get Found on Google and make money from your website with practical SEO tips
Why Your AI Website Looks Great But Fails at SEO (AI Sites Series Pt 3)
Episode 287th June 2026 • SEO F**king What - Get Found on Google and make money from your website with practical SEO tips • Nikki Pilkington
00:00:00 00:17:19

Share Episode

Shownotes

Hi, I’m Nikki Pilkington. My site is https://nikki-pilkington.com/ and in this episode of “SEO F**king What” (the third and final part of the AI Website series) I pull back the curtain on why so many AI-built websites may look perfect on the surface yet fail spectacularly when it comes to SEO.

Drawing on 30 years of search experience and a recent audit of five B2B AI-generated sites, I explore the hidden problems lurking in headings, meta titles, robots.txt, sitemaps, schema, site performance, and trust signals—issues that even non-AI sites can suffer from if you don’t know to look!

Whether you built your site with AI or not, this episode is packed with practical advice for anyone wanting to get found and make money online. Listen in to find out what’s really going wrong beneath the gloss and how you can fix it.

I cover these things, and more:

  • Issues with homepage H1s
  • Website SEO and sitemap issues
  • Common schema issues on websites
  • Identifying site publication issues
  • Importance of a human SEO audit

Follow Nikki:

LinkedIn: https://www.linkedin.com/in/nikkipilkington/

Instagram: https://www.instagram.com/nikkipilkington/

Bluesky: https://bsky.app/profile/nikkipilkington.bsky.social

Transcripts

Speaker:

Your AI built website looks perfect, professional,

Speaker:

fast, modern. Underneath all of that is

Speaker:

sending Google and other search engines completely the wrong

Speaker:

signals. Let me show you exactly where it's going wrong.

Speaker:

This is SEO Fucking What? I'm Nicky, 30 years in

Speaker:

SEO and I've seen a lot of things that look fine on the surface

Speaker:

and are an absolute fucking mess underneath.

Speaker:

I help small business owners and marketing directors get found on

Speaker:

Google and other search engines and make money from their websites.

Speaker:

Today is part three of our AI website series

Speaker:

and this is the one where we get into the detail of what really

Speaker:

goes wrong. I audited 5 AI built

Speaker:

B2B websites in April 2026. Same

Speaker:

methodology across all 5. And the problems I found

Speaker:

consistent enough and specific enough that I think you

Speaker:

need to hear them whether you used AI to build your site or not.

Speaker:

Because some of this stuff, it's not just an AI problem, it's

Speaker:

a. Nobody told me to check this problem. So let's get into

Speaker:

it.

Speaker:

First of all, the H1. The H1 in most

Speaker:

of the sites was doing absolutely nothing for

Speaker:

SEO. Every single site I audited,

Speaker:

five out of five, not one exception.

Speaker:

Your H1 is the single most important on

Speaker:

page signal search engines use. To understand what

Speaker:

a page is about, it should tell them what the page

Speaker:

covers and who it's for. On every AI built

Speaker:

site I looked at the homepage H1 was a

Speaker:

brand statement or a piece of conversion copy, a

Speaker:

strong line, a clever hook, something that sounds great when you

Speaker:

read it out loud, but means absolutely fuck all to a

Speaker:

search engine. And yes, that includes AI

Speaker:

search. There was a leadership consultant whose

Speaker:

H1 talked about transformation, a SaaS product with

Speaker:

a witty one liner about why their competitors customers were losing

Speaker:

deals. A freelancer whose H1 was a

Speaker:

provocation designed to resonate emotionally with their

Speaker:

target client. One site where the H1 was just

Speaker:

a brand name, nothing else. And these are all

Speaker:

reasonable creative choices. Some of them work

Speaker:

really well as copy. None of them are

Speaker:

search signals. The thing that makes this particularly

Speaker:

annoying is that AI accepts whatever you give it

Speaker:

without questioning whether it works for search. You

Speaker:

bring Your brilliant tagline, AI puts it in the

Speaker:

H1 and nobody flags that. It's invisible to search.

Speaker:

A human SEO would push back. AI

Speaker:

just builds, go and check yours right now.

Speaker:

It's usually the line at the top of the page, the title of

Speaker:

the page. If you're not sure, right click your homepage

Speaker:

view Source search for H1. Is it something a

Speaker:

potential customer would type into Google? If it's

Speaker:

your slogan or tagline or clever words or something

Speaker:

you thought of at 3 o' clock in the morning that you're particularly proud of,

Speaker:

it isn't going to help you in search.

Speaker:

And then the heading structures underneath the H1s were broken

Speaker:

in four out of five sites. The problem isn't just what the

Speaker:

headings say, it's what heading levels have been used and why

Speaker:

the H2s jump into H4s. H4s appear

Speaker:

in navigation menus before the H1 has even appeared in the

Speaker:

document. H5s used for pricing tiers with no

Speaker:

logical parent heading above them. Footer links coded as

Speaker:

H3 tags. This happens because AI picks

Speaker:

heading levels based on how things look rather than what they

Speaker:

mean. It sees something that should look like a subheading,

Speaker:

gives it a smaller font size, assigns it in H4 and

Speaker:

moves on. Search engines don't read the visual

Speaker:

output, they read the code. And when the code is saying

Speaker:

that your navigation label is more important than your page content,

Speaker:

that's a problem.

Speaker:

And then we had meta titles, where meta

Speaker:

title tags were used as a branding exercise, not a search

Speaker:

signal. In four out of five sites,

Speaker:

the meta title tag is what appears in Google search results as

Speaker:

the clickable headline for your page. It's often the first thing

Speaker:

a potential customer sees before they've visited your site.

Speaker:

On four of these sites, it led with the brand name and followed it with

Speaker:

a tagline that wouldn't match any realistic search

Speaker:

query. Your meta title, sometimes called the

Speaker:

SEO title, should tell someone scanning search results

Speaker:

exactly what the page is and whether it's relevant to what they

Speaker:

searched for. Getting it wrong doesn't just affect

Speaker:

rankings, it affects whether someone who does find you

Speaker:

decides to click. Nobody configured the

Speaker:

robots text properly. Four out of five sites,

Speaker:

two had no Roblox text at all. Two had one that was

Speaker:

actively causing problems. One site had a

Speaker:

conflict between Cloudflare managed directives blocking AI

Speaker:

crawlers and manual rules trying to allow them.

Speaker:

Except the Cloudflare section took precedence. So the

Speaker:

manual rules were being overridden before they could take effect.

Speaker:

The owner thought they fixed it. They hadn't. The site was

Speaker:

still blocking ChatGPT, Flexity and others,

Speaker:

and nobody knew. Another site's robot text

Speaker:

had directives that search engines don't recognize as valid, which was

Speaker:

flagging the file as malformed and dragging down the site's technical

Speaker:

SEO score. One site that got this right

Speaker:

went out of its way to explicitly permit AI crawlers,

Speaker:

which was a deliberate and sensible choice given what the product does.

Speaker:

That level of intentionality requires a human

Speaker:

we then looked at the sitemap and the sitemap was either missing

Speaker:

or full of pages that shouldn't be there. Two had

Speaker:

no sitemap at all. On one site the sitemap was at a non

Speaker:

standard URL that crawlers wouldn't find automatically,

Speaker:

and the robot's text was pointing to a different sitemap URL

Speaker:

that returned a 404. On another, every

Speaker:

URL in the sitemap was showing a last modified date

Speaker:

from years earlier, regardless of when the page was actually

Speaker:

updated. And here's the one that comes up constantly on non

Speaker:

AI sites too. Category and tag pages indexed

Speaker:

and included in the sitemap. These are auto

Speaker:

generated index pages with no unique content.

Speaker:

They exist to organize posts in the content management system,

Speaker:

not to rank for anything. Having them in Google's index

Speaker:

creates unnecessary duplication and dilutes the crawl

Speaker:

budget available for pages that matter. Your

Speaker:

sitemap should be a curated list of the pages you want Google

Speaker:

to find on most of these sites. It was a full

Speaker:

inventory of everything the CMS had ever generated.

Speaker:

The schema was broken or missing entirely on

Speaker:

every site. 5 out of 5, and this one has some

Speaker:

genuinely spectacular individual failures.

Speaker:

One site had no structured data anywhere, not a single

Speaker:

schema block across the entire site. One

Speaker:

site had article schema on every blog post, which is right,

Speaker:

except the schema had been written incorrectly.

Speaker:

One site had article schema on every blog post, which is the right

Speaker:

idea, except it had been written incorrectly.

Speaker:

Google's rich results test confirmed it couldn't pass any of

Speaker:

it. Every blog post on that site is ineligible for article

Speaker:

rich results, not because the schema is missing, but because the

Speaker:

implementation is broken in a way you'd never spot just by looking at

Speaker:

the site. One site had software application

Speaker:

schema on its homepage with the price field set to zero. The

Speaker:

product isn't free, it has a monthly subscription. The schema

Speaker:

was telling search engines the product costs nothing. Another

Speaker:

site had article schema applied as a global template to

Speaker:

every page, including the pricing page. So search

Speaker:

engines were being told a page designed to convert paying customers was

Speaker:

a piece of editorial content. The pattern

Speaker:

is consistent. AI treats schema as something

Speaker:

to attempt rather than something to get right. The

Speaker:

intent is sometimes there. The understanding isn't.

Speaker:

In all the sites I looked at, mobile performance was

Speaker:

significantly worse than desktop. Desktop

Speaker:

scores were generally okay. Mobile scores were a different story

Speaker:

story render block in CSS third party scripts

Speaker:

loading before they're needed. Images search at far larger

Speaker:

dimensions than they're displayed at. One site that

Speaker:

particularly stuck with me had a single waitlist signup

Speaker:

form embedded on the homepage. That form was

Speaker:

loading a full React application, a CSS framework,

Speaker:

and a recaptcha integration on every page load

Speaker:

whether or not the visitor ever scrolled to it. Moving that form

Speaker:

to its own page or loading it lazily so it only fires when someone

Speaker:

reaches it would fix the mobile score almost

Speaker:

entirely. AI embedded the form. It

Speaker:

didn't consider what the form was bringing with it.

Speaker:

And then we have trust signals. Trust signals were present

Speaker:

in the structure and only every site.

Speaker:

Generic author bylines with no biography or credentials.

Speaker:

Testimonials with no names, no company names, nothing

Speaker:

verifiable Author avatars using the brand logo

Speaker:

instead of a photo of an actual person. Privacy

Speaker:

policies hosted on Google Docs Blog posts with no

Speaker:

author information beyond a name and a date. Google's

Speaker:

EEAT framework places increasing weight on being

Speaker:

able to verify who created content and whether that person

Speaker:

has genuine credentials. AI builds the

Speaker:

container for trust signals, but it doesn't fill them with

Speaker:

anything a skeptical prospect or Google could actually

Speaker:

verify, because that's a human job.

Speaker:

And on every one of these sites, nobody had done it.

Speaker:

And then there were the one off findings. These are the

Speaker:

ones that made me stop and reread my notes. One site had

Speaker:

been built using an AI tool that generated three complete

Speaker:

design directions for the client to choose from, which is a

Speaker:

sensible idea. The problem is that when the chosen design

Speaker:

went live, all three layouts went with it. Only one was

Speaker:

visible to visitors. All three were in the HTML. So what

Speaker:

looked to a visitor like a clean single page site looked to a search

Speaker:

engine like a page with three identical H1s and the same

Speaker:

content repeated three times on the same URL.

Speaker:

Another site had both the WWW and the non

Speaker:

www versions of every page live, live and

Speaker:

accessible, with no redirect between them, no canonical

Speaker:

tags, no idea which one was authoritative.

Speaker:

That single structural issue was responsible for for the majority

Speaker:

of the error count in the crawl. What looked like dozens of

Speaker:

separate problems was one problem multiplying across

Speaker:

every URL on the site. Another site

Speaker:

had published over 100 blog posts, and the vast majority of them

Speaker:

carried the same publication date. Dozens of posts on

Speaker:

completely different topics, all dated the same day, all

Speaker:

live, simultaneously. Search engines noticed that

Speaker:

the site that drops a huge volume of content in one batch

Speaker:

looks very different to a site that builds a content library

Speaker:

steadily over time, particularly when the site is

Speaker:

relatively new and hasn't yet established any authority.

Speaker:

So that's what's going wrong? And I want to be clear. None of these

Speaker:

things are things you'd spot just by looking at your site. They're

Speaker:

invisible unless you know where to look, which is exactly

Speaker:

the fucking problem. In a moment, I'll give you the

Speaker:

practical steps to start checking your own site, whether it was AI built or

Speaker:

not. First, a quick break.

Speaker:

Here's what I want you to do. First of all, check your

Speaker:

H1. View your source on your homepage, search for H1 and ask

Speaker:

yourself honestly whether what's in there is something a potential customer would

Speaker:

type into Google. If it's your tagline or something a bit

Speaker:

vague, fix it. I'm sorry. I'm sorry you're proud

Speaker:

of that tagline? I'm sorry it's the best line you've ever written. I'm

Speaker:

sorry you want everyone to see it. Even if Google ranks you for

Speaker:

it, no one's searching for it. Get rid of it. Put it somewhere

Speaker:

else. Test your schema. Go to Google's rich results

Speaker:

test, put in your URLs and see what comes back. If there are

Speaker:

errors, they need fixing. If there's nothing there at all,

Speaker:

schema is worth adding. Check your canonical tags.

Speaker:

Use the SEO metaring one click browser extension. It'll show

Speaker:

you those in seconds. The canonical tag on your homepage should point

Speaker:

to the exact URL you want Google to treat as authoritative,

Speaker:

including whether it has www or not, whether it has

Speaker:

a trailing slash. If it's pointing somewhere else or is missing

Speaker:

that needs sorting. Check your Cloudflare settings.

Speaker:

If your site's on Cloudflare and was set up after July 25th,

Speaker:

you may be blocking AI crawlers by default without knowing it.

Speaker:

Go into the Cloudflare dashboard and check. Whether you want them

Speaker:

blocked or not is your call, but it should be a decision you made,

Speaker:

not a default you inherited. And then, I'm

Speaker:

sorry, but get someone like me to look at your site properly. That's

Speaker:

not because AI has done a fucking terrible job.

Speaker:

Probably hasn't in lots of ways, but the things

Speaker:

AI misses are specifically the things that are

Speaker:

invisible without knowing where to look. A

Speaker:

proper SEO audit will tell you what needs attention and in what

Speaker:

order. AI built sites are getting better.

Speaker:

The design output is already strong, the

Speaker:

structural foundations are broadly solid, and some of the commercial

Speaker:

content I saw was genuinely impressive.

Speaker:

The gap is in the layer between the site works

Speaker:

and the site is properly set up for search. And right

Speaker:

now, closing that gap needs a human.

Speaker:

So that's a wrap on the AI website series. I've always

Speaker:

wanted to say. That's a wrap. If you've been listening to all three

Speaker:

parts and you're now looking at your website slightly differently.

Speaker:

Good. That was the point. Follow SEO

Speaker:

fucking what? Wherever you're listening so you don't miss what's coming next.

Speaker:

And if you want me to look at your site and tell you exactly what's

Speaker:

going on underneath the surface, find me in the show notes.

Speaker:

Until next time. Get found. Make money.

Speaker:

Stop assuming your website is fine just because it looks

Speaker:

fine.

Links

Chapters

Video

More from YouTube