19.6 C
Washington
Monday, August 8, 2022
spot_img

Googlebot Crawls & Indexes First 15 MB HTML Content Via @sejournal, @BrianFr07823616

Date:

Share:

Google reveals its web crawler only uses first 15MB of a page’s HTML to determine rankings.

Googlebot Crawls & Indexes First 15 MB HTML Content

In an update to Googlebot’s help document, Google quietly announced it will crawl the first 15 MB of a webpage. Anything after this cutoff will not be included in rankings calculations.

Google specifies in the help document:

“Any resources referenced in the HTML such as images, videos, CSS and JavaScript are fetched separately. After the first 15 MB of the file, Googlebot stops crawling and only considers the first 15 MB of the file for indexing. The file size limit is applied on the uncompressed data.”

This left some in the SEO community wondering if this meant Googlebot would completely disregard text that fell below images at the cutoff in HTML files.

“It’s specific to the HTML file itself, like it’s written,” John Mueller, Google Search Advocate, clarified via Twitter. “Embedded resources/content pulled in with IMG tags is not a part of the HTML file.”

What This Means For SEO

To ensure it is weighted by Googlebot, important content must now be included near the top of webpages. This means code must be structured in a way that puts the SEO-relevant information with the first 15 MB in an HTML or supported text-based file.

It also means images and videos should be compressed not be encoded directly into the HTML, whenever possible.

SEO best practices currently recommend keeping HTML pages to 100 KB or less, so many sites will be unaffected by this change. Page size can be checked with a variety of tools, including Google Page Speed Insights.

In theory, it may sound worrisome that you could potentially have content on a page that doesn’t get used for indexing. In practice, however, 15MB is a considerably large amount of HTML.

As Google states, resources such as images and videos are fetched separately. Based on Google’s wording, it sounds like this 15MB cutoff applies to HTML only.

It would be difficult to go over that limit with HTML unless you were publishing entire books’ worth of text on a single page.

Should you have pages that exceed 15MB of HTML it’s likely you have underlying issues that need to be fixed anyway.


Source: Google Search Central

Featured Image: SNEHIT PHOTO/Shutterstock

Subscribe to SEJ

Get our daily newsletter from SEJ’s Founder Loren Baker about the latest news in the industry!

Ebook

FREE GIFT







more like this

Practical SEO Solutions To Your Website And Business

If you're working to make money online, increasing the traffic that visits your site is extremely important, because it effects simply how much you are making. Seo is a terrific way to create your website more accessible. Learn some good search engine optimisation methods in the article below.As you may build up your site, avoid…

Techniques For Becoming Successful At Email Promoting

Lots of business owners aim to attract customers while spending less than possible. A terrific way to try this would be to take full advantage of marketing with email. Should you wish to find out more about utilizing this tactic to enhance the success of your organization, read this article. The piece that follows will…

KIDS PARTY SUPPLIES ECOMMERCE WEBSITE BUSINESS WITH 1 YEARS HOSTING & DOMAIN

 Buy Now  Price: 3275 USDKIDS PARTY SUPPLIES ECOMMERCE WEBSITE BUSINESS WITH 1 YEARS HOSTING & DOMAINvar descGaugeInfo = {descST:(new Date()).getTime()}; window.onerror = function(message, url, line)...

Unlimited Website Visitors for from Popular Social Platforms!!

 Buy Now  Price: 1995 USDUnlimited Website Visitors for from Popular Social Platforms!!var descGaugeInfo = {descST:(new Date()).getTime()}; window.onerror = function(message, url, line) { return true; }; var ebayItemID='254621505425'; var...
spot_img

LEAVE A REPLY

Please enter your comment!
Please enter your name here