ArtsAutosBooksBusinessEducationEntertainmentFamilyFashionFoodGamesGenderHealthHolidaysHomeHubPagesPersonal FinancePetsPoliticsReligionSportsTechnologyTravel

Why use an XML sitemap on your website?

Updated on December 2, 2012

Why Use An XML Site Map On Your Website

Every website are concerned that once search engines locate their website they locate every single one of the web pages on the website and add these pages to their index. It’s crucial for a website owner that the search engine picks up the Keywords and page description they have loaded into their page content.

It is the existence of these Keywords, within a search engine’s index that helps get the web page displayed within the search engine’s result page. This occurs whenever someone does a search using the same keyword(s) within their search string in a search engine’s - search box.

It’s every website owner’s responsibility to make it as simple as possible for search engine spiders to locate and index every single webpage on their website, whenever search engine spiders visit the website.

A tried and tested way of doing this is to include a Site Map within the website. Normally the site map is accessed via a menu item. Hence both site visitors and search engine spiders can locate the site map easily enough. So let’s take a look at what a site map is. How many types of site map there are? How is a site map created and attached to the website?

What is a site map?

A sitemap is a graphical ( visual ) presentation that lists all the different pages contained within a website.

How many types of site maps are there?

There are two types of sitemaps, HTML ( Hyper Text Markup Language – type ) and XML ( eXtensible Markup Language - type).

The HTML based site map

An HTML sitemap is a graphical presentation that displays hierarchically grouped, lists of hyperlinks belonging to all the pages of a website. It's primarily designed for humans.

By adding an HTML sitemap in your website, site visitors can easily navigate through the website. Additionally, a sitemap of this sort helps in - Sitemap based SEO - because it allows search engine spiders to easily find all the hyperlinks to every page on the website thus avoiding - missed pages.

Missed pages simply mean that these pages will not be in the search engine’s index. Therefore these pages never show up for those who use search engines ( like Google, Bing and Yahoo ) to locate appropriate pages for them to visit.

The XML based site map

An XML sitemap, is basically a list of the different URLs of a website but the list is created using very specific syntax ( i.e. XML ) that all search engines spiders are trained to understand with ease.

Using an XML site map for – Sitemap based SEO - accelerates search engine indexing because an XML site map informs search engine spiders about the different URLs of the website in a language they understand.

Most search engine spiders are trained to look for and identify if an XML based site map exists on a website.  If it does its accessed and used immediately by the spider. 

A search engine’s spider does not have to visit each page of the website and navigate through the links on the page to understand the website’s architecture.  The spider only has to locate XML based site map to see and understand the entire website architecture.

Hence a website that uses an XML based site map for – Sitemap based SEO – would be indexed faster and more accurately and would perhaps rank well in search engines because the website has been thoroughly checked.

 The Importance Of Registering Your Sitemap with Google

Google actually assists webmasters in multiple ways. In the past, - Sitemap based SEO - was more of a guessing game.   Then Google launched a program called Google Analytics. This program provides valuable data about your site. It reports a ton of useful information including:

  • The last time the Googlebot paid a visit to your website
  • The keywords ( keyphrases ) people use in searches to locate your site
  • Problems in your website that need fixing

And a whole lot more.

If you have not created a Google analytics account for your website you really are missing on a whole lot of very important information about your website offered freely by the biggest search engine in use on the Internet.

One of the of Google Analytics data entry forms permits a website owner ( webmaster ) to enter the URL that points to their website XML site map.  This means that all webmasters have the power to inform Google exactly where their XML site map is to be found.  This goes a really long way in ensuring that your website is found and indexed by Google’s spiders.

What’s really nice about the using an XML based site map on your website is that search engine spiders other than Google also access and use the same XML site map.  There are a lot of tools available on the Internet that create will create an XML based site map for a website – for Free.  Do a search in Google and you will be pleasantly surprised at how many there are. 

All of these tools essentially do the same thing. Once you’ve given them a website URL they thoroughly scan the website and deliver a properly formatted XML file, normally named sitemap.xml as the output of this exercise.  This file must be saved to your local computer. Once done, use FTP and place sitemap.xml in the root directory of your website, which is almost always/public_html.

If you are constantly making changes to your website, i.e. adding pages or deleting old pages, then after every such change you need to go back to the XLM based sitemap creation tools and re-create the file sitemap.xml and overwrite the old sitemap.xml file in your website’s root directory. This is because the website architecture has changed and hence the contents of the current sitemap.xml are obsolete. 

Using an obsolete sitemap.xml file on a website hurts the website’s ranking in search engines. This is because one or more of the URLs contained within sitemap.xml could point to a nonexistent resource on the website.  Hence, it’s pretty important to update sitemap.xml immediately after making any structural change in the website.  Not to do so is unwise.

According to Google, XML site map optimization is specifically helpful for the following reasons:

Your website contains dynamic content

Your website is new and doesn't have links that point to it. Since the spiders look at the inbound and outbound links during the crawling process, your site may not be scanned if it hardly has links that lead to it

If your website has pages that Googlebot cannot easily discover, e.g. pages with Rich AJAX or images.

Your site contains a huge archive of webpages that are not well interlinked to each other, or not linked at all

Do take a look at diagram 1 and diagram 2 below.

Ivan Bayross
Open source tutorials | open source training

Diagram 1: Create an XML sitemap using Google
Diagram 1: Create an XML sitemap using Google
Diagram 2. Submit a XML sitemap to Google
Diagram 2. Submit a XML sitemap to Google
working

This website uses cookies

As a user in the EEA, your approval is needed on a few things. To provide a better website experience, hubpages.com uses cookies (and other similar technologies) and may collect, process, and share personal data. Please choose which areas of our service you consent to our doing so.

For more information on managing or withdrawing consents and how we handle data, visit our Privacy Policy at: https://corp.maven.io/privacy-policy

Show Details
Necessary
HubPages Device IDThis is used to identify particular browsers or devices when the access the service, and is used for security reasons.
LoginThis is necessary to sign in to the HubPages Service.
Google RecaptchaThis is used to prevent bots and spam. (Privacy Policy)
AkismetThis is used to detect comment spam. (Privacy Policy)
HubPages Google AnalyticsThis is used to provide data on traffic to our website, all personally identifyable data is anonymized. (Privacy Policy)
HubPages Traffic PixelThis is used to collect data on traffic to articles and other pages on our site. Unless you are signed in to a HubPages account, all personally identifiable information is anonymized.
Amazon Web ServicesThis is a cloud services platform that we used to host our service. (Privacy Policy)
CloudflareThis is a cloud CDN service that we use to efficiently deliver files required for our service to operate such as javascript, cascading style sheets, images, and videos. (Privacy Policy)
Google Hosted LibrariesJavascript software libraries such as jQuery are loaded at endpoints on the googleapis.com or gstatic.com domains, for performance and efficiency reasons. (Privacy Policy)
Features
Google Custom SearchThis is feature allows you to search the site. (Privacy Policy)
Google MapsSome articles have Google Maps embedded in them. (Privacy Policy)
Google ChartsThis is used to display charts and graphs on articles and the author center. (Privacy Policy)
Google AdSense Host APIThis service allows you to sign up for or associate a Google AdSense account with HubPages, so that you can earn money from ads on your articles. No data is shared unless you engage with this feature. (Privacy Policy)
Google YouTubeSome articles have YouTube videos embedded in them. (Privacy Policy)
VimeoSome articles have Vimeo videos embedded in them. (Privacy Policy)
PaypalThis is used for a registered author who enrolls in the HubPages Earnings program and requests to be paid via PayPal. No data is shared with Paypal unless you engage with this feature. (Privacy Policy)
Facebook LoginYou can use this to streamline signing up for, or signing in to your Hubpages account. No data is shared with Facebook unless you engage with this feature. (Privacy Policy)
MavenThis supports the Maven widget and search functionality. (Privacy Policy)
Marketing
Google AdSenseThis is an ad network. (Privacy Policy)
Google DoubleClickGoogle provides ad serving technology and runs an ad network. (Privacy Policy)
Index ExchangeThis is an ad network. (Privacy Policy)
SovrnThis is an ad network. (Privacy Policy)
Facebook AdsThis is an ad network. (Privacy Policy)
Amazon Unified Ad MarketplaceThis is an ad network. (Privacy Policy)
AppNexusThis is an ad network. (Privacy Policy)
OpenxThis is an ad network. (Privacy Policy)
Rubicon ProjectThis is an ad network. (Privacy Policy)
TripleLiftThis is an ad network. (Privacy Policy)
Say MediaWe partner with Say Media to deliver ad campaigns on our sites. (Privacy Policy)
Remarketing PixelsWe may use remarketing pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to advertise the HubPages Service to people that have visited our sites.
Conversion Tracking PixelsWe may use conversion tracking pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to identify when an advertisement has successfully resulted in the desired action, such as signing up for the HubPages Service or publishing an article on the HubPages Service.
Statistics
Author Google AnalyticsThis is used to provide traffic data and reports to the authors of articles on the HubPages Service. (Privacy Policy)
ComscoreComScore is a media measurement and analytics company providing marketing data and analytics to enterprises, media and advertising agencies, and publishers. Non-consent will result in ComScore only processing obfuscated personal data. (Privacy Policy)
Amazon Tracking PixelSome articles display amazon products as part of the Amazon Affiliate program, this pixel provides traffic statistics for those products (Privacy Policy)
ClickscoThis is a data management platform studying reader behavior (Privacy Policy)