ArtsAutosBooksBusinessEducationEntertainmentFamilyFashionFoodGamesGenderHealthHolidaysHomeHubPagesPersonal FinancePetsPoliticsReligionSportsTechnologyTravel

Generating a Good Sitemap

Updated on February 6, 2012
Sitemap and Search engines
Sitemap and Search engines

If the internet was a three-dimensional object, it would be a massive entity. Because, even though the world of “0’s and 1’s ” cannot be seen, or even imagined by most people - it is a really wide e-space. But to get an idea of how big it is all we need to do is consider a few numbers1:

- As of May 2011, there were approximately 39 billion web pages online.

- As of May 2011, Google had indexed around 36.5 billion web pages. Bing had indexed 11.5 billion web pages and Yahoo! had 10.6 billion pages in its search index.

Map Needed

With all these pages floating around on the internet it would be almost impossible to find a single page without the help of search engines. These engines are on a constant crawl over the internet discovering and indexing each and every web page created. Any webmaster looking to have his or her site surface on search results would have to make sure that all the help is given to the web crawlers. One way of making sure that a website is properly crawled, and hence indexed, is by submitting a Sitemap.

What is it?

A Sitemap, spelt with a capital ‘S’, is an XML file that lists the pages in a website2. It is created with the intention of serving as a map for search engine crawlers. This is especially good if the site has dynamic content that changes frequently. Also, if the site contains pages that are not linked to one another, search engine crawlers might not find and index them, because search engines find pages by following link.

How to create it

To understand how to create a Sitemap, it is necessary to understand the structure and commands it contains. As mentioned earlier, a Sitemap is an XML file; therefore the first line will read:

<?xml version="1.0" encoding="UTF-8"?>

Next, the URL of the website needs to be included:

<urlset xmlns="">

Now the individual pages can be added, but with the pages, more information can be submitted to the search engine crawlers. These are:

· The exact location of the page: <loc></loc>

· The day it was last modified: <lastmod> 2009-09-22</lastmod>

· The frequency of update in the page: <changefreq>monthly</changefreq> Here it should be noted that the frequency can be set to ‘always’, ‘hourly SDHG’, ‘daily’, ‘weekly’, ‘monthly’ or ‘never’.

· The importance of the page with respect to other pages: <priority>0.8</priority> the range here is between 0 and 1 (0 = ‘Low’, 1 = ‘High’ importance) with 0.5 being the default.

Putting it all together we would get:

<?xml version="1.0" encoding="UTF-8"?>

<urlset xmlns="">


<loc> </loc>

<lastmod> 2011-01-21</lastmod>





<loc> </loc>

<lastmod> 2011-02-23</lastmod>








Once all the pages have been added as <url> </url> the file is ended with a closing </urlset> and that becomes a neatly created Sitemap.

After saving it as an XML file, all that remains to be done is to upload it to the root folder of the website and then submit the URL of it to all the search engines.

1 – World Wide Web Size:

2 – Google Webmaster Central:


    0 of 8192 characters used
    Post Comment

    No comments yet.


    This website uses cookies

    As a user in the EEA, your approval is needed on a few things. To provide a better website experience, uses cookies (and other similar technologies) and may collect, process, and share personal data. Please choose which areas of our service you consent to our doing so.

    For more information on managing or withdrawing consents and how we handle data, visit our Privacy Policy at:

    Show Details
    HubPages Device IDThis is used to identify particular browsers or devices when the access the service, and is used for security reasons.
    LoginThis is necessary to sign in to the HubPages Service.
    Google RecaptchaThis is used to prevent bots and spam. (Privacy Policy)
    AkismetThis is used to detect comment spam. (Privacy Policy)
    HubPages Google AnalyticsThis is used to provide data on traffic to our website, all personally identifyable data is anonymized. (Privacy Policy)
    HubPages Traffic PixelThis is used to collect data on traffic to articles and other pages on our site. Unless you are signed in to a HubPages account, all personally identifiable information is anonymized.
    Amazon Web ServicesThis is a cloud services platform that we used to host our service. (Privacy Policy)
    CloudflareThis is a cloud CDN service that we use to efficiently deliver files required for our service to operate such as javascript, cascading style sheets, images, and videos. (Privacy Policy)
    Google Hosted LibrariesJavascript software libraries such as jQuery are loaded at endpoints on the or domains, for performance and efficiency reasons. (Privacy Policy)
    Google Custom SearchThis is feature allows you to search the site. (Privacy Policy)
    Google MapsSome articles have Google Maps embedded in them. (Privacy Policy)
    Google ChartsThis is used to display charts and graphs on articles and the author center. (Privacy Policy)
    Google AdSense Host APIThis service allows you to sign up for or associate a Google AdSense account with HubPages, so that you can earn money from ads on your articles. No data is shared unless you engage with this feature. (Privacy Policy)
    Google YouTubeSome articles have YouTube videos embedded in them. (Privacy Policy)
    VimeoSome articles have Vimeo videos embedded in them. (Privacy Policy)
    PaypalThis is used for a registered author who enrolls in the HubPages Earnings program and requests to be paid via PayPal. No data is shared with Paypal unless you engage with this feature. (Privacy Policy)
    Facebook LoginYou can use this to streamline signing up for, or signing in to your Hubpages account. No data is shared with Facebook unless you engage with this feature. (Privacy Policy)
    MavenThis supports the Maven widget and search functionality. (Privacy Policy)
    Google AdSenseThis is an ad network. (Privacy Policy)
    Google DoubleClickGoogle provides ad serving technology and runs an ad network. (Privacy Policy)
    Index ExchangeThis is an ad network. (Privacy Policy)
    SovrnThis is an ad network. (Privacy Policy)
    Facebook AdsThis is an ad network. (Privacy Policy)
    Amazon Unified Ad MarketplaceThis is an ad network. (Privacy Policy)
    AppNexusThis is an ad network. (Privacy Policy)
    OpenxThis is an ad network. (Privacy Policy)
    Rubicon ProjectThis is an ad network. (Privacy Policy)
    TripleLiftThis is an ad network. (Privacy Policy)
    Say MediaWe partner with Say Media to deliver ad campaigns on our sites. (Privacy Policy)
    Remarketing PixelsWe may use remarketing pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to advertise the HubPages Service to people that have visited our sites.
    Conversion Tracking PixelsWe may use conversion tracking pixels from advertising networks such as Google AdWords, Bing Ads, and Facebook in order to identify when an advertisement has successfully resulted in the desired action, such as signing up for the HubPages Service or publishing an article on the HubPages Service.
    Author Google AnalyticsThis is used to provide traffic data and reports to the authors of articles on the HubPages Service. (Privacy Policy)
    ComscoreComScore is a media measurement and analytics company providing marketing data and analytics to enterprises, media and advertising agencies, and publishers. Non-consent will result in ComScore only processing obfuscated personal data. (Privacy Policy)
    Amazon Tracking PixelSome articles display amazon products as part of the Amazon Affiliate program, this pixel provides traffic statistics for those products (Privacy Policy)