Newbie questions – Thankyou-Help Doc?/Question

Hello All,

Firstly, this is a really great site (so whoever created it, it’s something to be proud of). I have just built my first site, and the whole Sitemap topic can scare the living nunchakus out of you, but with a little patience and reading the stuff on here it’s been a real education. So, that said, a heartfelt ‘Well Done’.

The questions:

Q: Is there a document/help file that explains what each part of the XML Sitemap Tool does, especially for someone who doesn’t know the references along the top? I’d like to understand more about things like the ‘In-l & Out-l’. Well all of it really.

Can I be cheeky and ask you to have a look at www.executivecoachingsolutions.co.uk on the XML tool?

Q: Why do Nr.1 and Nr.17 have the same title?

Q: Why is the level for Nr.1 = 0 and for Nr.17 = 1, if they have the same title?

Q: On the site itself, pages such as ‘What we do, About Us, Modus Operadi, Life Long Learning, Terms and Conditions’ all sit as a menu off the Home page, so why are they all level 1?

Q: Why is the ‘links page’, which sits as a link off the TandC page, a level 2, and why are the JPGs that appear on a Level 1 page also level 2?

The big question!

Q: Is there anything about the XML that would indicate the site can’t be seen by a search engine? I have a 404 (Not Found) dated April 1st for the index page, but I have uploaded the XML sitemap since then.

Gosh, that’s a lot. But I hope the answers (if there are any) help others as well.

Comments

  1. AMPC says:

    Thanks for the kudos! Flattery will get you everywhere.

    There is documentation on AuditMyPC.com’s sitemap page. In-l and Out-l are inbound and outbound links (these can really help you find problems!).
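
    As a rough illustration of what inbound and outbound link counts mean, here is a minimal Python sketch. The link graph is made-up sample data, `count_links` is a hypothetical helper, and the way the sitemap tool actually counts links is an assumption on my part:

```python
from collections import defaultdict

# Hypothetical link graph: each page maps to the pages it links to.
# The page names are made up for illustration only.
links = {
    "/": ["/about.html", "/terms.html"],
    "/about.html": ["/"],
    "/terms.html": ["/", "/links.html"],
    "/links.html": [],
}

def count_links(graph):
    """Return {page: (inbound, outbound)} for every page in the graph."""
    # Outbound: how many distinct pages this page links to.
    out_l = {page: len(set(targets)) for page, targets in graph.items()}
    # Inbound: how many distinct pages link to this page.
    in_l = defaultdict(int)
    for page, targets in graph.items():
        for target in set(targets):
            in_l[target] += 1
    return {page: (in_l[page], out_l[page]) for page in graph}

for page, (inbound, outbound) in count_links(links).items():
    print(page, "in:", inbound, "out:", outbound)
```

    A page with zero inbound links cannot be reached by following links, which is the kind of problem these columns can help you spot.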

    I looked at your site and yes, you can be cheeky.

    1 and 17 indicate an SEO mistake, and you are going to want to fix that. The sitemap generator, like search engines and humans, should not see the index.html file. Without going into great detail, the website executivecoachingsolutions has the exact same content that the index page has, correct? Although Google may not see this as duplicate content, others may.

    Cause: somewhere in your code you are linking to index.html.
    Solution: find that link to index.html and replace it with the full address of your website, or just ‘/’.
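
    If it helps to see that fix spelled out, here is a minimal Python sketch that rewrites links to index.html so they point at ‘/’ instead. The function name `rewrite_index_links` and the sample HTML are hypothetical, and a regular expression is only a rough stand-in for editing the pages by hand:

```python
import re

# Matches href="index.html", href='./index.html' or href="/index.html".
# A link such as href="about/index.html" is deliberately left alone.
INDEX_HREF = re.compile(
    r'''href\s*=\s*(["'])(?:\./|/)?index\.html\1''',
    re.IGNORECASE,
)

def rewrite_index_links(html, root="/"):
    """Replace links to the top-level index.html with the site root."""
    def replace(match):
        quote = match.group(1)
        return f"href={quote}{root}{quote}"
    return INDEX_HREF.sub(replace, html)

sample = '<a href="index.html">Home</a> <a href="about.html">About</a>'
print(rewrite_index_links(sample))
```

    Passing the full address as `root` (for example the site's own domain) gives the other variant of the fix.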

    Fix the problem and then run the sitemap and see if that answers your question(s).

    And no, I didn’t find any problems – all looks good on my cursory review. There is no robots.txt, and the server headers report:
    HTTP Status Code: HTTP/1.1 200 OK
    Date: Fri, 18 Apr 2008 16:06:29 GMT
    Server: Apache/2.0.52 (CentOS)
    Last-Modified: Fri, 18 Apr 2008 16:05:40 GMT
    ETag: "3a58416-73ca-ebe05d00"
    Accept-Ranges: bytes
    Content-Length: 29642
    Connection: close
    Content-Type: text/html; charset=UTF-8

    All ok

    Best regards,

    Jim.

  2. themodeller says:

    Hi, thanks for the reply.

    I’m trying to think this through, but am struggling.

    I don’t quite understand your Solution.
    When you say find that call to index.html what does that mean?

    Regards,

    Guy

  3. AMPC says:

    That means you have in your code a reference to index.html and you should have a reference to the domain name (website name) instead.

    I’m assuming that when someone visits your site for the first time (main page), index.html is referenced.
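
    To show why that matters for the levels in the report, here is a small Python sketch. It assumes (my assumption, not something taken from the tool's documentation) that a page's level is the number of clicks needed to reach it from the start address. The site structure is made-up sample data and `crawl_levels` is a hypothetical helper:

```python
from collections import deque

def crawl_levels(graph, start):
    """Breadth-first walk: level = fewest clicks from the start page."""
    levels = {start: 0}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in graph.get(page, []):
            if target not in levels:  # first visit is the shortest path
                levels[target] = levels[page] + 1
                queue.append(target)
    return levels

# Hypothetical site: the home page "/" links to index.html in its menu.
site = {
    "/": ["/index.html", "/about.html", "/terms.html"],
    "/index.html": ["/about.html", "/terms.html"],
    "/terms.html": ["/links.html"],
}

levels = crawl_levels(site, "/")
for page, level in levels.items():
    print(level, page)
```

    Under that assumption, ‘/’ is level 0 and index.html shows up as a separate level 1 page with the same content, while a page linked only from a level 1 page lands on level 2.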

    Best regards,

    Jim.
