Hello All,
Firstly this is a really great site (so who ever created it, it’s something to be proud of), I have just built my first site and the whole Sitemap topic can scare the living nunchaku’s out of you, but with a little patience and reading the stuff on here it’s been a real education, so that said (a heart felt) ‘Well Done’.
The questions:
Q: Is there a document/help file that explains what each part of the XML Sitemap Tool does, especially for someone who doesn’t know the references along the top? I’d like to understand more about things like the ‘In-l & Out-l’. Well all of it really.
Can I be cheeky and ask you to have a look at www.executivecoachingsolutions.co.uk on the xml tool.
Q: Why do Nr.1 and Nr.17 have the same title?
Q: Why is the level for Nr.1=0 and Nr.17=1? If they have the same title.
Q: On the site itself, pages such as ‘What we do, About Us, Modus Operadi, Life Long Learning, Terms and Conditions’ all sit as a menu off the Home page, so why are they all level 1?
Q: The ‘links page’ which sits as a link off the TandC page is a level 2 and so are JPGS, which appear on a Level 1 page?
The big question!
Q:Is there anything about the xml that would indicate the site can’t be seen by a search engine. I have a 404 (Not Found) dated April 1st for the index page, but have uploaded the xml sitemap since then?
Gosh, that’s alot. But I hope the answers (if there are any) helps others as well.

Thanks for the kudos! Flattery will get you everywhere
There is documentation on AuditMyPC.com’s sitemap page. In and out-l are inbound and outbound links (these can really help you find problems!)
I looked at your site and yes, you can be cheeky
1 and 17 indicate a SEO mistake and you are going to want to fix that. The sitemap generator, including search engines and humans, should not see the index.html file. Without going into great detail, the website executivecoachingsolutions has the exact same content that the index page has, correct
Although Google may not this as duplicate content, others may.
Cause: somewhere in your code you are calling the index.html
Solution: find that call to index.html and put the full address or your website or just ‘/’
Fix the problem and then run the sitemap and see if that answers your question(s).
And no, I didn’t find any problems – all looks good on my cursory review. No robots.txt and server headers report
HTTP Status Code: HTTP/1.1 200 OK
Date: Fri, 18 Apr 2008 16:06:29 GMT
Server: Apache/2.0.52 (CentOS)
Last-Modified: Fri, 18 Apr 2008 16:05:40 GMT
ETag: "3a58416-73ca-ebe05d00"
Accept-Ranges: bytes
Content-Length: 29642
Connection: close
Content-Type: text/html; charset=UTF-8
All ok
Best regards,
Jim.
Hi, thanks for the reply.
I’m trying to think this through, but am struggling.
I don’t quite understand your Solution.
When you say find that call to index.html what does that mean?
Regards,
Guy
That means you have in your code a reference to index.html and you should have a reference to the domain name (website name) instead.
I’m assuming that when someone visits your site for the first time (main page) , index.html is referenced.
Best regards,
Jim.