Some questions/feature requests about URLs

Hello, thanks for doing this Google Sitemaps thing. It’s really quite handy.

I have some questions, and if they can’t be answered, some feature requests.

I have a very large site, and it’s hard to go over it with a fine-tooth comb to weed out mistakes and duplicates or add missing entries.

1) URLs shouldn’t be case-sensitive. You say that AuditMyPC.com is the same as auditmypc.com, but folders are still treated as case-sensitive. For instance, example.org/foo/ is not treated the same as example.org/FOO/.

2) Is there a way to weed out duplicate index files? For example, example.org/bar/ and example.org/bar/index.cfm return two separate results, even though they are the same file.

3) Can I include no-www URLs along with www URLs? For example, on my site, we are moving from http://www.example.org to http://example.org. If I enter the first URL, it doesn’t follow any no-www links. If I enter the latter, it doesn’t follow any www links.

4) I don’t want to follow any https links. Is this possible?

5) Can you give some examples as to what results I can expect to see (and not see) when I load a site from Anywhere, Server, and Directory? The documentation here is a little vague. What I mainly know is that "Anywhere" will add external links to my sitemap, and that’s about it.

6) Is there a way to retry broken links?
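
To make questions 1 through 4 more concrete, here is a rough Python sketch of the kind of URL clean-up I have in mind. It is not how the applet works (I have no idea what it does internally). The function names and the list of index filenames are my own assumptions, and lowercasing the path is only safe on a server that really does ignore case.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumed list of default index filenames; adjust for your own server.
INDEX_FILES = {"index.cfm", "index.html", "index.htm", "index.php"}


def normalize(url):
    """Return a canonical form of url, or None if the URL should be skipped."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    if scheme == "https":
        return None  # question 4: do not follow https links

    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]  # question 3: treat www and no-www as the same site

    # Question 1: fold case in the path (assumes a case-insensitive server).
    path = parts.path.lower()

    # Question 2: /bar/index.cfm and /bar/ are the same file.
    head, _, tail = path.rpartition("/")
    if tail in INDEX_FILES:
        path = head + "/"
    if not path:
        path = "/"

    # Drop the fragment; keep the query string as-is.
    return urlunsplit((scheme, host, path, parts.query, ""))


def dedupe(urls):
    """Return normalized URLs in first-seen order, without duplicates."""
    seen = set()
    result = []
    for url in urls:
        key = normalize(url)
        if key is not None and key not in seen:
            seen.add(key)
            result.append(key)
    return result
```

With something like this, example.org/bar/, example.org/BAR/ and www.example.org/bar/index.cfm would all collapse into a single entry, and any https link would be left out.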

Again, thanks for the wonderful applet. Hopefully some of my questions can be answered.

Comments

  1. AMPC says:

    Hi Ragdoll,

    What version of the sitemap generator are you using? The older one, version 1.3, is not the one you want to use. Version 2.32 is the latest public release.

    There is also another version that has just been completed. It has a TON of new features: it not only builds sitemaps, but also helps you find errors on your site, check server headers, and much more.

    PM me if you want the link to the new webmaster tool. Fair warning, I have not written up any docs on this yet, but it’s not too difficult to understand.

    Thanks!

    Jim.
