Google Sitemap Restrictions?

The Sitemap Generator appears to be a great tool, but I’m fairly new to maintaining a (large) website and would greatly appreciate some advise concerning sitemaps.

Specifically, does Google enforce their statement? -

"If you want to list more than 50,000 URLs,
you must create multiple Sitemap files."

They also state that the file must not be more than 10MB when uncompressed and suggest multiple sitemaps if needed and a sitemap index file.

This info isn’t prominently posted on Google. Has anybody had a sitemap rejected because of these requirements?

Does Yahoo have any restrictions similar to this?

Is there a tool for making a XML site index file to list in robots.txt to point to multiple sitemaps?

Terry

cant seem to generate a site map

I have set up a website at http://irvinescotland.co.uk using Dragonfly CMS and been trying to generate a site map using a variety of tools with a success rate of zero.

So far your tool is the only one that has even admitted that there is a problem, (promising start,) the error code is:

fatal error, cause: java.lang.InterruptedException:sleep interrupted

Have you any views on this, better still do you know what’s wrong?

Generator not generating all pages

I only have 32 pages on my site but this xml generator used to pick them all up when generating the xml / html files. However, now I’m seeing that it only picks up 8 or so.

I have used other sitemap generators and they pick up all 32 pages ok but they don’t have the option of generating html sitemap.

I do use js menus for getting to the other pages but I would think that since it used to pick up the hyperlinks for those it still should.

I need someone who knows how this generator works to help me please!

The website is www.mindquesthypnosis.com

Thanks in Advance!

tube site sitemap

Hi: I was already using this great tool for normal, static webpages. I’ve recently started www.nylonstockingstube.com and I’ve just tryed to make sitemap for it.

But, after several hours of scanning and finding more then 17000 pages (!?), I’ve decided to stop.

Is it possbile somehow to make some usefull sitemap for that site?

thank you for advice.

Error Forbidden

I’m new here and I just ran a sitemap on my site which seems to be working just fine. Except the site map generator indicates I have an error and I’m not not sure what it means It show me that the error is forbidden for the file "www.thenewbieaffiliate.com/.html and it the pages that are causing this error. I had a look at the source for these pages and find nothing that reference /.html. What could this be?

A big thank you for all your great work

I was browsing through the forum and found that 99% of all comments left here are complaints or questions about things that are not working (even though thez are explained on the actual web site)!

I therefore simply wanted to give you guys a big round of applause for your great work and the fact that you do offer all this for free! That’s amazing and I wish there were more pepole like you on this planet.

I’m using your sitemap generator for my own shopping site and was amazed how easy it all worked out!

Thanks again!

How do I automate sitemap generation?

Google’s version of the sitemap generator lives on the server and is Python based. You can set up a cron job to automate it so that the sitemap.xml always reflects all the changes and additions you make to your site without having to intervene physically.

Is there a way to do this with Audit My PC’s version? Is there a version of this generator that can be installed on our server and then automate it in the same fashion?

Thanks.

Slow Project Save/Open

Hi all,

I am using this tool in supporting my article site, which has more than 40,000 article URLs listed in the sitemap.

When I Save or Open the project it takes about 15 minutes to save the project, and more than an hour to open it. If I remove all the URL’s before saving the project, savie/open takes only a minute or so.

I have a 3Ghz Pentium4, 1Mg of RAM, and Windows XP Home SP2.

Is there anything I can do to speed up the Save/Open process, without removing all the URL’s?

Thanks for any advice!
Bob

No Sitemap generated

Hello an good morning.

I try the ROR Sitemap Generator 1.0

My Site URL: http://www.schweigler.org/

When i click on "Genrate Sitemap!"

I get the Message: "Crawling Please wait"

for about 2 Seconds and that all….

I don’t find a sitemap. Whats wrong?

Please advise.

ROR sitemap

Jim,

A while back I was on a website creating an ror.xml sitemap. I can’t remember what the site was. Can you give me some info on ror sitemaps. Do you think it can adversely affect my site.

Also, what does lastmod mean on your sitemaps. It’s been 9/21/07 since, I guess, 9/21/07. Some how I thought that date would be the date the xml sitemap was created. I created one today and got 9/21.

Thanks Jim

Roger

Accents problems

What a wonderful tool
I thank you so much for it.

Now I have a little issue. I applied it to my site http://easyviz.com and it came back with 10% of failed URL because of the accents.
It understood
"http://easyviz.com/Gafas-101/c3/p44/Garant?as/pages.html"
when the right URL is
"http://easyviz.com/Gafas-101/c3/p44/Garant%EDas/pages.html"

How can I solve this issue ???

Thx

All connectivity is lost after sitemap generator runs for a few seconds

Hi,
I’ve successfully used this tool back in March, thought it was great. Now it’s time to update our site and I seem to be running into some problems. First thing I’ve noticed is that when I do start the sitemap generator, after a few seconds of it running I lose all of my network connectivity. I’ve also noticed that it’s sometimes causing my browser (Firefox) to crash. Now I’ve already gone through the posts on this forum and have tried a few things, such as assigning Java more memory, but to no avail. I’ve also tried setting the no proxy options in Java, also to no avail. This is a corporate website and we are behind a firewall and are using proxy servers. Any suggestion would be greatly appreciated. By the way, nothing major has changed on our network that I believe would cause this problem.

Thanks.

All I Receive Is A Java Sign .. Doesn’t Load

Hi There
I’ve been using the Site map Generator for a while now with GREAT Success.
For some reason now when i try to activate the site map generator all i get is a loading Java sign. Wait Wait Wait .. .. and still only a loading Java Sign. ?
I’m working on a Mac Book Pro and have tried to load in Firefox and Safari with no luck. For some unknown reason site map generator doesn’t load.
Please any help/suggestions would be muchly appreciated.
Vinylrecordmac

Google Error

I keep getting the message from google when I try to add site map.

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

Not sure what to do

Any advice?

Thanks

Google Error

I keep getting the message from google when I try to add site map.

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

Not sure what to do

Any advice?

Thanks

Cannot see exported sitemap in Vista/IE7

I have run the sitemap generator, looks ok, export google xml file to my docs directory. but if i search for the file using any program , I cannot see it.
However if I start the generator and "open" the saved map file I can see it and load it.
I cannot gain access to copy the saved file.

I have tried using "open windows explorer as administrator" to no effect.

I am running Vista and IE 7

New B Question

Is there a way to set the crawl frequency and priority with out going row by row or is there a default setting?

Also my first run had errors due to images however the second time around filtering out the images I came up with no errors. Will not having the images crawled have a negative effect? I have over 1,000 images on my site?

Thanks for any help you can give.

Duplicating

Jim,

Used your sitemap genereator ("Site map only"). For every different page it also has the title and link to the index page. I do have a Home link on every page except the Home page itself. Is it supposed to do this? There’s over 25 listings if the title and link to the index page in the sitemap.

Thanks,

Roger

sitemap has only 1 field?

the only field in sitemap is the "loc" unless you manually go into each item and add priority or frequence? what about "lastmod" is it not required?
does each url has to be encoded as in the example on your site? my sitemap has them as is.

thanks for the help

Constantly getting Page Error and Dreamweaver says no page error

Hello
I keep running the sitemap generator on my site
http://www.mymobilenotary.us
and I keep getting a page error on http://www.mymobilenotary.us/resources_mortgage.htm showing a incorrect link to resourcess_funstuff.htm

I have checked and there is no such error on that page. I also checked dreamweaver and found no error. I have closed Dreamweaver and auditmypc.com and reopened and run again and still the error persists! Help! I’m at a loss at this point as to what to do. Thanks for any help.

problem running sitemap generator

I am trying to run the sitemap generator but the system returns no files.
I am using vista and IE7. I attach the Java script log
site name www.wickedtickles.co.uk
Any help would be appreciated.
26.10.2007 17:41:35 – DEBUG – worker ‘crawler’ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawler’ started job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_0′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_1′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_2′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_3′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_4′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_5′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_6′ startJob()
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_0′ started job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_1′ started job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_3′ started job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_2′ started job
26.10.2007 17:41:35 – DEBUG – crawlerThread_3: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – crawlerThread_2: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_5′ started job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_6′ started job
26.10.2007 17:41:35 – DEBUG – crawlerThread_5: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_7′ startJob()
26.10.2007 17:41:35 – DEBUG – crawlerThread_6: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_3.urlWorker’ started job
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StatusChanged:connecting
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: ConnectStarted
26.10.2007 17:41:35 – DEBUG – crawlerThread_3: StatusChanged:connecting
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_4′ started job
26.10.2007 17:41:35 – DEBUG – crawlerThread_4: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – crawlerThread_0: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – crawlerThread_1: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlConnector started
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_7′ started job
26.10.2007 17:41:35 – DEBUG – crawlerThread_7: StatusChanged:waiting
26.10.2007 17:41:35 – DEBUG – UrlConnector.connect( http://www.wickedtickles.co.uk )
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlConnector: ConnectStarted
26.10.2007 17:41:35 – DEBUG – Connected to http://www.wickedtickles.co.uk
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlConnector: ConnectFinished
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StatusChanged:connected
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: ConnectFinished
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlStreamer started
26.10.2007 17:41:35 – DEBUG – UrlStreamer.stream()
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StatusChanged:streaming
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StreamStarted
26.10.2007 17:41:35 – DEBUG – crawlerThread_3: StatusChanged:streaming
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlStreamer: StreamStarted
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlStreamer: StreamFinished
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StreamFinished
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_3.urlWorker’ finished job
26.10.2007 17:41:35 – DEBUG – crawlerThread_3: StatusChangedrocessing
26.10.2007 17:41:35 – ERROR – Invalid redirect location: /onlineshop/home.php
26.10.2007 17:41:35 – ERROR – java.net.MalformedURLException: no protocol: /onlineshop/home.php
at java.net.URL.<init>(Unknown Source)
at java.net.URL.<init>(Unknown Source)
at java.net.URL.<init>(Unknown Source)
at jmaster.util.http.HttpHelper.createURL(Unknown Source)
at jmaster.webtool.model.impl.crawler.CrawlerThread.A (Unknown Source)
at jmaster.webtool.model.impl.crawler.CrawlerThread.M (Unknown Source)
at jmaster.webtool.model.impl.core.AbstractWorker.mak eJob(Unknown Source)
at jmaster.webtool.model.impl.core.AbstractWorker.run (Unknown Source)
at java.lang.Thread.run(Unknown Source)
26.10.2007 17:41:35 – DEBUG – crawlerThread_3: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – UrlConnector.destroy()
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlConnector: Destroyed
26.10.2007 17:41:35 – DEBUG – UrlStreamer.destroy()
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlStreamer: Destroyed
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_3′ finished job
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlConnector_zomby finished
26.10.2007 17:41:35 – DEBUG – crawlerThread_3.urlWorker.urlStreamer_zomby finished
26.10.2007 17:41:35 – DEBUG – crawlerThread_2: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_2′ finished job
26.10.2007 17:41:35 – DEBUG – crawlerThread_7: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – crawlerThread_1: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – crawlerThread_0: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – crawlerThread_4: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – crawlerThread_6: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – crawlerThread_5: StatusChanged:ready
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_7′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_5′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_6′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_0′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_1′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawlerThread_4′ finished job
26.10.2007 17:41:35 – DEBUG – worker ‘crawler’ finished job

Sitemap identifier

Hi,

I keep getting session ids in my sitemap, what is the sitemap identifer so i can stop this happening?

Thank you

Include/Exclude url sets

I have been trying to exclude a wide range of url sets, so it only queries a included url sets. Purpose of this is to "only" add actual product pages to the sitemap. No searches, image enlarge url’s, categories, etc.

Example:

1. Include: www.yoursite.com/products.php?Operation=ItemLookup&ItemId=
2. Exclude: www.yousite.com/products.php?Operation=ItemSearch&Keywords=

Even though I placed these in the include/exclude filter, it still crawled and index ed a very large series of url’s I do not want in my sitemap.

This poses kind of a headache.

Am I doing something wrong?

Cheers

Google error

hello,

After trying to get this error away and posting on over 20 forums, I finaly hope that somebody can help me…

This is the error i Get for every sitemap of my sites, i check them over 20 times and they are all correct, my robots.txt got them listed (whit the tag Sitemap: sitemap.xml)

Here is my site: http://sbonline.be
this is the error:

Code:
Network unreachable: robots.txt unreachable
We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.

This is my sitemap: http://sbonline.be/sitemap.xml
This is my robots.txt: http://sbonline.be/robots.txt

This error just happend… I didn’t change anything on my sitemaps or robots.txt, in one day it just gives errors and doesnt send any urls anymore.

Please somebody help, thnx

~Shub

Generator producing urls that don’t exist

I just ran the Sitemap Generator on my blog, and it’s producing rafts of urls that do not exist, for dates that pre-date the founding of this country.

Example:
<loc>http://www.mysite.com/index.php?year=1747&#x26;month=2</loc>

How do I get the generator to stick to urls that actually exist on my site?

Quarks of Sitemap Proggie

So, I have been trying to run the proggie on some very large sites and here are a few quarks I have come across. Some of these would be great if it were implemented. For me, my Java is set 4096. This is extreme, but works for me.

1. Finished: 65,000; Queued: 600,000. Can’t pause total to let the Que catch up in a day or so. I have a total of about 3.5 million on a few different sites, so it gets kind of crazy.

2. When Exporting, no option to just go ahead and export entire list, not matter if it is finished or just queued. If user knows for sure that all links are good, it shouldn’t even matter if it processed. In the case of massive sites that sitemaps need created, then waiting a few weeks for it to finish is nerve racking.

3. This one is a good one. For very large sites, multiple sitemaps need created. In this case, it would be nice that there can be a Tag block. So if this giant block of 200,000 url’s is selected, the Sitemap Generator will not export them. The proggie will only export those that are not tagged. This way, the admin can keep it going for a very long time, save, open project, or anything else. They will always remain tagged. But then, it also allows the admin to continue with the crawl without the worry of creating duplicate sitemaps when exporting.

In my own case, since none of these are in Sitemaps Generator, I have to create a null in the file to force it to stop, so I can let the queued catch up. Then when I want to continue, remove the null value. This is all done in the xml file itself after saving and re-opening the file. Hifi butchering.

Thoughts and opinions greatly welcomed. Cheers

Vital Pages Not Crawled Successfully

Hi Jim and everyone else!

First, Jim, let me thank you for the best sitemap application I’ve had the pleasure of running. I’ve always appreciate you making these great tools available!

I’ve recently run into a problem with some of my website’s pages being successfully mapped, and since they are sales pages, they’re rather important.

All of a sudden I’ve noticed that about 20 pages of mine are failing their crawl every time I run the sitemap, and I see this in the middle of the URL of each failed page: /../

I’ve tried to figure out what this means, how it got there,and most importantly, how to get these pages crawled again. If someone could point me in the right direction, I’d truly appreciate it!

Chris B.

sitemap concerns

I just used the sitemap generator and uploaded the saved xml file to my server…I then went and tried to view the xml file online domain.com/Sitemap.xml and it states there is no style for the xml file and the tree has a bunch of gibberish in it

<entry id="529a626a" url="aHR0cDovL3d3dy5wYi1uZXR3b3Jrcy5jb20=" length="20508" modified="2007-09-20T09:58:17.000-0500" state="2" mimeType="text/html" encoding="UTF-8" httpCode="200" level="0" title="VXNlZCBDaXNjbywgQnV5IFVzZWQgQ2lzY28sIFNlbGw gVXNlZCBDaXNjbywgUmVmdXJiaXNoZWQgQ2lzY28=" requested="2007-09-28T12:52:52.149-0500" pingTime="263" getTime="69" changeFrequency="0" priority="0"/>
<entry id="1b9d9dc0" url="aHR0cDovL3d3dy5wYi1uZXR3b3Jrcy5jb20vc3R5bGUvZ 2xvYmFsc3R5bGUuY3Nz" length="184" modified="2007-04-01T10:37:59.000-0500" state="2" mimeType="text/css" encoding="UTF-8" httpCode="200" level="1" requested="2007-09-28T12:52:54.949-0500" pingTime="191" getTime="-1" changeFrequency="0" priority="0"/>
<entry id="1d5f8e37" url="aHR0cDovL3d3dy5wYi1uZXR3b3Jrcy5jb20vaW1hZ2VzL 1VudGl0bGVkLTIucG5n" length="86525" modified="2007-04-14T09:56:04.000-0500" state="2" mimeType="image/png" encoding="UTF-8" httpCode="200" level="1" requested="2007-09-28T12:52:54.961-0500" pingTime="1135" getTime="-1" changeFrequency="0" priority="0"/>

I do not remember the xml file looking like this on previous uses. Am I doing something wrong?

No title in xml sitemap

i have used other xml sitemap generators and they all have seem to have more fields then this one. the only one this one makes is the url. How do ii make it include titles as well as priority,..etc

thank you
any help appreciated

Can’t export sitemap xml

When I export the sitemap XML I provide the filename, then click Save and nothing gets saved. I see a screen with a red button with an "x" followed by "name", then a green check mark with an "OK" buttn. I click the OK button. I check the exported sitemap file and it is zero bytes in length.

I can export everything else in the export menu – no problem, but the sitemap XML file won’t export.

I used to be able to use the sitemap generator tool without this problem.

What is the problem?

Thanks in advance.

uploading sitemap to website

We have made the sitemap and have no errors. Now we can’t figure out how to upload the sitemap into the home page of the website. Help please!!!!!!

Blank Window

I cannot get the new Webmaster Tool to work in Opera. The old one worked great and was so easy I just let my clients use it themselves. I decided to use the new tool to generate a sitemap for my own website and discovered this problem. Opera says it is loading an Applet but after the Java Security check I just get a blank page. It works OK in Internet Explorer, but I hate IE! Does anyone else have a problem using the Tool in Opera? Any ideas?
Thanks.

numerous ‘sleep interrupted’ errors

Hi,

I keep getting numerous ‘sleep interrupted’ errors, and none of the photos are indexed. Please advise.

Thanks,
JoeB

SMF Forums Sitemap Generator

I have been using your sitemap generator on my SMF forum for a long time now and I really appricate the free service. I have noticed that after generating a sitemap, about two percent of the time, posts come up missing. Any idea?

Thanks again for the free service!

Sitemap Writer Pro

Try Sitemap Writer Pro for Generating XML Sitemaps

Sitemap Writer Pro is a professional XML sitemap creator program. With this program you can automatically generate XML sitemaps for your websites, keep up-to-date, edit them, upload to web server and submit to all search engines that support XML sitemaps (Google, Yahoo, Ask.com and MSN).

It costs $25 dollars Sitemap Writer Pro requires Windows 2000/XP/2003/Vista and Microsoft .NET Framework 1.1.

Nothing is happening when I click SiteMap on Webmaster Tool

Help!!! Nothing is happening when I try to create a sitemap for my site. I haved tried all of the troubleshooting tactics suggested: Made sure the url for the home page was correct, increased Java memory, turned off "respect robots" and "respect no follow" to make sure there wasn’t a problem going on there; changed the user agent in case my server wasn’t accepting Audit MyPC WebMaster Tool. Nothing is working! FYI: The url I am trying to create a sitemap for is www.myunderthesun.com. Any help is GREATLY appreciated.

Generator misses files in one directory

I have used the sitemap generator several times to generate a map for

www.kouroupis.gr

I have always used a piece of html to open pages in a new window
as below

<html>
<body>
<p style="font-family:arial;font-size:70%;color:black">

<a href="http://www.kouroupis.gr/shop/skls2.html" target="_blank">

<img border="0" alt="Sklavenitis Poseidon bronze sculpture Kouroupis,Koutouloufari,ceramics,Greek,Greece,Cret e"
src="/pictures/bronze/broposeidon20x56.jpg" width="150" height="420" />
</a>

Just recently I changed the piece of code to

<html>
<body>
<p style="font-family:arial;font-size:70%;color:black">

<a href="#"
onClick="MyWindow=window.open(‘http://www.kouroupis.gr/shop/skls1.html’,
‘MyWindow’,'toolbar=no,location=no,directories=no, status=no,menubar=no,scrollbars=yes,resizable=yes, width=800,height=800′); return false;">
<img border="0" alt="Sklavenitis Dionysos bronze sculpture Kouroupis,Koutouloufari,ceramics,Greek,Greece,Cret e"
src="/pictures/bronze/brodionysos15x34.jpg" width="150" height="340" />
</a>

This was done so the browser opened without the tool,menubar, and status bar etc.

Pages with the new script are not detected by the generator, and in fact if I force it to read only the sub-directory /shop it still does not index the pages!!

Any ideas?

If you do a test run exclude */pictures/* to speed it up!!

Thanks

Queued Status – What to Do?

I have 4000 images that are sitting in a Queued status – they are all less than 200k.

How do I get them into the Finished status?

Thanks in Advance,

Jesse Arana

Disconnecting Internet Connection

Everytime l try to use the sitemap generator it gets half way through indexing my site and then disconnections me from the internet and fails to find the rest of the other pages it is spidering (obviously because there is no internet connection).

Any idea as to how this is happening so l can prevent it?

maximum limit ?

Hi,

My site contains about 500.000 links and I was wondering whether the Webmaster Tool – Sitemap Generator could handle this… I know it is a lot and also know that google as a limit of 50.000 links per files. Would your tool split the results in 50.000 links files ?

thanks a lot !

Patrick

PS: so far, your tool is the best around ! I looked a lot and it is really the fastest ! I just wonder whether it can handle my site…

not working for me too…

I too keep getting an error when submitting the sitemap to google …they keep saying it is not in the right format…. ????

here is the link to where my site map is in my site…
cupcakedreams.net/site08.xm

I for life life of me do not know what is wrong …I also tried to do this a month ago with the same results ?? any ideas whats wrong ???

Great tool

Took a little tweaking, but what a time saver! Great tool, super results.

Bug with ä,ü etc.

when I export a Sitemap the xml file as an error with links that include

Cannot fix memory problem

Hello,

First of all, thumbs up for a great tool.

I’m having problems completing the sitemap, due to low memory problems. I’ve tried what you’re suggesting on the help video (adding "-Xmx512m"), but so far all I’ve got are error messages: "The Java runtime environment cannot be loaded".

I’ve followed another member suggestion and tried "Xmx512m", instead of "-Xmx512m" and that did load the environment and the tool, but looking at the System tab, I see that the memory allocated is still 95,3MB. That has to be the default value, right?

Am I doing something wrong? I’d really appreciate any suggestions. The tool is great and I’d like to finally get the complete sitemap.
BTW, our site is www.in2life.gr.

Thx in advance
NasosP

how can I include a WordPress installation also in my sitemap?

Jim
It’s been a while since we spoke, have been busy building.
I have a question I can’t seem to find the answer for
In your instructions you say re placement of sitemapxml

[Now, you need to upload the new sitemap to your website. It seems every hosting company has a different method of doing this, but they are all basically the same:

Think of your sitemap.xml file as any htm (html, php or asp) file that you're going to place on your website.

There is probably some type of import option that your hosting company provides you - use it to move (FTP, Publish, etc) the sitemap.xml file from your computer onto your website.

Place it in the same directory that holds the main page for your website.]

I have a construct that contains sales letters set up as
www.StephenHenryConsulting.com/letter with "index" so it is the default read file at my public_html section

I also have set up WordPress in my public_html section as I believe it will help my seo (I know this has it’s own sitemap but I have not activated it yet)

In the past (on other sites using th esame construct) I have set up your sitemap under my /letter folder so do I have to set up 2 different sitemaps to include WordPress or is there a place to put 1 sitemap that will include both
Tell me if you think this is stupid
Cheers
Stephen Henry
Brisbane Australia
P.s. I tell everyone about your product, it is (still) fantastic! Keep up the great work.

Problems accessing Sitemap using Vista

Hello,

The only way I can save the sitemap files in Vista is to save them to my personal folder in the Users folder. Now I can’t seem to open it or send it to anyone. The file is useless unless I can access it, which appears to be impossible right now.

Any ideas would be very, very appreciated!

Thanks.

Alan

Only my index page is showing in the sitemap

Dear Jim,
I would like to thank you for this great application! I am only facing a problem with the sitemap by not reading any other pages i have. In the XML sitemap only the index page is showing. I have tried all the things you said about checking the robots and metatags but still the problem persists.
Can you help me please !!!

Thank you alot,
Have a great day!
Misha

Cannot Update Sitemap

I have created and saved my sitemap. I reloaded the sitemap after modifying most files on the site (including the index file) but am unable to get it to update the sitemap – it just reads the index file and stops.

I am able to create the sitemap from scratch, which works fine, but I don’t want to put all the Priorities back in. I did tick the ‘Download only new and modidfied files’ but without luck. Anything else I should be doing?

Sitemap links

Sorry if this is a dumb question, but how do I change the colors of the links generated for an html sitemap. Mine come out blue and would like to change them to green. I’ve been trying to use my FCK editor, but it won’t work. When trying to edit the file in Crimson Editor, I can’t see where to change the link colors.

I cannot export xml sitemap

I have VISTA OS. I can generate, but I cannot export (save) the xml sitemap. It shows that it has been exported, but I cannot find it with Windows Explorer. I tried different folders: no difference. Please help. Thanks.

Including images

Hi,
Thanks for this really useful tool. I have used it a few times now, but I’d like to ask a question about how to include images. When your tool scans my site it finds and lists all my jpegs, but then when I click ‘export’ and make an XML sitemap the jpegs are not included in the sitemap, only the html pages. Am I doing something wrong? Thanks
Ian

Sitemap Generator not working

Howdy, First of all I want to say thanks for such a great online tool.

I have used the generator for a few months but havent been able to access it today. In firefox I just get a blank page and in IE7 I get this error message:

Line: 35
Char: 5
Error: Object doesnt support this property or method
Code: 0
URL: http:www.auditmypc.com/xml-sitemap.asp

Error in the google.xml file

I get an error message when I try to see the sitemap file.
The error is:

Parse error: syntax error, unexpected T_STRING in /home/sexylek/public_html/sitemap/google.xml on line 1

You can see the sitemap at this url:
http://www.sexyleker.no/google.xml

I created it using the tool on this site.

Hope someone can help me

Leif-Harald

Sitemap only contains top level URL

Hi,

I’ve used the Sitemap Generator successfully with two other sites, but am unable to make it work with my new one. All 3 are frame-based sites so seems like that can’t be the problem. What happens is that it only finds the initial URL and not any of the pages, so the XML looks like this (NB my site is not really called "mysite.com").

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>http://www.mysite.com/</loc>
<lastmod>2007-06-11T18:59:20-07:00</lastmod>
</url>
</urlset>

When crawling the Crawler tab just flashes "connecting" then finishes immediately.

I’ve tried setting various parameters and nothing changes.

What am I doing wrong?

Thanks for any help you can give!

Elisabeth

Title is missing from all but the index pages of my site

Hello,

I am generating a sitemap for www.retiredswarovski.com .

The only page appearing with a title is index.html. The title is missing from all other pages. I have verifed that the pages do indeed have titles.

I have tried to generate a sitemap about 10 times over the last 3 days, with the same results.

Any suggestions as to what is wrong with my pages or if perhaps this is a bug?

Thanks,
Angie

Nice Tool – Couple of questions

Tim,

I have watched the video, gone thru the initial process and determined that I have some duplicate keywords and links due to a menu that expands as false or true. Do I simply select exclude to remove one of each from the XML or should I have our team rewrite the code on the site? I don’t want to get penalized for duplicate content. If you would take a look it would be appreciated. datastreamservices.com

Also there are a number of options that one can select that aren’t fully explained in the video. I’m interested in the frequency tab and would like to have a more detailed explanation of what this refers to and how best to utilize it.

Lastly, I attempted to find a written document on your site that has greater detail about the full functionality of the tool. I noticed in a couple of posts that you mentioned you hadn’t yet completed it. If you have would you point me to where I can find it.

Much thanks for the tool, I’ve been bugging our IT team to create this and finally decided I would do it myself. Your program has made it very easy.

Regards,

Craig

Is it an absolute vs. relative url error or something else?

Hi Jim,

It’s me again. I keep picking away at trying to find the error on the pages pointing to this HTTP 404 not found link. You mentioned the possibility of an error with an absolute vs. relative url.

http://www.zone4kids.com/

I just can’t find it. I’m at my wits end. I checked with my original hosting company who managed just my domain name until I setup my site and redirected it to my current hosting company. They are not able to look at my zone file to see if both my relative url zone4kids.com and my absolute url www.zone4kids.com are pointing to the same place. They said I needed to check with my current hosting company.

I’ve written to my current hosting company and they said everything is setup properly and to stop beating myself up over this. They are not able to find that link anywhere in my code. The keep pointing me back to the sitemap generator software.

In trying to get to the bottom of all this I corrupted my site Saturday morning by removing my domain and trying to reload it to the server. In that whole debacle I learned from my host that I had a lot of extraneous code in my site because I was copying the complete code from Microsoft FrontPage and inserting it in to fields the hosting company’s software wizard was using to generate pages. In a sense I had duplicate html headers, meta tags etc.

I’ve since stripped out all the extraneous information (hours and hours to do it) and reloaded my products and pages. I ran the sitemap tool again last night and this http://www.zone4kids.com/ error is still there. The good news is that all the other errors I had are gone. I have a few images that I know are less than 100K that have connect time issues, but I can shrink the images to improve connect time.

I wonder if this may be a clue. The very first time I ran your tool I put in my site address and the tool didn’t run. I don’t remember exactly what the error was or how I put the address in the tool. I immediately ran it again and it created the site map, but with the error mentioned above. I don’t know if something is corrupted in memory on my PC or what. If I put in the absolute or the relative url I still get the error and it doesn’t matter if I use Firefox or IE. Both have the same error.

I’m really stumped and really bummed out by the whole thing.

Thanks for listening.
Loretta

Images showing as failed when I run the sitemap generator

Hi Jim,

Steve here…..the license plate guy from Tennessee. I’ve been working away on my site and getting closer to being ready to add a XML site map. My site is:

autotags4u.com

When I use your generator tool and don’t exclude images (images are important on my site to show folks what I sell and can do) it shows lots of errors that I don’t understand. The images are actually on the site where they’re supposed to be (I think!). In other words…..the sitemap tool says that it failed….but when i look at my site….the image is there where it’s supposed to be. I don’t understand.

What am I missing? Any info would be appreciated.

One more question: Am I correct in understanding that now the XML site map is all that one needs for all of the major search engines? Or, do I need to to have a different sitemap for the different engines….google, yahoo and msn? Please advise. Thank you again!

Hope all is well with you and your family.

Best Regards,

Steve Edwards

Google and others not seeing external links

I did the sitemap generator in April and uploaded it and pointed to that file in my Robot.txt file. When I ran the sitemap generator I used a lot of excludes, including images, because I have ecommerce site – www.bluesagebeads.com

The problem is the Google, MSN and Ask are not seeing my external links when I do a link search. For some reason, Yahoo is seeing them – about 900. The weird thing is, when I log into my Google sitemap account it shows some external links – about 78. All of the SE’s are seeing my pages, but not my external links. I’ve posted to Google Webmasters forum and really didn’t get anywhere. The answer I got was that Google is won’t list links unless they are high-quality links. Considering that DMOZ and Yahoo (the second largest SE) should be considered high-quality links and they’re not being listed, I don’t think that answer makes much sense.

It seems to be more of a technical problem that isn’t exclusive to Google. Could the problems be the excludes? Any ideas on what the problem is would be appreciated. Thanks in advance.

Helkat

Removing dynamic urls

Hi,

I have a dynamic website with an OsCommerce shop. Because of this every URL has something like "?osCid=…" after it.

The problem is that the sitemap generator repeats the same pages mulitple times, each time with a different "?osCid=…" reference. So the generator can have 100 pages listed but actually it’s just 5 pages each listed 20 times.

I’ve tried to filter out these but it just seems to not be able to get past the first page.

Is there anyway to filter these out from the results? The "?osCid=…" part is not required to view a page correctly

Thanks for any help.

Can not open saved project

Hi,

I have been using your site map builder and I can not open my saved project. This is the error message I get.

java.lang.NumberFormatException: Invailid Character : in base 64 string

I also have another matter.
After building my site map I got a bill for $120 excess usage from my host. Is the a way around reducing this in the settings.

Thankyou,
Toni-ann

Sleep Interuppted?

Can you please tell me what a sleep interuppted error is?

I started the crawler, but realized that I forgot to put an exclude for *.css extentions. So, I stopped it without saving and reloaded my former saved project file that I had saved prior to running the crawler. The former saved project file only had settings saved to it. BUT, when I started the crawler again, 2 of the 5 threads I was running stayed at 0 B/s for several minutes. So, in the crawler tab, I selected the corresponding threads and stopped then re-started them. When I did, I got these sleep interuppted errors.

What does this mean?

Can I re-crawl these urls?

How?

Pausing the crawler

When creating a new sitemap do I need to leave my browser open and running when I am away from the computer until the sitemap is complete?

Can I pause it and re-start it from where it was paused?

If so, can you explain how to do it without losing any data?

I have a large site and I know it will take some time to complete the crawl so I need to know. I also need to know how to re-crawl certain pages after an error is found and then fixed so that it may be included into the sitemap properly before submitting.

where is the exported xml file?

I found my file, I tried to save it to a shortcut on the desktop, this could have been the problem.

Generator help please. TYIA.

I saw a thread with the same question, but can not determine the solution. I am running the generator and am getting:

Warning: Entry processing failed, url=http://www.bridalgown.net/StoreFrontProfiles/deluxeSFshop.aspx?sid=1&sfid=101638&c=40001211, error=Connect timeout

I have also tried to slow it down.

Help? Thanks, in advance.

JEL

Unique Titles – How Many Characters?

Hi,

Does anyone know how many characters the search engines look at to determine the uniqueness of a page title? Do they search the first 40 characters or some number or do they check your whole title?

I have some pages that start with the same words/characters so I’m wondering if the search engines go far enough to see they do become unique.

Thanks,
Z4K

Google Webmaster Tools Error

My google webmaster tools is reporting this error…

"We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit."

The sitemap is at http://www.psychedelic-art.net/sitemap.xml

Everything looks fine with the sitemap to me, anyone got any clues?

Thanks,

Duane.

Export Problem

I have a problem with exporting a sitemap to my computer. The program works fine till I want to export a file to my computer. He saves the file on my computer, but the file is empty, thus useless. Could someone help me with this?

Keyphrases become Keywords

Hi,

I have a question and this may not be the right forum for this question so I apologize in advance.

My hosting company takes what I consider to be a keyphrase that people search and turns it into individual keywords. For example they take the keyphrase "flower girl dresses" I put in the keywords field and turn it into "flower, girl, dresses" They say that if they didn’t do it that way the site would actually be penalized on keywords.

I’m concerned about it because I’m not sure how the search engines search when someone types in a phrase. I’ve seen the metatags in the source code for some other sites and the the words are together as a phrase.

Any thoughts?

Thanks,
Z4K

Removing date & time stamp

I would like to be able to remove the date & time stamp <lastmod> when exporting the sitemap. The site is updated regularly and the sitemap will need to be updated every time too. Is this possible and if the <lastmod> entry is out of date will this affect search engine ranking?

Sitemap for a blog

Hi,

I am trying to create a sitemap for my blog and someone at an seo site gave me the link for auditmypc.I love the software that helps with the sitemap generation, but I am not sure if I am doing it right.Does using the ‘exclude filters’ showed in the video – *trackback/*, *xmlpc" (not sure about this one), *wp-admin* , *css , */wp-content/* , Testing.php , testing.php also work for a blog ? I was getting too many errors in the sitemap for my blog.So,I took out the videos,news and adsense to create the sitemap and then put them up again.I don’t know if it makes sense though.Could someone tell me if we can create a sitemap for a blog ( at blogger ) in the same way as for a website ? I am getting too many errors at the moment.:~

Images Not Saved In Google XML Sitemap

Hi Jim

I want to create a sitemap for my site that includes my images directory entries. When I run the script at http://www.auditmypc.com/xml-sitemap.asp it lists them all in the Sitemap window, however when I export the results as a Sitemap XML none of the images URLs are included.

Do I need to carry out an extra action for them to be saved?

Duncan

Crawl Errors – Finding old versions of pages

Hi,

I’m new at this and don’t really have a clue how to get to the bottom of old data coming up in crawls.

Google showed I had a couple dozen 404 link errors. I started researching sitemaps and tried a few (horrible tools) and then found your tool. (Great tool BTW!) When I ran your tool I came up with the same errors as Google and a few more. Your tool was great at showing me the connections to the bad links. However, when searching those pages to correct the errors I can’t find any links to the bad links. I also noticed that a lot of the pages were in some old formatted templates I was trying out.

I asked my web hosting company if there was stranded data on the server because the urls pointing to the bad links show old images of my site. I also noticed with your tool that my titles are missing. I know they were not there when I first launched my site so again I think somehow my new data is not being crawled.

My hosting company said they could not find any of the bad links and recommended I contact you and Google. Contacting Google is impossible….your site is much more user friendly!

Thanks,
Z4K

max level set to zero but spider crawls past root

Hi,

I have a very large site and don’t want to crawl all links so I set the max level to zero but the spider didn’t stop at root level.

Any suggestions?

Thanks,
Anna

Sitemap stops after only 2191 of 9600 pages

Jim,

Excellent program. I especially like the "map for people" html format.

I have tried 10 or 12 times over the past 3 days with various Settings but the site map always stops at around 2191 (including images) even though I have approx. 9600 pages on the site.

Initially, I saw 5 or 6 Low Memory Problem errors, so I increased the Java memory to 256 and those errors disappeared.

I tried Load From Server, and later tried Load From Anywhere, same result.

I tried setting Connect and Timeout to infinity but same result.

Site is www.pumplocker.com

Thanks

Is that Normal?

My sitemap result look like this : (it’s only part of the SiteMap)

HTML Code:
<properties sitemapView_rowFilterVisible="false" mainView_currentView="settingsView" sitemapcolumnFilterVisible="false"></properties><urlChecker connectTimeout="-1" rateLimit="-1" readTimeout="-1" saveContentToFile="false" saveContentToView="false" url="http://"></urlChecker><sitemap><entry id="8013678" url="aHR0cDovL3d3dy5oYWlyc3RyYWlnaHRlbmVyc29mZmVyLmNvbS8=" length="-1" state="2" mimeType="text/html" httpCode="200" level="0" title="UHJvZmVzc2lvbmFsIEhhaXIgU3RyYWlnaHRlbmVycyAtIFN0cmFpZ2h0ZW5pbmcgSXJvbnM=" requested="2007-05-05T21:52:47.062-0400" pingTime="109" getTime="1375" changeFrequency="0" priority="0"></entry><entry id="a7a5d38c" url="aHR0cDovL3d3dy5oYWlyc3RyYWlnaHRlbmVyc29mZmVyLmNvbS9pbWFnZXMvdG9wLmdpZg==" length="-1" modified="2007-02-22T20:14:33.000-0500" state="2" mimeType="image/gif" httpCode="200" level="1" requested="2007-05-05T21:52:48.968-0400" pingTime="125" getTime="-1" changeFrequency="0" priority="0"></entry><entry id="39359ae4" url="aHR0cDovL3d3dy5oYWlyc3RyYWlnaHRlbmVyc29mZmVyLmNvbS9pbmRleC5waHA=" length="-1" state="2" mimeType="text/html" httpCode="200" level="1" title="UHJvZmVzc2lvbmFsIEhhaXIgU3RyYWlnaHRlbmVycyAtIFN0cmFpZ2h0ZW5pbmcgSXJvbnM=" requested="2007-05-05T21:52:49.078-0400" pingTime="281" getTime="3141" changeFrequency="0" priority="0"></entry>

Strange code, is that how it should be? I don’t remember seeing this code in the past

301 Permanent Redirects

Hi Jim

First, thanks for such a great sitemap maker.

I’ve just made my first Google sitemap and would like to know whether I should include my 301 redirects in the sitemap or not?

Also, when I made my first sitemap it didn’t include my images folder content because I had it disallowed in robots.txt file. So, I removed the disallow, but now unless I tick "add skipped links to sitemap" it doesn’t include them. Any ideas on why this is happening?

Duncan

Help in saving Google SiteMap

I am confused when attempting to save the the sitemap into the google compliant form. First of all my tool set does not look like the one in the Webmaster Tool video. On completion of of my website crawl I have the option to Export, not save my site map. I do not see an option for google format. I see am option for URL Raw, delimited, XML, sitemap XML, HTML, and HTML report. Any help would be wonderful.

Filters

I’d like know how can i do to filter some directory.

Example

my domain is www.domain.com

- on rules i set Load from anywhere

i’d like check the follows directories:

- www.domain.com/sport
- www.domain.com/news
- music.domain.com
- film.domain.com

Now, if i insert in Include filters

*domain.com/sport*
or
*music.domain.com*

runs ok, but only for each rule

but if i insert as rules

*domain.com/sport*
*domain.com/news*

doesn’t work.

how can i do to insert all those rules together?

Thanks

Help!

I used your old sitemap generator and had some success with it. Now, when I attempt to use the new sitemap generator, when I click on the icon to enter, I get a blank page with a notation on the lower left that says "errors on page". I haven’t been able to get past this problem.

Using W3C to Validate XML Sitemaps

Jim,

Went to http://www.w3.org/2001/03/webdata/xsv and ran a different google xml map, since I wasn’t having any luck with yours, and got back

"
* docElt: {http://www.sitemaps.org/schemas/sitemap/0.9}urlset
* schemaLocs: http://www.sitemaps.org/schemas/sitemap/0.9 -> http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd
* The schema(s) used for schema-validation had
no errors
* The target was not assessed

Low-level XML well-formedness and/or validity processing output

Error: can’t retrieve "http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd": 404 Not Found
Error: can’t retrieve "http://www.sitemaps.org/schemas/sitemap/0.9": 404 Not Found

Schema resources involved

Attempt to load a schema document from
http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd
(source: schemaLoc) for
http://www.sitemaps.org/schemas/sitemap/0.9,
failed:

couldn’t open

Attempt to load a schema document from
http://www.sitemaps.org/schemas/sitemap/0.9
(source: docElt) for
http://www.sitemaps.org/schemas/sitemap/0.9,
failed:

couldn’t open "

Both "could not open" were in red. I updated the site map 10 minutes ago and reran the validatrion – no changes.

Does "Target not assesed." mean the googlemap xml code wasn’t assesed ?

What should a no errors WC3 validation look like?

Thanks,

Roger

Incomplete sitemap

Hi,
Was just wondering if someone can throw me some wisdom why i cannot get thsi link to show…

i used the sitemap tool on http://www.lasvegascustoms.com and it will not find this link that is on trhe main page in the bottom navigation…
http://www.lasvegascustoms.com/truck…pholstery.html

I do not know why…any ideas…it is pretty importantant link to be missed by the robots…any advice is appreciated…

THANKS,
Mark

Isn’t Google XML sitemap good for Yahoo and MSN as well?

Good Day,

I gather Google, Yahoo and MSN have agreed on a standard xml sitemap format, as per the website: http://www.sitemaps.org/protocol.php

If so, why should we generate different sitemaps for Google and Yahoo? Shoudn’t the Google XML sitemap be good for Yahoo, MSN and ASK as well? Please let me know.

Thanks,
DorinG

Password prompt

Hi,

When running the sitemap generator on my site, I repeatedly get a popup window that asks for my username, password and domain. Portions of my site are restricted to members-only. I have tried entering my site login information but the popup keeps appearing. If I hit "no" or close the popup another window usually appears. After repeated attempts to close the popup window it usually goes away but will often reappear later in the scan.

Is the popup window so the member section can be scanned? What login information do I need to enter in order for it to proceed?

Thanks in advance.
Will

Google Sitemap

When I added Google Sitemap to our site, and checked on Webmaster Tools section, the Web Crawl report showed several "URLs restricted by robots.txt" including index page. Our robots.txt file has only following lines.

User-agent: *
Disallow:

Since then, I have removed Sitemap fearing problems with Google search results. What could be the reason for this?

For your information, we do have some 301 redirect pages, and sitemap was showing old as well as new urls.

Please guide me, so that I can generate a correct Google sitemap and add to my site.

URL of our site http://www.lth-hotels.com

Thanks in advance….

fatal error

The new site and sitemap generator look great Jim but, unfortunately, it is not working for me, I get this message-

16.04.07 22:01:01, Error: Fatal error, cause: jmaster.webtool.model.api.crawler.exception.Crawle rException: Memory low-

I tried increasing the applet runtime memory but there was no change in performance, I still get more than half of my pages as unfindable… any explanation of what this error means &, more importantly, what to do about it will be appreciated.

While I am talking to people who understand computers better than I do I’m going to venture another question: can anyone tell me how to get the favicon (ico) I uploaded to my site’s root folder to work?
thanks,
Paul

Saving as Google and XML doesn’t work

Good Day,

I used the Sitemap Generator for the first time, and I’m very impressed by the many options available! I was able to create exactly what I wanted a sitemap that contains only .html and .jpg files, excluding .jpg files less than 5KB. I used the exclusion filter and the row filter, and they seem to work very well. The sitemap looks great within the Generator. However, when I saved the sitemap, first as Google sitemap, and then as .XML, I didn’t get what I expected: the Google sitemap contains only .html files and only the "Last modified" parameter, while the XML sitemap contains everything, including ping times, etc. I ended up with a Google sitemap of 72 KB and a XML sitemap of 572 KB.
Please help.

Thanks,
DorinG

Help!


Hi Jim

Thanks so much for this site it’s great – but I’m a bit lost. I want to create a site map for humans within my site which I think I’ve done with your tool. I also want to create sitemaps for google etc which I also thought I had done but when I submitted it to Google I go this response
Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.
I also put the location in my robots.txt file
Sitemap: <http://www.speakonlytome.com/sitemap.xml>
My site is www.speakonlytome.com – it’s only an interim site while our main site is being developed but if I can get this right I won’t have to pester you later.

Thanks
Caroline

Google sitemap

Hi

great program – it let me fix problems I didnt know I had !!

when I submit my sitemap to google it keeps coming up with URL recognition errors

ive tried editing the code manually using www. and then taking it out, but google still hates it

any ideas

thanks

amazingwally

Connection timeouts

I am having problems with the site map generator, starting fine, then it seems to lose connection to my server, resulting in failed, connection timeouts as the failure cause, I am using the latest version online..

a c/p from the error log:

04.04.07 13:58:36, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Greek_Language&action=edit, error=Read timeout
04.04.07 14:01:02, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php/Special:Recentchangeslinked/Star, error=Connect timeout
04.04.07 14:01:02, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Star&oldid=20, error=Connect timeout
04.04.07 14:01:03, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Template:Three_other_uses&action=e dit, error=Connect timeout
04.04.07 14:01:04, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Special:Upload&wpDestFile=NewSolar System2.jpg, error=Connect timeout
04.04.07 14:01:04, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Celestial_body&action=edit, error=Connect timeout
04.04.07 14:01:18, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Eris_%28dwarf_planet%29&action=edi t, error=Aborted
04.04.07 14:01:18, Error: Fatal error while processing entry http://www.dsswiki.net/DssWiki/index.php?title=Eris_%28dwarf_planet%29&action=edi t, cause: jmaster.webtool.model.impl.core.AbortedException
04.04.07 14:01:18, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Dwarf_planets&action=edit, error=Aborted
04.04.07 14:01:18, Error: Fatal error while processing entry http://www.dsswiki.net/DssWiki/index.php?title=Dwarf_planets&action=edit, cause: jmaster.webtool.model.impl.core.AbortedException
04.04.07 14:01:18, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Clearing_the_neighbourhood&action= edit, error=Aborted
04.04.07 14:01:18, Error: Fatal error while processing entry http://www.dsswiki.net/DssWiki/index.php?title=Clearing_the_neighbourhood&action= edit, cause: jmaster.webtool.model.impl.core.AbortedException
04.04.07 14:01:18, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Rigid_body&action=edit, error=Aborted
04.04.07 14:01:18, Error: Fatal error while processing entry http://www.dsswiki.net/DssWiki/index.php?title=Rigid_body&action=edit, cause: jmaster.webtool.model.impl.core.AbortedException
04.04.07 14:01:18, Warning: Entry processing failed, url=http://www.dsswiki.net/DssWiki/index.php?title=Definition_of_%22planet%22&action= edit, error=Aborted
04.04.07 14:01:18, Error: Fatal error while processing entry http://www.dsswiki.net/DssWiki/index.php?title=Definition_of_%22planet%22&action= edit, cause: jmaster.webtool.model.impl.core.AbortedException

It ran for about 4 mins before this happened.

I have tried saving , reloading and then used retry failed, it does the same thing each time.

The pc I am running on is a 3.2ghz p4, 1 gig ram running suse 10.1, jave 1.5 and i have set the memory for java to 256, and it has free ram available when failure occurs according to the system info given by the tool.

Any ideas?

Help with exact syntax for filters

Hello,

On each of my product pages I have an "Email this to a friend" link. I do not want to have this on my sitemap because I have over 30,000 products.

On the filter I have tried

*email*

and

http://www.mydomain.com/servlet/send?$email.Target.URL*

Neither works…. The email links are still being queued up which means it is taking twice as long to create my site map.

Am I doing something wrong in regards to the syntax of the filters?

Thanks!

Video Help

Hi…I don’t have any sound coming out of the 60 sec video. I have shut down a rebooted checked my audio and everything seems in order, can you help

Thanks!
Caroline

Google sitemap stops after the index page

First of all I want everyone to know I read the FAQ section before posting this question.

When I attempt to generate a Google site map and enter http://www.acecomputerguy.net….the generator only lists the the index page…www.acecomputerguy.net

Next, I entered http://acecomputerguy.net (without the www) and the generator picks up the entire site. I saved this site map and submitted to Google. I checked the status of the site map the next day and Google reported sitemap errors. Apparently Google expects to see the www on the sitemap url’s.

I manually edited the sitemap with notepad to add the www to all the url’s
and resubmitted to Google. The sitemap is now verified.

However, I am confused as to why the generator only picks up the index page when I enter….http://www.acecomputerguy.net.

Could this issue be affecting how the spiders see this web site.

Thank you in advance for any advice offered.

Regards,
allinet

Windows Vista Uncompabality

I used this java applet of Sitemap Generator in my Windows Vista, Internet Explorer. It worked fine, with crawling all urls and recording them as a list.

When I clicked on Save Sitemap under Sitemap tab, it would save the file, but the file was never saved when I checked the folder.

Again, when I try to save the file second time, it shows as saved, and asks me If i want to overwrite the saved file. So basically the bottom line is, that Sitemap Generator does saves the file, but Windows Vista doesn’t show the saved file, or may be Vista delete’s it unknowingly. From another point of view, I think that this could be some advanced security feature of Windows Vista as well, which stops files saved from web via Java being displayed.

Can anybody throw some light on this topic please?

Cannot download: Webmaster Tool – Sitemap Generator

Hi ,
could someone please help us with downloading Webmaster Tool – Sitemap Generator as we cannot access the download page.

May be provide the direct download link would great.

thanx alot

Xdesign studio

Stripping ?… parameters such as session id

I’m getting multiple results for some pages, with different select, sort, and/or session id parameters.

Example:

2550 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=date
2551 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=state
2552 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=location
2553 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=course
2554 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=sponsor
2555 http://www.avalanche-center.org/Education/Courses/NewMexico.php?tfm_order=ASC&tfm_orderby=cost

Is there a way to get the tool to ignore anything after a question mark? And thus combine entries like the five above into a single one?

Thanks

Jim

HTML pages with a Title

I can’t delete my posts I guess.

This was a about a problem with Titles not being found and indexed but when I ran the tool again there was no problem. No idea why it happened or why it went away but it doesn’t appear to be a problem with the tool.

Google sitemap says: Unsupported file format

Hi, I tried several google sitemap builder tools, read articles, I submitted it several times but google always says:

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

my web site adress is http://www.freewebtown.com/aymavisi
I uploaded it to my root directory as sitemap.xml also sitemap.gz (my root is same as above.)

By the way I can not add my url to google sitemap like this: http://www.freewebtown.com/aymavisi/sitemap.xml
It gives error:
The Sitemap must be located at http://www.freewebtown.com/. To add a Sitemap at http://www.freewebtown.com/aymavisi/, first add that site to your account and then click the Add a Sitemap link beside it.

I can only add: http://www.freewebtown.com/aymavisi can this cause this error?

I really don’t know what to do help please.

Better Stats

Jim,

I’ve got the three types of sitemaps loaded but I don’t see Google search ratios changing. "Numbers after + are successful hits on "robots.txt" files" , and over the last couple of weeks my numbers for Googlebots look like this: 82+13, 90+14, 95+16, 103+17. Google’s last crawl was on the 16th. How long does it take for the Googlebots to take advantage of the google sitemap? My MSN bots aren’t doing to good either: 114+68, 118+73, 141+76, 147+79. What sitemap does MSN use? Yahoobots are at 100%.

Thanks,
Roger

Exclusions not working

I’ve used this tool preiodically, including older versions. Tonight it is quitting without going beyond the first page unless I remove all exclusion rules. I’ve tried making sure there is no empty line, and I’ve tried entering a single exclusion rule. As soon as I do that it appears to read and process the entry page but quits right away. If I empty the exclusions entirely then it indexes the site. But doesn’t exclude anything.

Jim