Sitemap Gen Not Finding Interior Page

Hi Jim and Others.. Again I posted in 2008 (coconutvillasfl.com) and you are great. Hopefully you can help me again…
Site cures.org/index.html found yes the rest is flash
Site gen found it but cant find
cures.org/paypal.html

i put links on index.html for paypal.html but made them WHITE font color so you cant see them but they are in the source code. Any ideas… Im sure
Thanks in advance

Stephen Goulart

Sitemap Details – Unable to refresh errors

I’ve used your Sitemap tool today to re-crawl all my client’s sites. On some I would receive an error in the Sitemap details. After fixing the error, I would go back to your program and click Retry failed. The error would remain. What am I missing in the program that would remove the error. I can delete the error, but I was thinking that the Retry would do the same thing.

Great tool. I went ahead and purchased you a Lobster Gift Certificate.

Please helpppp!!!

When I first used the Sitemap Generator it worked perfectly, however, my site of 24 pages came up with 10 errors…most of them being detected due to my favicon code, one being from a URL that had ‘&’ that in it that was linked in many of the pages and then a deleted sitemap page was detected. Like the video demonstrated, I deleted all of these failed pages, then had the bright idea to close out with exporting the file and restrat the generator and this time include my favicon in the "filter" are so it wouldnt interfere. However, now when I try to generate a new sitemap it only shows one page, my home page as for the total amount of pages on my site and thats whether or not I include the favicon file in the filter or not. I think the ‘&’ url was something important. Can you please help, does this sound familiar. I am building my site on office live, if that helps. Thanks for your attention to this and thanks for this great tool!

Priorities & Change Frequences

Hi Jim,

I’m using v1.1 which is the one taken from your update email.

On an earlier version of your google site map generator I could use the SHIFT and/or CTRL keys to select the urls and change the priorities and change frequences of multiple urls at one go.

This facility seems to have disappeared.
Would it be possible to re-instate this please?

Thanks
Rob

PDF files aren’t inclued when exporting as Sitemap XML

Hi, I’ve got a web site that I want to create an xml site map for using the Sitemap generator. The tool seems to crawl my site fine. PDFs appear in the Sitemap list view. However, when I Export > Sitemap XML, the tool does not include URLs for PDF files. When I try exporting a "URL Raw List" or CSV file, I get the PDF files.

Has anyone else experienced this? The site has 191 pdfs with good content that I definitely want included and I’d rather not manually enter the xml nodes for all of those pdfs.

Thanks in advance…

HELP!!! Too Many Connections And Won’t Disconnet

While running the sitemap program we noticed that after about 8000 pages everything started timing out. Finally got the program closed thinking we would just restart the program but, no, it didn’t work that way. I am getting an 1040 error (too many connection) and it’s been that way for about 20 hours now. It’s lock up the datebase file so that none of the sites on the server are working. All sites are down so essentially we are out of business! We were going to try to restart MYSQL but can’t do that either because it say there are too many connections. Can someone please help us.

Timeout at 120,000+ pages and Xmx1500m

Hi all,

I have been trying this over and over again for the past 2 weeks probably to no avail. I’ve set Java to allocate 1.5GB since that is all the memory I have on this PC, I put it at Xmx1500m and it runs for a long time stopping at about 120,000 pages and there still seems to be more to go after that. I have no idea how much more but I was surprised that our site was this large to begin with.

What do I do? We really need this site map. Can anyone tell me why 1.5GB isn’t enough, and if so what other options do I have?

Thanks

Need advice to generate Sitemap for my website

Hi there,

I have recently encountered a problem with generating a Sitemap for my
website. I desperately need some help here.

The website which I want to create a Sitemap is gemondo.com. It’s
a retail website which sales jewelleries. On the website, it has got
a search bar above product lists.

One of such examples is /index.php?page=cat_view&path=10

Visitors can search necklaces according to different material,
gemstone, style and price etc. by using the search bar.

When I tried to generate a Sitemap for this site ,
it basically crawls all the possible pages that could be generated via
this search bar. Because of this, creating a Sitemap always goes into
a process of an endless loop. The waiting records can
reach as many as 300,000 and still keeps on increasing, but the
website has less than 3,000 products listing on it.

Since the website is a dynamic one, therefore, all the URLs are on the
same level. The only difference is the URL of a subcategory has an
additional code.

For example,
Pendants & Necklets:
index.php?page=cat_view&path=10

Pendants & Necklets > 18ct Yellow Gold:
index.php?page=cat_view&path=10-3

Pendants & Necklets > Aquamarine
index.php?page=cat_view&path=10-21

Pendants & Necklets > Aquamarine > Heart
index.php?page=cat_view&path=10-21-43

In general, I want all the category pages to be included in the
Sitemap. However, it seems impossible in this case. To compromise, I
hope at least the main category pages can be included; therefore, I
tried to filter URLs with “-“. However, nothing has changed.

Any advices, please?
Reply With Quote

frustrated – timing out and exclusions

I am really abou to pull my hair out. The sitemap generator looks great. However, I kept having time out issues so i was able to increase the amount of memory by following the instuctions and I got a lot farther before it times out the next time.

So I tried escluding my images (which I am not sure I really want to do) but I really want to exclude a folder /phpbb/. This folder seems to have tons of files. I put */phpbb/* in the exclusions but it didn’t work. I have read through all of the forums to make sure I am not missing something but I am at the point where I really need help. My url is http://www.bonniesplants.com

Thanks for you help in advance
Layla

Sitemaster Generator Will Not Start

Hi:

I am having problems starting Sitemap Generator. I have used it many times before and it worked great. Now, each time I click on the site master logo a window opens and nothing happens. I have to kill the process to get it to close.

Any suggestions would be appreciated.

John

No Export function indicated un Sitemap tab

Love(d) this tool and have previously and successfully created and uploaded in my sites’ root directories Google friendly .xml files for my two active domains. I have created an updated sitemap for one of my domains, but lo and behold, I cannot see the Export function under Sitemap tab as in Java in the past. My only options seem to be Row Filter and Column Filter.

Any suggestions please?

New content and site map

hi,

Hope this does not sound like a stupid question, but this line got me worried

"If you make a change, say add or delete information from the sitemap, you need to do the following:"

Does this mean the site map does not add automatically new content ?

So if i make a new page I have to redo the sitemap ? or add a few lines to the xml file.

I thought most sitemaps were kinda dynamic ?

sorry if this is silly question

cheers

Sitemap generator not loading

I’ve been using the sitemap generator for a couple of weeks now and its been really useful in getting more of my pages indexed by Google but now I can’t load it. When I click on the icon the Java logo displays as usual but then I get a small x in the top left of the screen (similar to when an image is not found)

Help please?

sitemap not complete

hello. i have dyslexia and a great deal of this is hard to follow – even trying to enter a question here. after a couple of hours reading, i tried your sitemap generator, which was different from the other ones i tried (i’ve been trying to do this for 3 weeks) and all i got was one URL and the rest were skipped. it said it downloaded but when i click on it on my desktop nothing opens up and i dont know where or how to upload it to my site. i know it shouldnt be this hard. i will happily pay for your services for any kind of help. this is the ninth place i have tried. thank you.

Not Respecting Robots

For some reasons its no longer respecting my meta name robots tags. It will respect the robots.txt file but for my dynamic pages it doesn’t seem to care about the meta tag.

Is anyone else noticing this?

Sitemap generator doesn’t get past home page

I’m using IE 7 and I have set "scripting for Java applets" for Prompt. My index.html page redirects to an index.cfm page so I’m also including that in the URL path. But every time I try to generate a site map and save to file, all I get in the file is the HTML script for the index.cfm page.

What’s going wrong?

Problem Loading Site Map Generator

Hi
I can’t seem to load the sitemap generator in my browser (using Vista, I.E.7).
When I load the screen from the browser, I get the orange Java Box & see the white loading bar giong up. When the loading screen goes, instead of loading the generator, I just get the grey background. I’m using the latest java version & have tried the memory workaround (-Xmx256m).

Thanks in advance for any help!

XML File not saving to PC

First I’d like to say that this tool is awesome!

When I export the file into an XML format it shows that it’s saving to my PC but when I go to look for it, it’s no where to be found. I re-save it and I can see the document in the Save As box but not on my PC. What can I do to get that file I created? Doing a search and find can’t seem to find it.

Thanks

seonut!

Sitemap Errors

Hi,

I was very excited to find your tool Sitemap generator, we are brand new website owners. We purchased a website from a company online and I am creating a sitemap for google. I have done what was laid out in the video in order to create my sitemap, I have also uploaded it to my site. Now I am trying to submit it to google but I am getting the following warning errors

Line 3

Status Invalid XML tag
This tag was not recognized. Please fix it and resubmit.

Details
Parent tag: urlset
Tag: entry
Problem detected on: Nov 9, 2008

This same line showed up 10 time.

Please let me know if there is additional information you need to help me resolve this problem.

Gooogle Unable to Find My Website…

Hi Guys,

Apologies if this question seems a litttle silly but i’m a complete novice & unsure what to do, I have created a very basic website but Google is unable to find this – when i try to figure out why (after submitting the site map to Google), I get the following:

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

Has anybody got any ideas? I built the site using MS web expression and like I said above – I’m a complete novice so I’d appreciate any help I can get.

Many thanks

Emma

Re: Help!

Hi All

I ran the generator and only 2 pages came up of a much larger site?
The home page and another minor file. What do I do?

Thanks
Caroline

Returns “///”

When I scan my site I get these locations in the site map:

mysite.com/about
mysite.com//about
mysite.com///about

If I let it go, it just keeps adding in slashes and it will do this for every file in the site.

Not sure what is going on here, but here is my exculde filter list:

*.JPG
*.jpg
*.gif
*.GIF
*.inc
*.INC
*/application.cfm
*.css
*.CSS
*.js
*.JS
*/news/includes/*
*/news/julsep04/*
*/news/english/*
*/news/spanish/*
*/news/test/*
*/images/*
*/Templates/*
*/pubsandorgs.cfm
*/query.cfm
*/XMLFeeds*
*/Graphics2/*
*/activitymessage/*
*/loginrequest/*
*/CSS/*
*/borders/*
*/admin/*
*.cfm?*

Thanks,
-HB

Memory Low

Jim

I have used your Sitemap generator successfully before and I have read the posts on expanding Java memory but when I enter -Xmx512m into the Java runtime settings for both installed versions, it doesn’t make any difference – the Sitemap Generator still runs out of memory. When i go back to the Java settings, I find the Runtime parameters boxes are empty. Am I doing something wrong? Can you generate a sitemap for my site (www.bbnint.co.uk)?

Blair Nimmo

Sitemap Generator Used to Work

Hi, The sitemap generator used to work great for me. I’m not sure what changed, but now it just keeps going and going. Looking at other threads I would guess that I most likely upgraded my java version. I didn’t see any solutions to the problem, just a repeated request for the website address having the problem. My website is jeanshub.com

I can’t say for sure that I upgraded java and then the problem started, but either way, it’s not working for me anymore. I get 5 showing green and other than that it just keep processing, way more pages than my site has.

Kbrown5523

many links “skipped”

How come the generator would be skipping most of our links? I am unable to see expanded information; under the "error" column, the skipped links say "server load…" and I cannot see the rest of the message.

Thank you for any assistance,

Babs

Total URL in sitemap = 0 / Unsupported file format

I generated one sitemap with use of your free sitemap generator tool. I submitted the link to Google and its status came back as OK.

Problem is, under URL submitted all I’m seeing is –. When I click on more details it reads:

Total URLs in Sitemap 0
Indexed URLs in Sitemap –

Why is it saying there are 0 URLs in the sitemap??? What could I be doing wrong? The generator indexed over 8,000 pages of my site.

I did many try to fix the problem.

To fix the problem I did save .xml file as www.myweb.com/sitemap.xml location but this time I got following error

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

earlier it was www.myweb.com/file/sitemap.xml for which I get OK status.

Why google Webmaster tool gives different status Error for same .xml file ?

Any help would be GREATLY appreciated!
Reply With Quote

UnSupported file format.

Thanks a lot for providing free sitemap generator tools!

I have create sitemap with your tools then put that .xml file on
www.mysites.com/sitemap/sitemap.xml its looks fine in IE but when I did upload on google webmaster tools It is given error "unsupported File Format" there. would you please find the reason for that?

Waiting for your reply.

can’t access sitemap generator

I can’t access sitemap generator to generate sitemap
http://www.auditmypc.com/

I clicked on the icon, then:
Path:
http://www.auditmypc.com/free-sitemap-generator.asp
verify certificate
http://www.auditmypc.com/xml-sitemap.asp
this is what it says next:This tool respects sessions, so make sure you are LOGGED OUT of your website BEFORE generating a XML SITEMAP!
(I am logged out)
Then goes no further,no buttons,no links to click,nothing but the above message.

My specs:
Firefox 2.0.0.17
Macintosh OS: Panther 10.3.9
Firefox preferences:
Enable JavaScript is checked
Enable Java is checked
Thanks in advance for help.
Art

Best Way Plz

hi,
thx lot for a great tool.
i am trying to use the sitemap tool on africabo.com and for the first 3 run it returned errors , then i increased the memory to 512mb and still going for a bit now but i fear its going to time out soon too as the memory is going up fast.

I just want some hints yu can gimme to lower the sitemaps pages plz as so far i have about 30000 pages queued and still going up.

also the site is multilanguage eng. and fr. so crawling samething in both language seem a bit lot and some pages aswell don’t have any data yet.

so plz can u gimme any good advise/tip on how to effectively use filters in my case plz to make it easier for bot ?

thx in advance

Excluded images but image files still show on sitemap

I’m building a sitemap for my site and I still get image links (ending with .jpg) in the results even though I tick the box "exclude images".

Thanks for your help.

Jos

not all url crawled

NOt all of the directories at my web site are crawled when I try to create a site map.

The first web site http://www.searchnowfind.com was crawled successfully, all directies were crawled.

I try to crawl http://www.mark9.com and only bout 1/2 the directories are crawled.

What could be the problem?

I’m not excluding any file names.

Thanks
Steve

Totally Bamboozled!

Hello & I apologize in advance for my ignorance.
Have just completed a new site, registered with Google & wanted to add a sitemap.

Ummmmmm…suspect I’m doing something wrong, as I’ve tried to use your wonderful SiteMapGenerator a couple of times, but it seems to be crawling for hours with thousands upon thousands of lines………and no sign of stopping

I’m confused about it all – worried I’ll "damage" my submission with Google by over-supplying (?)……..

I have tried to find info about all of this, but am over bamboozled by the amount of info out there – so thus very confused!

sorry if off topic: Also confused about robots.txt – is essential? or not? excluded files, and so on,
(so yes – I am very sorry for my ignorance – but I’m really at a loss……)

Thank you

Unsupported file format

I have searched threads with this problem and find several people that got this error because they were not exporting and instead saving the project as a file. I did the export xml sitemap & uploaded to Google but it still doesn’t recognize.

If I open the file with word & compare with google instruction for Sitemap guidelines everything seems to match up???

inconsistent backslash generation

note:
sorry if this is a repost. i was automatically logged off and not sure if my previous post was recorded. as such, i repost the same post again.

now i finally undertand why you exclude(skipped) files without backslash(/) in the sitemap.

it’s a requirement for a file to end with backslash as stated in the google webmaster guide.

but i found that your system have an incosistency when comes to backslash.

for example, i’ve a file in such format "abc.com/dir/subdir/index.php"

the system should generate this format "abc.com/dir/subdir/" for it to be valid.

but ironically, the system sometimes generate a file without backslash format such as this "abc.com/dir/subdir" and sometimes generate both formats.

it’s very rare for the system to generate correctly i.e. in this format "abc.com/dir/subdir/"

this has cause almost 50% of my files to be treated as skipped files.

yes, i know i can include those skipped files in the system settings but it’ll make those files invalid for xml format anyway in the eyes of google.

perhaps you should allow us to edit the Location to add a backslash at the end of those incorrect files format.

this would be much easier instead of detecting the bugs.

please help.

thank you.

Various Issues

Hello I like the sofware, but it really seems to struggle on large site.

So in an effort to overcome this I did some URL rewrites and specifically aimed the crawler at the forum first.

I get the following issues.

1. It will run then at around 4k urls threads can start to hang and rather than 5 or 7 threads being proccessed it’s 2 while the other look like they are still streaming but no KB being transfered.
2. I know in the forum alone it should produce over 100k but it if lucky I might get 10k.
3. Memory isn’t an issue I’ve a quad processor and with 4GB ram and I assigned 1gb to java, so it shouldn’t get any memory issues.
4. It would be nice to see a feature that if threads hang you can kill them and reset them. Even if I stop the crawler this has no effect.

Don’t get me wrong I think the tool is fantastic just wish I would work much better. If you’ve any suggestions please advise. The URL is www.rcheliaddict.co.uk

My other option is to write up some .php and generate my own site map tapped direct to the db and output the .xml’s. Time is the issue and I’d rather use a reliable system than have to start on a code.

Site map geirator

I have read the entire page about the site map geigator and the thing is were do I find the thing.
TINMAN61

travelmondo.biz

Hello, First let me introduce myself , my name is Andrea from Nazareth Israel ( sorry about my English ) , I had generated several XML Sitesmap with your program, but all the time google shows ERROR ! I resubmitted its again after I had encoded it and tell goole , but nothing change!

my question is, if I generated the map right or not? I excluded images, I deleted errors , but files include or include , I don’t know what to do in this field and how, I need help please

2- if I had uploaded the Site map in the right way to my site! I just uploaded the file as without any changes after the generating, then I linked it under a text (Sitemap) in my site (as I link an URL ) but in this case I liked a file ,is that right?

3- I placed the file top in the page (one time bellow the robot agent file) by the way, I had uplosed the robot in the same way like the map! and then I placed the map above the robot ( this 2 text are not nice to visitors to see it on the top of the page.

PLEASE HELP because I’m not internet guru (I know basic using of the webmaster tools) and because of my English to under stand.

If you laike to see where I have placed the map in my site here is the link travelmondo.biz

Exclude Filter Doesn’t Seem To work

First got to say the Sitemap Generator is Great. Glad I was able to find it online.
But,,, the exclude Filter doesn’t seem to work correctly, or I might be doing something wrong.

I have tried */amember/* this seems to work
*/GoldTraining/* This doesn’t work This is a Sub-Domain of my base domain. I have also tried */goldtraining/*
*/FreeMember/* this also doesn’t work. This two is a sub-Domain. Both sub-domains are Joomla CMS programs.
*.zip works nicely.

My domain is http://acssystem4u.com is a setup with 3 different Joomla programs running different levels of member training. Right now there are over 3000 pages between these 3 programs with duplicate info but different types of links.

Thank you for any help you can give.
Robert Anderson
Admin

Links with “403 forbidden” error when creating a sitemap

I used the http://www.auditmypc.com/xml-sitemap.asp tool to create my sitemap, but I got most of links (85%) with the "403 forbidden" error.
Whan can be the reason..?

Thanks for any help

sitemap problems

Hi i have used the sitemap generator today and works well. exported it as sitemap.xml, then uploaded it to my server using ipswich ftp. thats where the problems begin. when i seach for my sitemap i get the error messege access denied, with the extension of xsl intead of xml.
i then imported it into my site as an html file google then tells me its full of errors because it is the wrong format.
any help would be greatly appreciated.

can’t find exported sitemap xml

Hi I generated and exported a sitemap xml to my docs. i have gone into my docs and can’t find the exported file But when i go to export the file again The original exported file shows up, I am using vista home basic.

Error: Service Unavailable

In the past I have used auditmypc.com’s sitemap generator tool to create sitemaps.

I recently tried to create an updated sitemap using the tool. When I did this, only the links directly from the homepage (26 links total) came up on the list and most (21) of those failed with the error, "Service Unavailable."

How can I fix these issues and create an updated sitemap? The site I am trying to create the sitemap for is AdHocElectronics.com

Thanks for any help.

Arabic – utf problem

Hi your tool is great, the best…
but it cant support arabic / utf8 URLs …
I tried to sitemap arab-jokes.net and google rejected the sitemap because some characters where invalid….
<loc>http://www.arab-jokes.net/?query=jokes-2&#x26;name=??ت-سع?د???</loc>

Which have to be
<loc>http://www.arab-jokes.net/?query=jokes-2&amp;name=نكت-صعيد</loc>

Failed index.html

A complete sitemap is generated in spite of index.html being flagged as ‘failed’. Expanding the error line doesn’t give any details which are helpful to eradicate the problem.

Help please?

Ken

more than 3 pages not included

Hello everyone , I had submitted the site map 6 times, google said O.k ! but it doesn’t shows all my links for travelmondo.biz, and the more important more than 3 pages URL not included in goole ?! in the sitemap itself I see all my pages URL. what could be the reason?

cheers
Andrea
Reply With Quote

links not followed

Hi,

I have stripped the index.html from all links on my site, because google reported that I have duplicate titles. Google doesn’t consider the link with and without the index.html to be the same anymore.

Now I have the problem that your sitemap generator doesn’t follow links without filename extension. What Now?

Jos

exclude problems

This is a nice tool, but I must be doing something wrong on exluding.

I try to exclude file types and directories but they show up anyway.
This is what I use

*.jpg*
*/members/*

etc. etc.

But those things still show up.

Am I doing something wrong?

url without filename ext. not crawled

Hello,

I have had a problem with google, who assumes that I have duplicate titles, because google consideres /directory/ and /directory/index.(s)html to be different. So I removed the index.html from all url’s implemented a rewrite to redirect every index.html to only the url + dir name.
But now your sitemap generator does not follow the link http://example.com/directory/ What now?

Jos

301 Redirect creates extra urls in sitemap

I have implemented a 301 Redirect to send all variations of mysite.com/index.php to mysite.com/.

This should prevent "duplicate" urls and, theoretically, help my Google ranking.

Now, however, Sitemap Generator creates a sitemap with both the pre-redirect addresses and the new, redirected addresses, which sort of defeats the purpose. And Google’s sitemap manager does not like all these extra urls.

So, should I go back and change all my internal links to reflect the redirect? I’d rather not, as I thought the redirect was intended to handle that job.

Or, is there a way to filter the old urls (mysite.com/php) out of the sitemap?

Generator quits unless I re-create project every time

I’m able to create sitemaps the first time out, but when I re-open my project and run the crawler, the session craps out each time with this error:

Code:26.08.08 01:37:02, Error: Fatal error, cause: java.lang.InterruptedException

I’ve found the only thing I can do is create a new project every time out.

The Sitemap window still shows all 689 urls from my previous run. I was thinking that I might clear the window before re-starting my project, and that might prevent errors. Is there a way to do this?

Thanks.

Error: Crawler failure …

I have already used and tested Sitemap Generator without any significant problems. But, now when trying to generate Sitemap for my site I’m getting strange error: "Crawler failure: class "org.apache.regexp.REUtil"’s signer information does not match signer information of other classes in the same package". Please advise about the issue. Thanks.

Sitemap Generator problem

I apologize as I realize you have had this question numerous times but the only page that it returns is the homepage: antiquarianart.net

I’ve tried everything you’ve suggested but no joy. Can you suggest anything else?

Much appreciated.

Sitemap Not Generating Full Site, My Fault Im sure

Hi Jim and Others, and Thanks for a great tool.
The site i am currently working is
coconutvillasfl.com

First run– Found root directory 5 pages but not the 6th
caladesi-island-best-american-beach.htm
also
it didn’t find the directory or pages related to www.coconutvillasfl.com/caladesi-condos

However i can manually add them in to " settings — URL —
and then it finds them with no errors.???
Did i give you enough info?

Perplexed.
Stephen

Where can I find the Sitemap Generator?

can you give download link please, I would like to run the sitemap generator on my site.

l didn’t find it.

Can’t save sitemap to computer… tried everything

Hello,

While running the sitemap everything goes fine and great but when i try and export xml file to my computer and go to retrieve it to upload it to my server it is no where to be found… but when i go to run the sitemap and save again it shows that it is saved in the previous folder… But again when i check for the actual file in my computer it is not there… I have tried everything from what people have posted and responded to on this issue… i am using windows vista with IE7 and if anyone could help me that would be great thank you.

gray screen

Hello and thank you for offering this great service!
Not sure I’m doing this right – when I click on the sitemap generator icon at
http://www.auditmypc.com/free-sitemap-generator.asp,
I’m directed to http://www.auditmypc.com/xml-sitemap.asp
which has a blank gray screen with this writing on the top:

"This tool respects sessions, so make sure you are LOGGED OUT of your website BEFORE generating a XML SITEMAP!

Here is why: Visitors have pointed out that CMS systems such as YaBB provide a link to delete text without prompting you to confirm. The sitemap generator will follow all links that it finds, and in cases like YaBB, if it sees a ‘Delete’ link, it will follow it. This could result in data being deleted when logged in as a user with delete privileges. Play it safe and log out of your website before using this tool."

I am logged out of my website – is something supposed to show up besides a blank gray screen? Must I go to some other page for the sitemap generator?
Thanks SO much for your help!
Nancy

Sitemap for dynamic web page content

My web page is largely dynamic content, using PHP scripts to get restaurant and bar data from a MYQL database. I ran the sitemap generator and it did find all my forum pages, aboutus pages etc but I do not think my dynamic content is being indexed. My website is eatdrinkdeals .com Any suggestions? When I exclude the forum files I get only a handful or urls.

Failed when retrieving *some* relative references

When generating a sitemap for bigtubresort. ca, the tool fails on some relative references but not others. For instance on
bigtubresort. ca/motel.php
the tool will correctly index

bigtubresort. ca/images/one-bed-700×525.jpg
bigtubresort. ca/images/one-bed-700x525t.jpg

but not

bigtubresort. ca/images/lighthouse.jpg
bigtubresort. ca/images/big-tub-harbour-resort-tobermory-ontario.jpg
bigtubresort. ca/images/two-bed-700×525.jpg
bigtubresort. ca/images/two-bed-700x525t.jpg

I’m using the latest Sun Java as per java. com/en/download/help/testvm.xml on Gentoo Linux with Firefox 2.0.0.16.

Thanks for any help you can give me.

Modified files don’t seem to be picked up by sitemap generator

I’ve used the sitemap generator to identify some problems with pages on my site (twowheelsforever .com), updated those pages, and then re-run the generator.

For some reason, I can’t get the generator to recognize the changes to my site. It looks like it’s using cached files or something. Pretty much every page has been modified today, but the generator is showing modified dates of two weeks ago or older. Any suggestions? I tried a couple of different settings, but they didn’t seem to make any difference…

URL or Sitemap with queued

Is there any way to get it to spit out a URL listing or Sitemap that includes the queued as well as the crawled URLs? I have about 70,000 queued URLs that will each take about 6-10 secs to load, and I would prefer not to wait for them to be crawled.

If we can’t do it now, I nominate it for the next version.

Status: Failed – Error: Connection reset

I’ve tried updating the sitemap for http://www.yourpetnme.com and keep getting the following.

Status: Failed.
Error: Connection reset.

The website works and has been updated to the server, which is working properly.

When I checked the URL using the "Sitemap Generator", I get the following message:

Java.net.SocketException: Connection Reset

Can someone explain what is happening and what "Connection reset" means?

Thanks,
Alex

Totally Clueless – Edit Sitemap

I used Google Web Tools to find problems with my sitemap and I have a url that is listed incorrectly. How do I correct this. I thought I would make the change through notepad, but it did not work.

Still Crawling After 30 Minutes?

Jim,

You helped me with an earlier question where I created some links between my index and a review script install.

But now I have a problem I would really appreciate your help with.

When I run the sitemap crawl it is taking over 30 minutes and is doing almost 5000 searches and counting. It can’t be this big can it? I mean I only installed it last week and there are only about 20 reviews in there?

May I please ask for your help in having a look at this as I suspect I am including something that will piss google off if I submit this huge little thing.

www.ProbateRealEstateCourseReviews.com

BTW I am doing a tutorial on this review script for my club so I will put your sitemap generator (which we recommend) answers there so you don’t have to answer this again.

Thanks

Stephen

P.S. still going, 30 minutes, almost 5000?
If you need any access codes or anything let me know.

Sitemap Error: Unsupported file format

Hi,

I generated sitemap for my site from http://www.auditmypc.com/xml-sitemap.asp and saved it as sitemap XML.

When i submitted the file to Google sitemap, gives an error ‘Unsupported file format’.

What would be wrong here??? Is it possible to have more than one sitemap for single site???

Jussy

can’t get sitemap to see review script installed?

Hello
I have just created a xml sitemap for my site
ProbateRealEstateCourseReviews .com

but I couldn’t seem to get it to see anywhere in the read anything about a script put in at (same site) public_html/reviews

the script is from Tim at review-script .com

Is there anything you think I should know as to where to start looking to solve this?

Thanks
Stephen

can’t find a part of my website in sitemap?

hello
I installed a review script in public html sect of my site (it allows anyone to visit and leave reviews) but when I run the sitemap gen. it doesn’t get referenced anywhere?

my permission for the script are set to 755 and there are no redirects?

Any ideas as to why it wouldn’t show up?

Thanks
Stephen

301 redirects

30,000 of my 159,000 pages have 301 redirects (php based in the header). Will the tool respect those and leave those pages off of the sitemap, or do I need to add a robots text to those pages as well?

Pause Every Few URLs

I’m receiving frequent pauses making the generator unusable. It processes maybe 1-5 URLs before hanging precisely 1 minute then repeats the same behavior. The crawler shows a status of streaming with a rate of 0 B/s during the pauses.

The first 8,000 URLs did fine a few days ago. Any ideas?

Can’t generate a good Sitemap?

Hello,

The first time I ran the XML sitemap tool it was going great but I got to about 20,000 links and it came to a halt. I did more reading and since my site is WordPress I put in the recommended exclude files to reduce links. Now whether I use the exclude paths or not the tool finds about 200 links, but looking at the sitemap tab many other links fail. The problem is I don’t know why they fail? They all seem to be valid links. My site is www dot hotmommagossip dot com in case your interested. Seems strange that it found so many links the first time but now I can’t get the tool to work properly? Any thoughts?

Best,
B.

Large amount of Queued url’s

Hi
I have ran Sitemao generator and it run ok It finds around 25’000 url/file. Out of this around 18’000 are queued. Can I just retry the them. It has processed about 25/30% of my site. I’m guessing the I would need higher then that.
Thank you for your help in advance

Setting Memory Limit in Opera – how?

I’ve been trying for a while and failed, then googled for 2 hours and couldn’t find it.

adding -Xmx???m in Java CP does not change a thing – for opera (works just fine for IE and Firefox).

Why Opera? applet causes random FF crashes (with -Xmx750m it manages to crawl 1500-3000 urls and crashes, with memory usage on ~30% due to 250k queued urls).

I am completely stupid

I have been working on my site map for two days. I generated it, saved it and then copied the code to my webhost’s editor. (no ftp capability)

it looks beautiful but google says its an invalid format…

please help……..I am about to give up

Google Error – Unsupported file format

I generated my sitemap and saved it as an XML file. When I submitted the sitemap to Google, I got the following error message:

Unsupported file format
Your Sitemap does not appear to be in a supported format. Please ensure it meets our Sitemap guidelines and resubmit.

Thanks for you help,
Dianne

Newbie questions – Thankyou-Help Doc?/Question

Hello All,

Firstly this is a really great site (so who ever created it, it’s something to be proud of), I have just built my first site and the whole Sitemap topic can scare the living nunchaku’s out of you, but with a little patience and reading the stuff on here it’s been a real education, so that said (a heart felt) ‘Well Done’.

The questions:

Q: Is there a document/help file that explains what each part of the XML Sitemap Tool does, especially for someone who doesn’t know the references along the top? I’d like to understand more about things like the ‘In-l & Out-l’. Well all of it really.

Can I be cheeky and ask you to have a look at www.executivecoachingsolutions.co.uk on the xml tool.

Q: Why do Nr.1 and Nr.17 have the same title?

Q: Why is the level for Nr.1=0 and Nr.17=1? If they have the same title.

Q: On the site itself, pages such as ‘What we do, About Us, Modus Operadi, Life Long Learning, Terms and Conditions’ all sit as a menu off the Home page, so why are they all level 1?

Q: The ‘links page’ which sits as a link off the TandC page is a level 2 and so are JPGS, which appear on a Level 1 page?

The big question!

Q:Is there anything about the xml that would indicate the site can’t be seen by a search engine. I have a 404 (Not Found) dated April 1st for the index page, but have uploaded the xml sitemap since then?

Gosh, that’s alot. But I hope the answers (if there are any) helps others as well.

xml sitemap wont save to local system

Has anyone had this problem before?

Using Vista, I go to save xml generated site map to disk, it says it saves when I go to retrieve file, it’s not there?

Memory issues on large sites.

I’ve used auditmypc for sometime with great success. The last few months as my sites have grown I have had several issues. Running out of memory, program stopping with large amount of queued url’s. I have tried both IE and Mozilla. Same issues.

Java memory has been increased to -Xmx1028m on one of my pc’s. On other PC’s if I increase from -Xmx128m to -Xmx256m I get an error that says the java environment can’t be started.

I am using XP pro SP 2. This occurs on 5 different pc’s.

Any help to generate large (20,000 – 30000+ urls) would be appreciated.

Sumitting to MSN, Ask etc.

Hi all,

Hope some of your experts can offer a novice a little guidance.

The sitemap generator is great, and I have now created xml sitemaps for 3 of my 6 domains and submitted them via the advised links to both Google and Yahoo. Can anyone suggest how I submit to other search engines such as Ask and MSN? Is it merely a case of going to their submit a site page and entering the sitemap url in the hope that all of the other pages will then get crawled?

Should this also be the rule when submitting to any search engine; when asked for the home, main or index page, enter the sitemap url instead?

Thanks in advance of any suggestions.

John C
www.faux-fur-throws.com

Disable cookies

Is it possible to disable cookies in the current Sitemap Generator v1.2?

I would love to have this possibility, since I am running a shop page with Virtuemart. So I as everyone else suffers from the always appending "vmcchk" in the urls for first visits AND ALWAYS when cookies are disabled.

So when Googlebot indexes my pages, it does so by ALWAYS having "vmcchk" in the link adress, but

when I index my site with for example Sitemap Generator v1.2 (which is pheneomenal btw) it does so by having cookies "enabled" and therefore I don’t get the pages indexed with vmcchk, which ultimately ends with a duplicate page.

I hope anyone can help, thanks for reading.

404 & 406 Errors – Can’t Find to Correct

I’ve created a sitemap for my website and I have 4 errors showing. I know I can clear them from the sitemap, but I also want to clear them from my site and there lies the problem.

I’ve followed your instructions and watched the video a number of times.

On one of the 404 errors, when I click the + every page on the site is listed. The error as I see it is that URL ends as follows: .com /www. I’ve searched my code and checked my "File Manager" on the server. The only thing that I find is in the file manager I have a file named "Public_html" and a file named "www". The "www" is a duplicaton of "Public_html". My question is: Is it the www file that may be causing the problem and if so, can I delete it without fouling up my entire website?

2nd – The other 3 errors are on specific pages, but I cannot find them to make the corrections.

I could use whatever suggestions or input you may have.

Thanks,
Alex

Sitemap with just one URL

Hi,
I hope you can help me. I found your sitemap generator really great, much better than the others I’ve seen.

My problem is that when I generate de Sitemap, the only URL that appears is the main page. I mean, just de domain. All the files (PHP and HTML) are in the same directory that index.html, but nothing appears, just www.mydomain.cl (sorry I cant tell you the domain in public). The home page has the links in Flash animations. Is that the reason? does the sitemap generator "reads" the files in server or the links?

Lot of thanks from Chile!

Generator timing out?

I have been running sitemaps for a couple sites. One is around 10000 lines the other is close to 20000. My problem is that the sitemap tool will start very quickly until it reaches a certain number let’s say 3400? The it gets very bogged down and smonetimes stops all together. When it does this it also fails several lines. Not enough memory, and connect time are the big errors? I am running on windows 2000. I know you can increase java memory but I cannot find where I this edition.
Do you have any ideas what may be happening? Is it the site structure or just not enough memory to run the sitemap generator?
thanks.

Sorry I’m clueless!

Hi all, i have just used the sitemap generator and found it to be quite painless !
I was expecting a lot more confusion after reading the stuff on google and other websites regarding sitemaps.
Anyway, it all went very well until I submitted it to Google as I have been given an Unsupported file format error message

I thought it was too easy!

Any help would be appreciated.

Thanks,

Mark

sitemap generator broken

i’ve used the sitemap generator numerous times, successfully, for over 6 months, typically weekly.

for 2 days now when i try to use it – all i get is a red x – and nothing.

to be sure i updated java – and tried again – still nothing, just a red x.

What does a good sitemap look like?

I have created sitemaps for 2 of my sites using this webmaster tool. They are very different from each other. One is very simple and easy to understand. The other is very long and all the words look like jibberish. The urls are not understandable.
The question is do the search engines care which way it turns out? As long as it links through the site is it ok?
Also where can I add priority to pages?
Thanks.

Sitemap.txt

Jim,

Does your generator write sitemap.txt files? Someone was asking me why I didn’t have a sitemap.txt on my front page. (I forgot to add the sitemap.html link with my last template update). Does it matter .txt or .html for visual sitemaps?

P.S Someone noticed that out of 1 million pages on Google, my site came up #25. You’re part of that accomplishment – Thanks

Thanks,

Roger

Does not generate sublevel pages

I ran the site generator several times and with different parameter. The only pages that show up are those that are linked on the main page. There is one sub directory called catalog that has 14 links but only those directly shown on the main page show up. i.e. needled_evergreens.htm appears but not vines.htm.

site is Http://www.andersonfarms.net

Thanks,

John

Rookie Questions

Hi,

I am a rookie who has just started using your Sitemap Generator. I continue to appreciate it more as I learn more and I have spent some time looking at previous question/answers. Thank you very much for this tool.

I have some very basic questions:

1) I have lots of duplicate (+) pages showing up and I am not sure if this is normal or if something is wrong. Many of them are shown as different levels, for example:

www.giveashare.com/2PartDelivery.shtml
www.giveashare.com/../2PartDelivery.shtml
www.giveashare.com/../../2PartDelivery.shtml

I do not know if it has anything to do with it, but I use an Include file for my header which contains many hyperlinks with “../” .

2) I have read many of the posts and still am unclear about whether I should include images in my sitemap. I have a lot of images on my website and when I included them in the SiteGenerator, I got some funny looking entries that repeated the image directory in the url, for example: www.giveashare.com/images/images/images/images/images/starbucksstock.jpg It would show 6 or 7 entries for the same image with the only difference being one more repetition of “image/”

Any ideas?

Capitalization Causing Duplicates

I meant to include this in my last post but apparently some of my duplicates are a result of inconsistently using capitalization in my hyperlinks. For example, I have entries in my SiteMap showing:

StockListAlpha.asp
stocklistalpha.asp

There is truly only one file. For sitemap purposes, I assume that I just clear one but does this inconsistency impact me negatively in terms of pagerank – in which case I really want to go fix it at the source??

Thanks

Not getting past the first page

Hi,

First, I want to say a big big thank you to Jim for helping me get my registration to this forum completed, without his personal assistance, I would not have access to this great resource, Thanks Jim.

The sitemap generator only finds the index page of my website, I have almost 50 subfolders with a lot of content, I am wondering if this is an .htaccess file problem or something else in my host root directory causing this.

Thanks for any suggestions
Bob T.

strange characters in html sitemap

Hello,

I write my webpages in utf-8. When I export a sitemap as html, it has strange characters in it, in those strings where I use č for example. Is there a setting to enable utf-8 in the webtools I forgot?

Jos

Level 4 files excluded from sitemap XML export

I love this sitemap generator tool and have used it successfully many times.

Today my situation is that when I export my sitemap using the Sitemap XML option, entries that are in my level 4 do not get exported. These same files are included when I export URL raw list as a text file.

All of these level 4 files are PDF files, where I have an extensive PDF library on this site.

Thanks for any assistance!

JAVA memory error

It seems as if several individuals are having memory issues. My sitemap generator used to work fine. Now on my PC’s I get a "low memory error" when generating a large sitemap.

I can increase the java memory from the java control panal up to -Xmx185m but still get the error. If I go any higher than that I get a java environment error when I start the sitemap generator. What can I do. This happens on all 3 of my home pc’s.

Forums

Sorry if this is a really "dumb" question….but here goes.

Is it worth submitting a sitemap for a forum that I have created

Cheers

Sitemap Newbie

Hi

I have created a sitemap as per the instructions and submitted it to Google but get a 404 Error. After reading some posts on here, think I might have a few more problems though such as (www.website/categories.php?cat=1.) I am right in saying that question marks are not good?

Also it is quite a large site with lots of images would it be a good idea to include these or not

I am a complete newbie to these sort of things and getting bogged down by it so if someone could have a look I would really appreciate it

Thanks
Adrian

General HTTP error: 404 not found

Hi, all

I have followed all the instructions but when I submitted to Google I get the above message ?

Sure I have missed something simple, any help appreciated

Cheers
Adrian

Problem with latest version of Java

I have used the Audit My PC Sitemap Generator for almost a year without any problems. I had been running version 5 of Java and a few months ago, I tried installing Version 6. As soon as installed it, the Sitemap Generator started having problems mapping my web site. It would find all the HTML files, but would report failures on all the graphics files such as GIF files. I still had Version 5 installed, so I simply uninstalled Version 6 and the problem went away.

Recently, I started receiving notices that I should install version 6 and I thought maybe the problem was due to the fact that I had two versions installed. As a result, I uninstalled Version 5 before I installed Versions 6. Now the old problem is back and I can’t figure out what is causing it. Since I uninstalled version 5, I’m stuck trying to make version 6 work.

The site has about 1000 pages and maybe 3000 graphics. I have tried increasing the memory size, with no effect. I have also tried changing other settings and the problem still occurs. I’m pretty sure my site is not causing the problem, because it worked fine under Java 5 and the sit hasn’t changed since.

Looking at the problem in more detail, I can see that the failures are not being caused by a timeout, which is where I usually see errors. When the program tries to load a graphics file, it fails instantaneously. I don’t see any entries in the "Problem List" like I normally do when there is a timeout error.

The system page shows the following information:

Memory*usage
Free*memory 66.0*M
Total*memory 117*M
Max*memory 127*M
System*properties
package.restrict.access.org.mozilla.jss true
java.version.applet true
http.auth.serializeRequests true
os.version.applet

Does anyone have any ideas?
Thanks in advance for your help.
Larry

Links with Russian Symbols (windows-1251) Problem

It doesn’t work right if some links use the Russian symbols in windows-1251 encoding. It detects the page encoding table as utf-8 even if the content="text/html; charset=windows-1251" is set. Or am I doing something wrong?

Update problem

I´m now using your professional Sitemap Generator.

The problems are that the generator won´t update the pages date when
I´ve updated the pages. I must run the process from beginning and insert
priority and update frequency again to get the latest page up-dates in the
sitemap.

What can be the problem?

Best Regards
Eric Andersson
Sweden

Filtered Items Included when Creating Sitemap

Happy New Year to All!

I used the filter recently to limit the number of items displayed in my sitemap. However, when I saved the sitemap the filtered items were included in the sitemap.xml file.

My site has exceeded the 50,000 URL sitemap limit, so I need to implement the sitemap index method and to reduce the size of each individual map.

Since my web site is database driven, I can’t use directory structure to limit the sitemap contents. Filtering would seem to be a great option to seperate sitemap entries into smaller file sizes.

Using the INCLUDE and EXCLUDE filters on the settings tab doesn’t accomplish my task. When I have these filters applied, the program limits the URLs that are followed and recorded. I need the program to follow and record ALL URL’s, because many are not reachable unless other links are followed.

I would like to allow the sitemap program to run Filter free, record ALL URLs, and then allow me to filter which ones are output into the sitemap when saved to disk.

Is there a way to save the sitemap with the filters still in effect? Is this planned for future revision maybe?

JAVA Fatal Error

Help please -

Spent the last week reading forums on errors Java Runtime Environment cannot be loaded and several Java Virtual Machines running in same process. Tried lots and lots of suggestions without results.

Errors occur with -Xmx512m parameter everytime and most of the time with -Xmx256 parameter. 256MB is not enough for a site of an estimetad 180,000 URLs.

I use IE7 and Windows XP SP2 with 3GB RAM on a Dell computer.

Any thoughts on this problem would be most welcomed.

Thanks,

Terry