View Single Post
  #4 (permalink)  
Old 03-25-2007,
csac csac is offline
Junior Member
 
Join Date: Mar 2007
Posts: 6
Default

I'm thinking that if a checkbox was checked then anything after a ? in a url would be ignored. So all the ones listed above would then be the same and listed as one page.

I don't think a ? should appear in a url unless it's before parameters such as sort keys, etc. Although I'm not 100% certain.

Right now I'm excluding the urls with the ? in them and counting on at least one version of the page showing up without the ?. I think it would be more reliably thorough if these pages were not excluded, but condensed into a single page when all before the ? is the same. This way it shouldn't miss a page that only shows up with various parameters, and for some reason doesn't show up at all without a ?.

Hopefully that makes sense.

I've seen similar behavior (listing multiple variations) in an old version of linkscan I use (great for checking links, doesnt do sitemaps and isn't free so my version is very old now).

The urls above are the result of php code on the page to select and sort info from a database listing. The other exclusion rules come from scripts for reciprocal links and for something else, these are larger more complicated scripts I installed but didn't write.

Jim
Reply With Quote