Hi,
The sitemap generator has been running for more than 1700 minutes and it has scanned 150,000 + pages on my client's site already. I've no clue how long it'll take to finish crawling the site...
I found the need to prevent the generator from indexing some of the dynamic pages. So here comes my questions:
1. How can I use the "Do not parse URLs" configuration setting to exclude dynamic page with certain query string parameters from being scanned?
e.g. I've a dynamic page called forum_post.asp and the URL to this page contains 4 different query string paramenters and content is generated dynamically based on the query string values, ie: TID = thread id, PN = page number, GET = can't recall what it's for, and TPN = can't recall what it's for (eg: forum_post.asp?TID=1&PN=4&GET=last&TPN=5). How do I tell the generator to ignore the GET and TPN query string to reduce the number of scanned pages? There's not enough information/sample that discuss this "Do not parse URLs" as well as the "Exclude URLs:" feature... It'll be nice if you could provide a few examples.
2. How do I tell the generator to ignore certain query string sitewise? Like I want the generator to ignore the 'SortCol' query string among all the dynamic pages in my website so I don't have to specific the exculsion rule for each page?
The generator is very powerful but it gives me headache as I can't figure out how to ignore URL that do not contain unique links to other pages... thanks for the help!
eddie