I would have posted this under the bug reports, but that forum refuses to allow me to post. In fact, the Unlimited Sitemap Generator forum refuses to allow me to report, and I am referring to the paid version, not the free version.
The Sitemap Generator reads content in <script> tags, and it should not. This is causing all sorts of bogus entries to get read in, and then my "Broken Links" page is loaded with a bajillion errors about pages that should never have been treated as URLs.
For example, if the page, [ External links are visible to forum administrators only ] has:
<script type="text/javascript">
var somelink = 'a.html';
function DoSomething() {
document.getElementById.innerHTML = '<a href="' + somelink + '">text<\/a>';
}
</script>
Then a ton of links to this appear:
[ External links are visible to forum administrators only ]" + somelink + "
This creates a TON of superfluous extra page hits that never should have happened in the first place.
Secondly, when I view the Broken Links tab, these URLs are not properly scrubbed, so the HTML of that link above comes out to:
<a href="[ External links are visible to forum administrators only ]"%20+%20somelink%20+%20""> ... </a>
Which is broken HTML and doesn't link to the fake link in question
Steps to fix:
1) The Sitemap Generator needs to ignore content inside of <script> tags
2) Scrub your content (with htmlentities($url); if nothing else) before you put it into the <a href="">