I have a website with a large publications section, where resources link to PDF documents that have been uploaded to the server. It would be nice to find a MODX way to generate a simple XML sitemap that includes those PDF documents plus the resources that correspond to web pages, excluding the usual suspects like the 404 page, etc. I have found a couple of MODX packages (Sterc’s SEOTools and pdoSitemap), but cannot see whether they actually crawl the site so that the PDF documents are included, or whether they just list the resources.
Is there a script that crawls (assuming crawling is necessary to fetch the PDFs)?
Does it also update itself automatically, or does it need to be run manually by viewing the resource that calls the script?
At the moment, I am using Tristan Goossens' PHP script, which seems to work, although only when run manually, but it feels like a terrible kind of disloyalty: