This is an update to my Sparks Update. I’ve revamped the code so that’s it much cleaner. I feel better that I’m now capturing most of the referenced URLs. I know I’m still capturing unnecessary items, but those can be filtered later or dealt with in subsequent tweaks.
I’ve received a few status requests and I understand the concern. As I originally stated, while this is something I want to do, it’s very low priority for me. Any errors or issues might be found immediately or might not be found for a few weeks. I’m making progress but it’s very slow. Also:
Some authors post less than others, so gathering a decent list of domains will take some time. The script runs nightly. It takes as long as it’s going to take.
This is a very simple Perl script. If you want a copy of what I have so that you can start working on it, email me and I’ll be glad to send it to you. Otherwise, time is our friend.