This is an HTML Tidy diagnostic script.
For now it analyses a single, fixed URL.
Why Use It
For Microsoft users who do not have local Unix tools: to reformat,
check, & insert line numbers to debug their HTML.
Presumably some search engines will rate you higher
if you have clean HTML.
Some commercial HTML generator
programs fail to generate clean HTML, some use browser-dependent
proprietary extensions, & humans fail too with
hand-written HTML.
If you think it slow
It is a script: it fetches your URL
from the internet (including whatever delays that URL might
impose with scripts or link redirections etc), then
analyses the data & writes the analysis to 6 files
on a Berklix
(for you to later click on),
then builds this page, & sends it to your browser.
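The fetch & analyse steps described above can be sketched roughly as follows. This is a hedged illustration, not the actual script: it assumes curl does the fetching and that tidy is on the PATH, and the function name analyse_page and every file name are invented here. The set of files it writes is a guess and does not reproduce the original 6.

```shell
#!/bin/sh
# Hypothetical sketch of the fetch & analyse steps; NOT the author's script.
# Assumes curl and HTML Tidy (tidy) are installed. All file names are invented.

analyse_page() (
    url=$1      # page to check
    outdir=$2   # directory that receives the result files

    mkdir -p "$outdir" || exit 1

    # Fetch the page, following redirects, giving up after 30 seconds.
    curl -sS -L --max-time 30 -o "$outdir/raw.html" "$url" || exit 1

    # Line-numbered listing of the HTML exactly as received.
    cat -n "$outdir/raw.html" > "$outdir/raw.numbered.txt"

    # Reformatted HTML goes to -o, diagnostics to -f. tidy exits 1 on
    # warnings and 2 on errors, so a non-zero status is not fatal here.
    tidy -q -i -o "$outdir/tidy.html" -f "$outdir/tidy.errors.txt" \
        "$outdir/raw.html" || true

    # tidy may write no output when it finds errors, so check first.
    if [ -s "$outdir/tidy.html" ]; then
        cat -n "$outdir/tidy.html" > "$outdir/tidy.numbered.txt"
    fi
    exit 0
)
```

A call such as `analyse_page "https://example.org/" /tmp/tidy-demo` (placeholder URL and path) would leave the numbered listings and the tidy diagnostics in /tmp/tidy-demo. The function body runs in a subshell, so its variables do not leak into the caller.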
Julian H. Stacey,
Port wrapper in
SVN (subversion repository)
on Operating System:
You may use this script under the BSD licence; please retain this credits line.
The script could take a parameter to sample other URLs,
but there are issues to be considered first
(security design comments welcome to
Other tools at www.berklix.net
It would need to use different names for temp files,
so different pages do not collide when there are multiple simultaneous users.
It would need a sleep &/or a parallel crontab &
a find age test (eg -mtime) to delete them after a time limit.
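One possible shape for the temp file handling, offered as a sketch only: each request gets its own directory from mktemp, and a separate job run from cron removes directories past an age limit. The function names, the directory layout, and the crontab path are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical helpers; the directory layout and names are assumptions.

# Create a uniquely named directory for one request and print its path.
new_request_dir() (
    base=$1
    mkdir -p "$base" || exit 1
    mktemp -d "$base/req.XXXXXX"
)

# Remove request directories last modified more than max_min minutes ago.
# -mindepth, -maxdepth and -mmin are GNU/BSD find extensions, not POSIX.
# A directory's mtime changes when files are created or deleted in it,
# so it approximates the time the request wrote its results.
purge_old_requests() (
    base=$1
    max_min=$2
    find "$base" -mindepth 1 -maxdepth 1 -type d -mmin +"$max_min" \
        -exec rm -rf {} +
)

# Example crontab entry (hypothetical path), purging every 10 minutes:
# */10 * * * * /usr/local/bin/purge_tidy_requests
```

With this split, the page-building script never has to sleep: it only calls new_request_dir, and the age limit is enforced independently by the cron job.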
It would need something so it cannot be abused to copy content I
do not want onto my site, eg porn, terrorism, or MS adverts etc.
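One way to limit such abuse, sketched here as an assumption and not as anything the script currently does, is to accept only http(s) URLs whose host appears on a fixed allowlist. The function name url_allowed and the ALLOWED_HOSTS variable are invented for this example, and example.org is a placeholder entry.

```shell
#!/bin/sh
# Hypothetical sketch: accept only http(s) URLs whose host is allowlisted.

ALLOWED_HOSTS="www.berklix.net example.org"   # example.org is a placeholder

url_allowed() (
    url=$1
    case $url in
        http://*|https://*) ;;      # only these two schemes
        *) exit 1 ;;
    esac
    rest=${url#*://}                # drop the scheme
    hostport=${rest%%/*}            # drop the path
    hostport=${hostport%%\?*}       # drop a query string
    hostport=${hostport%%\#*}       # drop a fragment
    hostport=${hostport##*@}        # drop any user:password@ prefix
    host=${hostport%%:*}            # drop the port
    host=$(printf '%s' "$host" | tr '[:upper:]' '[:lower:]')
    for h in $ALLOWED_HOSTS; do
        [ "$host" = "$h" ] && exit 0
    done
    exit 1
)
```

A caller would run something like `url_allowed "$url" || exit 1` before fetching anything. A check of this kind only covers the first URL: if the fetch follows redirects, the redirect targets would need the same test.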
Maybe restructure it so output goes straight to a pipe, thus 1 output per page,
not 6 files per page?
But even then it would need more protection, as the cat -n
output could be abused as an anonymising proxy (& more so if
someone elsewhere added a line number stripper at the receiving end).
While I am in favour of some anonymising proxies (to allow
some citizens of some countries to evade repressive regimes), there is
also possible immoral/criminal usage of proxies: complex possibilities
I do not have time for & prefer to leave to specialised
Tor operators. Also, my servers
do not need the load.