WWW tools
WWW tools is a package of tools
for fetching and manipulating web pages.
The programs in this package illustrate how to fetch a URL using the
Erlang socket interface, how to tokenise and analyse HTML, and how to
expand macros in HTML with a simple macro processor.
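
As a rough illustration of fetching a URL over a socket, here is a minimal
sketch. It assumes the standard gen_tcp module and a plain HTTP/1.0 GET
request; the module name fetch_sketch and the functions request/2 and
fetch/3 are hypothetical and are not taken from the package, which may do
this differently.

```erlang
-module(fetch_sketch).
-export([request/2, fetch/3]).

%% Hypothetical helper: build a minimal HTTP/1.0 GET request as an iolist.
request(Host, Path) ->
    ["GET ", Path, " HTTP/1.0\r\n",
     "Host: ", Host, "\r\n\r\n"].

%% Hypothetical helper: connect, send the request, and collect the raw
%% reply (status line, headers and body) until the server closes.
fetch(Host, Port, Path) ->
    Opts = [binary, {packet, raw}, {active, false}],
    case gen_tcp:connect(Host, Port, Opts, 5000) of
        {ok, Socket} ->
            ok = gen_tcp:send(Socket, request(Host, Path)),
            Reply = collect(Socket, []),
            gen_tcp:close(Socket),
            Reply;
        {error, Reason} ->
            {error, Reason}
    end.

%% Read in passive mode until the peer closes the connection.
collect(Socket, Acc) ->
    case gen_tcp:recv(Socket, 0, 5000) of
        {ok, Bin}       -> collect(Socket, [Bin | Acc]);
        {error, closed} -> {ok, list_to_binary(lists:reverse(Acc))};
        {error, Reason} -> {error, Reason}
    end.
```

Calling fetch_sketch:fetch("www.example.com", 80, "/") would return
{ok, Binary} holding the unparsed response, or {error, Reason}. HTTP/1.0 is
used here because the server then closes the connection after the reply,
which keeps the read loop simple.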
- disk_cache.erl provides a disk cache of URLs.
- html_tokenise.erl tokenises an HTML file.
- html_analyse.erl analyses an HTML file.
- html_expand.erl is a simple macro processor which
expands macros in an HTML file.
- url.erl has routines to get a URL from a disk cache or the
network.
- url_parse.erl is a simple-minded parser for URLs.
- url_copy.erl makes a deep copy of a URL.
We first copy the remote URL to the local file system. We then recursively copy
all images in the original to the local file system. Image names in the
original are renamed in a consistent manner.
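
To give a feel for what a simple-minded URL parser like the one listed
above might look like, here is a minimal sketch. It assumes only http URLs
and a default port of 80; the module name url_parse_sketch and the return
shape {http, Host, Port, Path} are hypothetical and may not match what
url_parse.erl actually returns.

```erlang
-module(url_parse_sketch).
-export([parse/1]).

%% Hypothetical return shape: {http, Host, Port, Path} | {error, Reason}.
parse("http://" ++ Rest) ->
    {HostPort, Path} = split_path(Rest),
    case string:split(HostPort, ":") of
        [Host] ->
            %% No explicit port given: assume the default HTTP port.
            {http, Host, 80, Path};
        [Host, PortStr] ->
            case string:to_integer(PortStr) of
                {Port, []} when is_integer(Port) ->
                    {http, Host, Port, Path};
                _ ->
                    {error, bad_port}
            end
    end;
parse(_) ->
    {error, unsupported_scheme}.

%% Split "host[:port]/path" at the first "/"; a missing path becomes "/".
split_path(Str) ->
    case string:split(Str, "/") of
        [HostPort]       -> {HostPort, "/"};
        [HostPort, Tail] -> {HostPort, "/" ++ Tail}
    end.
```

For example, url_parse_sketch:parse("http://localhost:8080/a/b.html")
splits the URL into the host "localhost", the port 8080 and the path
"/a/b.html". The sketch relies on string:split/2, which needs OTP 20 or
later.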