url.list man page

url.list — websec url monitoring configuration

Description

The URL list consists of one or more sections separated by newlines. You can have sections without a URL, they will update the defaults for all the subsequenet blocks. The Name and Prefix parameters are required as well as one of Email, EmailLink and Program. The rest are completely optional.

The following parameters (case-sensitive) are recognized in each section:

URL        - URL of web page to monitor
Name       - Name of web site. Pages delivered to you will have the
             following format: "Name - Date (Day)" eg. "PC Magazine - 4
             Sep 98 (Fri)"
Prefix     - Prefix of filenames for archive files of web pages created
             by Web Secretary.
Email      - Comma-delimited list of email addresses to send highlighted
             pages to.
EmailLink  - Comma-delimited list of email addresses to send URL of
             changed pages to.
Program    - application to call with the diff-file, special cases:
             "mozilla", pages are opened in new tabs,
             "konqueror", pages are opened using "kfmclient openURL"
Auth       - Authentication information in "username:password" format. 
             Put "none" if no authentication needed.
Diff       - Put "none" if you want Web Secretary to always mail this
             page to you instead of checking for and highlighting
             changes in the page.  Put "webdiff" if you want Web
             Secretary to check for changes.
             Put "htmldiff" to use a different implementation of HTML
             diff output. Note that you need to install the perl-modules
             Algorithm::Diff and HTML::Diff which are availabe on 
             http://www.cpan.org/ for this to work.
Hicolor   -  Color used to highlight new or changed content. Currently,
             four colors are defined. They are: blue, pink, yellow and
             grey. You can also supply your own HTML color tag in the
             form "#rrggbb".
Ignore     - Comma-delimited List of section names containing ignore
             keywords. There must be NO SPACES between delimiters and
             section names. The ignore sections and keywords are stored
             in a file called "ignore.list".
IgnoreURL  - Comma-delimited List of section names containing ignore
             URLs. There must be NO SPACES between delimiters and
             section names. The ignore sections and keywords are stored
             in a file called "ignore.list".
AsciiMarker - If set to 1 it will add ascii markers around the changes so
                 that highlighting is noticeable in text mode too. Useful for
                             text MUAs users.
Tmin       - Every token containing <= Tmin words will not be highlighted
             for differences.
Tmax       - Every token containing >= Tmax words will not be checked for
             ignore keywords.
Proxy      - Specify proxy "http://your.proxy.here:portnum" if you are
             using one. (Alternatively, you can make use of the
             "http_proxy" environment variable)
ProxyAuth  - Specify proxy authentication in "username:password" format.
             The code for this feature was contributed by Volker Stampa.
MailFrom   - The E-Mail address to send mail from, this can be left empty
             and the user used to run websec will be used.
ProgramDigest - If specified "true", websec does not open all changed pages 
             separately with the application specified in "Program", but opens
             a summary page that contains links to all changed pages. 
Digest     - true|false or yes|no. This works only if EmailLink is
             specified. It consolidates all the changed URLs and sends
             them in one email.
UserAgent  - The User-Agent that will be sent by the web client. This can
             be used to bypass servers that prevent access based on the user
             agent.
DateFMT    - Date format to use in e-mail messages, can be empty for no date.
             Set it to " - %Y-%m-%d" for ISO dates. This is perl format for dates.
RandomWait - Websec waits for a random number of seconds between retries up
             to the value specified by the RandomWait keyword. This is to
             prevent websec from being blocked by websites that perform log
             analysis to find time similarities between requests.

Any line which begins with a '#' is treated as comment and ignored.

If a section does not contain a URL entry, the values provided will be treated as the default for the following sections.

For example,

# Defaults
Auth = none
Diff = webdiff
Hicolor = blue
Ignore = General,Date_Time
IgnoreURL = Adverts
Tmin = 1
Tmax = 10
Proxy = http://proxy.nus.edu.sg:8080
Email = vchew@post1.com
# Web page to monitor which does not require authentication
URL = http://browserwatch.iworld.com/news.html 
Name = Browser Watch
Prefix = browsewatch
# New defaults with authentication information
Auth = user:password
# More web pages to monitor which requires authentication
URL = http://www.infoworld.com
Name = Infoworld
Prefix = infoworld
URL = http://developer.javasoft.com/
Name = Java Developer Central
Prefix = jdc

See Also

ignore.list(5)

Author

Baruch Even <websec@ev-en.org> is maintaining this program.

Referenced By

ignore.list(5), websec(1).

2006-01-20 perl v5.22.1 User Contributed Perl Documentation