
# This is the rules file for CW3. The < character is used as the delimiter
# between texts, since it cannot appear in the text paragraphs of HTML files.
# Lines starting with # and empty lines are ignored.

# Every rule spans one line (!) and has the following format:

# n<text1<text2<text3.....<

# n is the rule number (see below). The rule parameters follow it,
# delimited by < signs. Additional parameters are ignored.
# Place a final < sign to close the rule, since any spaces are stripped
# from the end of the line!
# If a rule has too few parameters or an invalid format, it is ignored.
# Case normally does not matter, except for case-sensitive text search/replace.
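
# For illustration, a rule line breaks down as follows (a sketch based on
# the format above; the rule shown is only an example):
# 0<STRONG<B<
# | |      |+-- final < closes the rule
# | |      +--- param 2 = B
# | +---------- param 1 = STRONG
# +------------ n = 0 (replace TAG)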

# n can be: 

# 0 - replace TAG 
# param 1 = TAG to replace without the < and the >
# param 2 = TAG to insert instead, can be omitted for removal of TAGs
# Examples: 
# 0<HTML<  	             will remove any <HTML> in the input file
# 0<STRONG<B<		     will replace any <STRONG> with <B>
# 0</STRONG</B<              will replace any </STRONG> with </B>

# 1 - replace TAG parameter
# param 1 = TAG for which this rule is valid
# param 2 = param to replace
# param 3 = param to insert instead, can be omitted for removal of params
# Examples: 
# 1<FONT<COLOR<	             will remove "COLOR=xxx" of FONT tags

# 2 - replace TAG parameter value
# param 1 = TAG for which this rule is valid
# param 2 = param for which this rule is valid
# param 3 = value to replace without the "
# param 4 = value to insert instead, can be omitted for removal of values
# Examples: 
# 2<FONT<COLOR<RED<          will remove "COLOR=RED" from FONT tags
# 2<FONT<COLOR<RED<BLUE<     will replace "COLOR=RED" with "COLOR=BLUE" in FONT tags
# Comments: values are matched case-insensitively

# 3 - replace TEXT
# param 1 = search flag bits, added together:
#           1 for case sensitive,
#           2 for words only
#           0 = case insensitive, all text; 3 = case sensitive, words only
# param 2 = TEXT to replace
# param 3 = TEXT to insert instead, can be omitted for removal of TEXTs
# Examples: 
# 3<0<Foo<                   will remove any Foo, foo, FOO, etc. from the text
# 3<0<foo<bar<               will replace "foo" with "bar"
# Comments: "foobar" contains "foo", so the second rule turns "foobar" into
# "barbar". Use either "3<0<foo <bar<" or "3<2<foo<bar<" to avoid this.
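# A further sketch combining both flags, based on the flag description above
# (not verified against CW3):
# 3<3<Foo<Bar<               should replace only the whole word "Foo" (exact case) with "Bar"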

# 4 - replace TEXT with link
# param 1 = search flag bits, added together:
#           1 for case sensitive,
#           2 for words only
#           0 = case insensitive, all text; 3 = case sensitive, words only
# param 2 = TEXT to replace
# param 3 = URL to insert
# Examples: 
# 4<1<Foo<http://www.foo.com<    will replace Foo with
#                                     <A HREF="http://www.foo.com/">Foo</A>


# The rules follow now:

# text replace
3<0<&amp;sect;<&sect;<
3<0<&atilde;<&copy;<
3<0<&eth;<-<
3<0<<&auml;<
3<0<<&Auml;<
3<0<<&uuml;<
3<0<<&Uuml;<
3<0<<&ouml;<
3<0<<&Ouml;<
3<0<<&szlig;<
3<0<<&copy;<
3<0<<&auml;<
3<0<<&uuml;<
3<0<<&ouml;<
3<0<<&szlig;<
3<0<<&Uuml;<
3<0<<'<
3<0<<'<
3<0<<-<
3<0<<<
3<1< E-mail<email:<
3<1< E mail<email:<
3<1< Email<email:<
3<1<email::<e-mail:<
3<1<email:<e-mail: <

# text by link
4<0<Bundesanzeiger-Verlag<http://www.bundesanzeiger.de<

# tag replace
0<BR<
0<META<
0<EM<I<
0</EM</I<
0<STRONG<B<
0</STRONG</B<
0<U<I<
0</U</I<
0<ADDRESS<I<
0</ADDRESS</I<
