The 20 Minute E-mail Solution!
TOC PREV NEXT INDEX

Merging\Cleaning Phrase Lists and URL Domain Black Lists (cleanlist.exe)


cleanlist.exe is used to merge anti-spam phrase lists and URL domain black lists, as well as remove any duplicate entries from these files.

Basic Command Syntax

cleanlist [/p] [/q] file1 file2 file3 ...

where file1 is the path and name of the list (either phrase or URL domain black list) that you are modifying. This file is overwritten with the new file. File2 and file3 are the path and name of additional lists that will be merged into file1.
Command Option Function
/p (Phrase Mode) Cleans the phrase list, If this parameter is omitted, it cleans the URL domain black list.
/q (Query Mode) How many labels a list of domains should contain.

Merging Phrase lists

To merge two phrase lists, enter the following command:

cleanlist.exe /p file1 file2

where file1 is the path and filename of the domain's existing phrase-list.txt file, and file2 is the path and filename of the updated phrase-list.txt file. This merges the contents of the files into file1 and removes duplicate entries.

Merging URL Domain Black Lists

To combine two URL domain black lists, enter the following command:

cleanlist.exe file1 file2

where file1 is the path and filename to the domain's url-domain-bl.txt file, and file2 is the path and filename of the updated url-domain-bl.txt file. This command merges the contents of the two files into file1 and removes duplicate entries.

Deleting Duplicate Entries in the URL Domain Black List

When cleanlist is run on a url domain black list, it eliminates variants of a domain. If a list of variants for a domain has an entry with two lables, that entry is retained and all others removed.

Example

If the following domain entries are found in the URL domain black list:

a.b.c.d.e

a.b.d.e

d.e

b.d.e

all but the d.e entry would be removed since the other s are redundant. Lists of variants are automatically reduced to the entries with the least number of labels.

To remove all redundant entries, enter the following command:

cleanlist.exe C:IMail\url-domain-bl.txt

Example

When cleanlist.exe is run with the /q option, all lists of domains with more than two variants and more than 2 labels are displayed and you are prompted for the number of lables that will be allowed. All redundant entries are eliminated based on the inputed lable length.

To define how many labels are allowed in a domain name, enter the following command:

cleanlist.exe /q C:IMail\url-domain-bl.txt

Deleting Duplicate Entries from the Phrase List

When cleanlist.exe is run with the /p option, it removes duplicate entries from the phrase list file and the revised files sorted in alphabetical order.

To clean the phrase list, enter the following command:

cleanlist.exe /p C:IMail\phrase-list.txt



Ipswitch, Inc.
http://www.ipswitch.com
TOC PREV NEXT INDEX
©Ipswitch 2004