This batch process will strip the HTML tags out of your pages,
leaving you with plain text, and save the resulting text to files in
the folder you specify. The process will not overwrite the existing
files, unless you select the folder the current files are in, as the
destination folder (not recommended!).
To use this batch process, add all the files you wish to process
using the button marked
.
You may remove unwanted files by selecting them and clicking the
button marked
,
and you can clear the entire list by clicking the button marked
.
You may also drag-and-drop files and folders into the file list.
Hold down the control button to add all the files in subfolders as
well. The file list will list the file name, the source folder and
the subfolder into which the processed file will be saved.
Next, select a target folder. This is the root folder, where the
processed files will be saved. If a file is destined for a
subfolder, it will be created under this root folder. This is done
so that files with the same name in different folders will not be
lost.
Next, select your options:
Introduction
This simple introduction tells you what the process does.
Tag List
You may select to include or exclude a list of tags. By default,
the process will exclude an empty list of tags in other words all
tags will be stripped. To add a new tag to the list, type the name
of the tag in the available box, and click Add Tag. It is not
necessary to include the tag brackets (< and >); they will be
removed automatically if you do. To remove a tag from the list,
select it and click Remove. Press Clear All to clear the entire
list.
Options
These options tell the process what to do in special situations:
Strip all HTML comments: You may select to strip or leave
all html comments. This includes the DOCTYPE definition.
Strip special tags: Special tags include the tags used by
programming languages like ASP and PHP to enclose their code. This
option will significantly slow the process, so if your pages do not
contain server-side code like PHP or ASP, you should not select this
option.
Strip the complete HEAD tag: If this option is selected, the
entire contents of the HEAD tag will be stripped; if not, only the
HEAD tag itself will be removed.
Strip complete SCRIPT tags: If this option is selected, the
entire contents of the SCRIPT tags will be stripped; if not only the
SCRIPT tag itself will be removed. Remember that most scripts are
enclosed by HTML comments, so if the Strip all HTML comments option
is selected, the script will be removed anyway.
Strip complete STYLE tags: If this option is selected, the
entire contents of the STYLE tags will be stripped; if not only the
STYLE tag itself will be removed. Remember that most style blocks
are enclosed by HTML comments, so if the Strip all HTML comments
option is selected, the style will be removed anyway.
Remove duplicate newline characters: This will remove
multiple newline characters, so that you do not end up with large
spaces in your text where HTML tags had previously been.
Overwrite existing files: This will overwrite any files in
the target folder. It will not overwrite the source files, unless
the source folder and the target folder are the same.
Once all your options have been selected, click Start to run the
batch process.
|
|