Batch Processes


Strip HTML Tags

This batch process will strip the HTML tags out of your pages, leaving you with plain text, and save the resulting text to files in the folder you specify. The process will not overwrite the existing files, unless you select the folder the current files are in, as the destination folder (not recommended!).

To use this batch process, add all the files you wish to process using the button marked . You may remove unwanted files by selecting them and clicking the button marked , and you can clear the entire list by clicking the button marked . You may also drag-and-drop files and folders into the file list. Hold down the control button to add all the files in subfolders as well. The file list will list the file name, the source folder and the subfolder into which the processed file will be saved.

Next, select a target folder. This is the root folder, where the processed files will be saved. If a file is destined for a subfolder, it will be created under this root folder. This is done so that files with the same name in different folders will not be lost.

Next, select your options:

Introduction

This simple introduction tells you what the process does.

Tag List

You may select to include or exclude a list of tags. By default, the process will exclude an empty list of tags in other words all tags will be stripped. To add a new tag to the list, type the name of the tag in the available box, and click Add Tag. It is not necessary to include the tag brackets (< and >); they will be removed automatically if you do. To remove a tag from the list, select it and click Remove. Press Clear All to clear the entire list.

Options

These options tell the process what to do in special situations:

Strip all HTML comments: You may select to strip or leave all html comments. This includes the DOCTYPE definition.
Strip special tags: Special tags include the tags used by programming languages like ASP and PHP to enclose their code. This option will significantly slow the process, so if your pages do not contain server-side code like PHP or ASP, you should not select this option.
Strip the complete HEAD tag: If this option is selected, the entire contents of the HEAD tag will be stripped; if not, only the HEAD tag itself will be removed.
Strip complete SCRIPT tags: If this option is selected, the entire contents of the SCRIPT tags will be stripped; if not only the SCRIPT tag itself will be removed. Remember that most scripts are enclosed by HTML comments, so if the Strip all HTML comments option is selected, the script will be removed anyway.
Strip complete STYLE tags: If this option is selected, the entire contents of the STYLE tags will be stripped; if not only the STYLE tag itself will be removed. Remember that most style blocks are enclosed by HTML comments, so if the Strip all HTML comments option is selected, the style will be removed anyway.
Remove duplicate newline characters: This will remove multiple newline characters, so that you do not end up with large spaces in your text where HTML tags had previously been.
Overwrite existing files: This will overwrite any files in the target folder. It will not overwrite the source files, unless the source folder and the target folder are the same.

Once all your options have been selected, click Start to run the batch process.