Skip to content
Tony Li Xu edited this page Aug 28, 2014 · 1 revision

Welcome to the text-processing wiki!

This Python script finds and removes unwanted lines from all ".html" files in a given directory.

Target directory is defined in "dir_path" in process_text.py

Patterns are defined in "patterns" array in process_text_.py

File type is defined in "file_ext" in process_text_.py

Clone this wiki locally