HTML Parser

This is a Python library, developed for my own purposes, that returns a list of words from any text or HTML document.
It can parse plain text and HTML, and contains utilities to analyze webpages.
What makes it special is that you can give a « weight » to any section of the text being parsed, and these weights are preserved in the parsed result.
The HTML parser then uses that feature to automatically give weight to words in titles, words in bold…
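
To give a rough idea of what weighting words by their surrounding tags can look like, here is a minimal sketch built only on Python's standard `html.parser` module. It is not this library's actual API: the names `TAG_WEIGHTS`, `WeightedWordParser` and `parse_weighted`, as well as the weight values, are made up for illustration.

```python
import re
from html.parser import HTMLParser

# Hypothetical weights per tag (illustrative values, not the library's).
TAG_WEIGHTS = {"title": 5, "h1": 4, "h2": 3, "b": 2, "strong": 2}


class WeightedWordParser(HTMLParser):
    """Collects (word, weight) pairs, weighting words by their enclosing tags."""

    def __init__(self):
        super().__init__()
        self.stack = []  # (tag, weight) for each weighted tag currently open
        self.words = []  # resulting (word, weight) pairs, in document order

    def handle_starttag(self, tag, attrs):
        if tag in TAG_WEIGHTS:
            self.stack.append((tag, TAG_WEIGHTS[tag]))

    def handle_endtag(self, tag):
        # Close the most recent matching tag, plus anything left open inside it.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        # Words outside any weighted tag get the default weight of 1;
        # inside nested weighted tags, the highest weight wins.
        weight = max((w for _, w in self.stack), default=1)
        for word in re.findall(r"\w+", data.lower()):
            self.words.append((word, weight))


def parse_weighted(html):
    """Return a list of (word, weight) pairs for the given HTML string."""
    parser = WeightedWordParser()
    parser.feed(html)
    parser.close()
    return parser.words


if __name__ == "__main__":
    print(parse_weighted("<h1>Hello World</h1><p>plain <b>bold</b> text</p>"))
```

In this sketch, a word takes the highest weight among the weighted tags that enclose it. That is only one possible design choice; summing or multiplying the weights would work just as well, depending on how the result is used.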