Hypertext Markdown Transformer (HTMT)

HTMT transforms HTML pages into a simple Markdown format. The output format is somewhat restricted, and parts of the HTML are ignored completely (e.g. header and footer tags). HTMT currently supports the following:

Paragraphs
Headings
Italic and boldface
Lists
Linebreaks and horizontal rules
Spans

Further, the following is supported in reduced form:

Tables (every row is treated as a header row)
Links (links whose text contains further tags may not convert correctly)
Images (only references to the actual image are possible)

Usage

You can install HTMT locally (it is not yet available on PyPI). To do so, clone this repository to a path of your choice (say /home/you/hypertext-markdown-transformer). Then run pip install /home/you/hypertext-markdown-transformer/ (preferably in a virtual environment). After that, you can use HTMT as follows:

import htmt

with open("testfile.html", "r") as f:
    content = f.read()

parser = htmt.HTMT_Parser()
print(parser.markdownify(content))
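
If the Markdown should end up in a file rather than on standard output, the snippet above can be wrapped in a small helper. The following is only a sketch: convert_file is a hypothetical name and not part of HTMT, and it assumes that markdownify takes the HTML as a string and returns the Markdown as a string, as the print call above suggests. The converter is passed in as a callable so the helper does not depend on HTMT itself.

```python
from pathlib import Path
from typing import Callable


def convert_file(source: Path, target: Path, markdownify: Callable[[str], str]) -> Path:
    """Read HTML from `source`, convert it, and write the result to `target`.

    `markdownify` is any callable mapping an HTML string to a Markdown string
    (hypothetical helper; assumed to match HTMT_Parser.markdownify).
    """
    html = Path(source).read_text(encoding="utf-8")
    markdown = markdownify(html)
    Path(target).write_text(markdown, encoding="utf-8")
    return Path(target)
```

With HTMT installed, a call would presumably look like convert_file(Path("testfile.html"), Path("testfile.md"), htmt.HTMT_Parser().markdownify).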

System Requirements

HTMT uses structural pattern matching (the match/case statement), which was introduced in Python 3.10. It therefore will not run on Python versions earlier than 3.10.
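
A script that depends on HTMT could check the interpreter version up front and fail with a readable message instead of a SyntaxError. This is a minimal sketch and not something HTMT ships; the names MIN_VERSION, supports_match_statement and require_supported_python are made up for illustration.

```python
import sys

# Minimum version taken from the requirement stated above (match/case needs 3.10).
MIN_VERSION = (3, 10)


def supports_match_statement(version_info=sys.version_info) -> bool:
    """Return True if the given version is new enough for match/case."""
    return tuple(version_info[:2]) >= MIN_VERSION


def require_supported_python() -> None:
    """Exit with a clear message when running on an unsupported interpreter."""
    if not supports_match_statement():
        found = ".".join(str(part) for part in sys.version_info[:3])
        raise SystemExit(f"HTMT requires Python 3.10 or newer, found {found}")


if __name__ == "__main__":
    require_supported_python()
```

The check has to run before htmt is imported, since the import itself fails on older interpreters.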

Contribute

Feel free to report bugs on GitHub. Pull requests are always welcome.
