XML-TMX-to-Text-(.txt)-file-converter-

This python script converts XML/TMX file into Text file (.txt) and uses beautifulSoup to extract only the text from the xml/tmx file and removes all html/xml/tmx tags.

1.) Just run the command - python3 extract_corpus_xml.py

Built With

Share this project:

Updates