XML/TMX to Text (.txt) File Converter
This Python script converts an XML/TMX file into a plain text (.txt) file. It uses BeautifulSoup to extract only the text content from the XML/TMX file and removes all HTML/XML/TMX tags.
1.) Run the command: python3 extract_corpus_xml.py
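The internals of extract_corpus_xml.py are not shown here, so the following is only a minimal sketch of how tag stripping with BeautifulSoup might look. The function names `extract_text` and `convert_file` are hypothetical. The choice of Python's built-in `html.parser` is also an assumption, made to avoid requiring lxml; the actual script may use a different parser.

```python
from pathlib import Path

from bs4 import BeautifulSoup


def extract_text(markup: str) -> str:
    """Return only the text content of an XML/TMX string, one segment per line.

    Hypothetical helper, not taken from the original script.
    """
    # Assumption: html.parser is used because it ships with Python.
    soup = BeautifulSoup(markup, "html.parser")
    # strip=True trims each text node and drops whitespace-only ones;
    # the remaining pieces are joined with newlines.
    return soup.get_text(separator="\n", strip=True)


def convert_file(src: str, dst: str) -> None:
    """Read an XML/TMX file and write its text content to a .txt file."""
    markup = Path(src).read_text(encoding="utf-8")
    Path(dst).write_text(extract_text(markup) + "\n", encoding="utf-8")
```

For example, `convert_file("corpus.tmx", "corpus.txt")` would write the extracted text to corpus.txt; both file names are placeholders.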