Or have you ever extracted the sentences before? I can only find XML files there. And thank you for making such a great project.
Depends on what you mean by "extract raw text". Extract from what?
I mean how to extract the Thai content from the XML files that can be downloaded from http://web-corpora.net/ThaiCorpus/search/
If you mean the text collection, these XML files were converted into a specific corpus format that allows indexing and searching. The converter is here.
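If only the plain text is needed, a generic approach may be enough. The following is a minimal sketch, not the project's converter: it assumes the Thai text sits in ordinary element text, and the tag names in the sample (`doc`, `se`) are hypothetical, since the thread does not describe the actual schema.

```python
import xml.etree.ElementTree as ET


def extract_text(xml_string: str) -> str:
    """Return all character data in an XML document, whitespace-normalized."""
    root = ET.fromstring(xml_string)
    # itertext() yields the text of the element and all its subelements
    # in document order, regardless of tag names.
    pieces = (piece.strip() for piece in root.itertext())
    return " ".join(piece for piece in pieces if piece)


# Hypothetical sample; the real corpus files may use different tags.
sample = "<doc><se>สวัสดี</se><se>ขอบคุณ</se></doc>"
print(extract_text(sample))
```

For a file on disk, `ET.parse(path).getroot()` can replace `ET.fromstring(...)`. If the text is stored in attributes rather than in element content, this sketch will miss it and the attribute values would need to be read explicitly.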