Tika and Taro provide path on roadmap #5

Open
chrismattmann opened this issue Sep 7, 2017 · 4 comments

Comments

@chrismattmann

FWIW, Tika and Taro.jl provide a path, e.g., for NLTK integration and advanced text processing. FYI, see http://wiki.apache.org/tika/

@aviks
Member

aviks commented Sep 7, 2017

Hi Chris, good to see you around here. Tika is a great project. As you know, Taro.jl wraps Tika, and therefore provides an API for getting metadata and content from documents. Did you have any suggestions for further integration?

Also, I did not quite get what you meant by NLTK integration. Could you clarify, please? How do you see that happening?

@chrismattmann
Author

Thanks @aviks. I was just saying that we have already wrapped NER capabilities for NLTK, OpenNLP, and MITIE, so you may just want to use those. Check the wiki page and look for NER.

@aviks
Member

aviks commented Sep 7, 2017

Ah, OK, I did not know that. Thanks for the pointer, will check it out.

@chrismattmann
Author

Anytime, great work. Excited to follow your projects!
