Collect images of most frequently appeared 100 character classes #2

Open
3 tasks
yustoris opened this issue May 4, 2018 · 4 comments

Comments

@yustoris
Contributor

yustoris commented May 4, 2018

Overview

  • So far we have collected images for 50 of the 100 target character classes, so the next step is to obtain the remaining 50 classes.
  • The goal is to collect 200 images per class.

Progress

  • Find workers
  • Ask the workers to collect images
    • Ongoing
  • Scale out collection process
    • Try out crowdsourcing services like Amazon Mechanical Turk?
    • Try an active learning approach?
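
One way the active learning idea could look, as a minimal sketch only: rank unlabelled character crops by how unsure a classifier is about them and send the least confident ones to the workers first. The function name `least_confident`, the crop ids, and the probability values are all hypothetical; no classifier or data format is defined in this issue.

```python
def least_confident(probabilities, k):
    """Return the ids of the k items whose top class probability is lowest.

    probabilities: dict mapping an image id to a list of class probabilities
    (hypothetical output of some classifier; not defined in this issue).
    """
    # Sort ids by the model's highest class probability, lowest first.
    ranked = sorted(probabilities, key=lambda image_id: max(probabilities[image_id]))
    return ranked[:k]


# Made-up predictions for three unlabelled crops over three classes.
predictions = {
    "crop_a": [0.95, 0.03, 0.02],
    "crop_b": [0.40, 0.35, 0.25],
    "crop_c": [0.60, 0.30, 0.10],
}

# The two crops the model is least sure about would be labelled first.
queue = least_confident(predictions, 2)
print(queue)
```

Least-confidence ranking is only one of several uncertainty measures; it is used here because it needs nothing beyond the predicted probabilities.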
@grosniko

Hey, can I help with the manual work of labelling?

@yustoris
Contributor Author

@grosniko
Sure, you're welcome to help with gathering and labelling cuneiform character regions in the documents.

@grosniko

grosniko commented May 31, 2019 via email

@yustoris
Contributor Author

@grosniko
Sorry for the late reply.

do you have some sort of on-boarding kit?

Unfortunately, there is no good toolkit at the moment 😞. However, I'm wondering if we could use an OSS annotation tool such as https://github.com/tzutalin/labelImg. Do you have any ideas, or know of better tools/platforms?
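
If labelImg were used, its Pascal VOC XML output could be tallied to track progress toward the 200-images-per-class goal. The following is a rough sketch under that assumption; the sample annotation, the file name, and the class names `AN`, `KI`, and `LUGAL` are made up for illustration, and the helper names are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter

TARGET_PER_CLASS = 200  # goal stated in the issue overview


def count_labels(xml_documents):
    """Count labelled boxes per class across Pascal VOC XML strings."""
    counts = Counter()
    for text in xml_documents:
        root = ET.fromstring(text)
        # Each <object> element holds one labelled bounding box.
        for obj in root.iter("object"):
            name = obj.findtext("name")
            if name:
                counts[name] += 1
    return counts


def remaining(counts, classes):
    """Return how many more examples each class needs to reach the target."""
    return {c: max(TARGET_PER_CLASS - counts.get(c, 0), 0) for c in classes}


# Made-up annotation in the Pascal VOC layout (hypothetical file and labels).
SAMPLE = """<annotation>
  <filename>tablet_001.jpg</filename>
  <object><name>AN</name><bndbox><xmin>10</xmin><ymin>12</ymin><xmax>40</xmax><ymax>44</ymax></bndbox></object>
  <object><name>AN</name><bndbox><xmin>50</xmin><ymin>12</ymin><xmax>80</xmax><ymax>44</ymax></bndbox></object>
  <object><name>KI</name><bndbox><xmin>90</xmin><ymin>12</ymin><xmax>120</xmax><ymax>44</ymax></bndbox></object>
</annotation>"""

counts = count_labels([SAMPLE])
todo = remaining(counts, ["AN", "KI", "LUGAL"])
print(todo)
```

In practice the XML strings would be read from the annotation files saved next to the images; they are inlined here so the sketch stays self-contained.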

I've been dreaming of an OCR smartphone app with which you could scan directly over tablets displayed in museums (such as the Louvre or in London) and get a realtime translation

Wonderful idea! It really fascinates me 😍.

Anyway, I'll send you a more detailed request once I've decided how to annotate characters on the hand-copy images. If you have any suggestions, feel free to let me know.
