Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Many images have just one bounding box for a class that in reality has multiple instances on that image #65

Open
lamerman opened this issue Feb 16, 2018 · 10 comments

Comments

@lamerman
Copy link

For example this image:

download

It has multiple chairs, but only one of them has bounding box.

Is it how it was classified or was it done intentionally?

@rkrasin
Copy link
Contributor

rkrasin commented Feb 16, 2018

@samihaija Sami, can you please comment on this?

@samihaija
Copy link

Ivan, thank you for the mention!

The above is intentional, assuming the image came from the training partition. We only have one box per image per entity, for the training set. However, for the validation set, we tried to have all instances boxed.

@lamerman
Copy link
Author

lamerman commented Feb 17, 2018

@samihaija @rkrasin thank you.

@samihaija I'm not a big expert in object detection algorithms, but my first guess would be that a neural network that is learning on this data will be penalized while training. It predicts multiple chairs, but the training data has only one and my guess would be that when it predicts YES and the training data says it NO, when in reality it's YES, the network will be penalized for such predictions.

It's much more a question than statement, as I am not sure.

What do you think, could it be a problem?

P.s. I'm trying to teach YOLO using openimages.

@lamerman
Copy link
Author

I looked at the loss function of YOLO

screenshot_20180220_181624

And it seems like absence of bounding box for image when in reality it should be will affect the loss function on line 4. And it's interesting what was the reasoning behind making only one bounding box for openimages.

@phdung
Copy link

phdung commented Jun 18, 2018

I am curious about this method also. @lamerman @samihaija any updates about this?

@dbrazey
Copy link

dbrazey commented Jun 27, 2018

Does someone has an explanation concerning the fact that training data contains only one bounding box instead of all boxes ?

As a result, this dataset cannot be used to train object detection algorithm ?

@dashesy
Copy link

dashesy commented Jul 11, 2018

because of this, Yolo does not train well with this dataset.

@ShaneYS
Copy link

ShaneYS commented Jul 16, 2018

@dashesy hi, do you try any other detection algorithm ( like faster rcnn, ssd ) with openimages dataset? I try to train faster rcnn with mxnet with openimages, but I have many problems when preprocessing the dataset.

1 similar comment
@ShaneYS
Copy link

ShaneYS commented Jul 16, 2018

@dashesy hi, do you try any other detection algorithm ( like faster rcnn, ssd ) with openimages dataset? I try to train faster rcnn with mxnet with openimages, but I have many problems when preprocessing the dataset.

@kevinsu628
Copy link

Further, a lot of images have missing labels. Is this problem fixable?

openImage_issue_1
openImage_issue_2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants