Tennis table dataset #29

dfbakin · 2024-04-20T15:48:07Z

Main changes

Uploaded data to DVC
Added parser from binary data (.out files) to single-channel .png
added YAML file with intrinsic parameters and results of stereo calibration (translation and rotation vectors)

feat: added new parser from binary .out file to raw png

dpaleyev · 2024-04-29T12:49:33Z

Added final version of annotation. 4050 images from different sets were annotated.

Answering @BakinDF question, I'd prefer to left all images and annotation we have in master as it is now.

AndBondStyle · 2024-05-04T14:58:26Z

LGTM overall, haven't found anything critical. However, some things to consider:

Is there any raw video files? Looks like there are only images in dvc. Arguably, I'll store all videos in the raw format, and delegate any other manipulations (e.g. splitting the raw video into .png frames and converting bayer to rgb) to other scripts and pipelines. For labelling datasets, it should be possible to reference the raw frame by index inside the original raw video. But maybe it's too complicated, and just for labelling, data redundancy is acceptable.
It would be convenient to have some sort of description of each dataset. This includes: some sort of overview, or even a short preview video; which cameras were used and with what settings; how cameras was positioned relative to the table and each other (measured in real world); how the dataset is structured, how images/video is stored, how to work with it (e.g. example script to parse the images)
Folder names should be named consistently, so underscore_case or kebab-case is preferred
requirements.txt is still used, however there's also a modern pyproject.toml file in the project. Maybe we should consider moving requirements under poetry. We can revisit this when (if?) the @dpaleyev's ML training/inference pipeline will be added to this repo
binary_parser.py: useful tool, but I think we'll be extending its functionality in the future. Also, ffmpeg can be used to split the raw video into separate images, and it may be faster or more convenient in some cases. But having a python wrapper with a nice CLI is always ok

feat: added datasets via DVC

3b3841e

feat: added new parser from binary .out file to raw png

dfbakin added the enhancement New feature or request label Apr 20, 2024

dfbakin self-assigned this Apr 20, 2024

dfbakin and others added 7 commits April 20, 2024 19:01

fix: minor codestyle

90480ac

feat: added dataset for 2ms exposure

795de87

feat: added sorted frames json files for orange dark 2ms frames

bead8d7

feat: sorted dataset orange 2ms

05b908c

feat: stats for frames in dataset

31b1d46

fix: replaced annotation

a49d8ee

feat: segmentation masks for dataset

d4735e2

dpaleyev marked this pull request as ready for review April 29, 2024 12:49

dpaleyev requested review from dasimagin and AndBondStyle April 29, 2024 12:50

Denis Bakin added 2 commits May 2, 2024 16:32

feat: added recorded parameters to dataset

8a02d2d

fix: yaml codestyle

6c5530c

dfbakin mentioned this pull request May 2, 2024

Calibration node binary #30

Merged

AndBondStyle approved these changes May 4, 2024

View reviewed changes

feat: annotated datasets

015b6b1

dfbakin merged commit 784c41e into master May 4, 2024
3 checks passed