gratheon / image-splitter

Processes uploaded beehive frame to detect various objects. This is more of an orchestrator. It splits frame into multiple sections for better detection results (thus the name). In production its upload traffic is not going via graphql-router as federated graphql is not capable of forwarding binary uploads (yet?). Results are stored in mysql and forwarded to redis.

URLs

Dev: http://localhost:8800
Prod: https://image.gratheon.com/graphql

Architecture

Service diagram

flowchart LR
    web-app("<a href='https://github.com/Gratheon/web-app'>web-app</a>\n:8080") --> graphql-router("<a href='https://github.com/Gratheon/graphql-router'>graphql-router</a>") --> image-splitter("<a href='https://github.com/Gratheon/image-splitter'>image-splitter</a>\n:8800") --"poll images for processing every 500ms-10s\nstore inference results"--> mysql
    graphql-router --upload--> image-splitter --"store original upload, download for inference"--> aws-s3
	image-splitter --"detect bees"--> models-bee-detector("<a href='https://github.com/Gratheon/models-bee-detector'>models-bee-detector</a>\n:8700")
	image-splitter --"detect frame cells"--> models-frame-resources("<a href='https://github.com/Gratheon/models-frame-resources'>models-frame-resources</a>\n:8540")
	image-splitter --"detect queen cups\ndetect varroa\ndetect queens"--> clarifai("<a href='https://clarifai.com'>clarifai</a>")

	image-splitter --"event {uid}.frame_side.{frame_side_id}.bees_partially_detected"--> redis
    image-splitter --"event {uid}.frame_side.{frame_side_id}.frame_resources_detected"--> redis
    image-splitter --"event {uid}.frame_side.{frame_side_id}.queen_cups_detected"--> redis

Async worker processing / data flow

We use pure mysql DB for processing async jobs instead of redis pubsub and kafka (at least for now) because

we want to have persistance and to query state of async jobs
we want to have control over retries and error failure states

flowchart LR

upload --"insert new job"--> DB[("DB\njobs table")]
orchestrator --> jobsModel.processJobInLoop --> jobsModel.fetchUnprocessed --"fetch queued jobs"--> DB
jobsModel.processJobInLoop --> startDetection --"mark job as started"--> DB
jobsModel.processJobInLoop --> handler --> resizeOriginalToThumbnails
handler --> detectWorkerBees
handler --> analyzeCells
handler --> detectVarroa
handler --> analyzeQueenCups
handler --> detectQueens
jobsModel.processJobInLoop --"fail job if handler failed"> jobsModel.fail
jobsModel.processJobInLoop --> jobsModel.endDetection --"mark job as complete"--> DB

Development

Copy ./src/config/config.default.ts file as ./src/config/config.dev.ts and change values if needed (for example AWS S3 credentials)

Then start a service in dockerized mode:

just start

DB migrations

We use @databases/mysql and run migrations automatically from migrations folder. Its not perfect, its just pure SQL without ability to run programmatic migrations to have rollback support. It also assumes we have single container that runs this at the service start. To add migration, just add new file. Try to keep same naming convention.

Roadmap / ToDo

Change processing mechanism from polling to a queue (kafka?) to initiate processing faster
Add more test coverage & improve types

Testing

Minio is available at: http://localhost:9001/buckets minio-admin:minio-admin

Unit tests are executed with jest:

npm run test:unit

Integration tests spin up local docker containers and test against them:

make test-integration

Name		Name	Last commit message	Last commit date
Latest commit History 171 Commits
.github/workflows		.github/workflows
config		config
migrations		migrations
src		src
test		test
.dockerignore		.dockerignore
.gitignore		.gitignore
.nvmrc		.nvmrc
CODEOWNERS		CODEOWNERS
Dockerfile.dev		Dockerfile.dev
Dockerfile.prod		Dockerfile.prod
LICENSE		LICENSE
README.md		README.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.test.yml		docker-compose.test.yml
docker-compose.yml		docker-compose.yml
justfile		justfile
package-lock.json		package-lock.json
package.json		package.json
restart.sh		restart.sh
schema.graphql		schema.graphql

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gratheon / image-splitter

URLs

Architecture

Service diagram

Async worker processing / data flow

Development

DB migrations

Roadmap / ToDo

Testing

About

Languages

License

Gratheon/image-splitter

Folders and files

Latest commit

History

Repository files navigation

gratheon / image-splitter

URLs

Architecture

Service diagram

Async worker processing / data flow

Development

DB migrations

Roadmap / ToDo

Testing

About

Resources

License

Stars

Watchers

Forks

Languages