[request] Semantic segmentation documentation, training code and / or model weights #55
Comments
I would appreciate example code for semantic segmentation. I can't do much with the model's output embeddings yet.
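For context, here is a minimal sketch of how dense patch embeddings can be pulled out of the released checkpoints (the image path and the 518px resolution are placeholders; it assumes the official torch.hub entry points and the `forward_features` output of the DINOv2 repo):

```python
# Sketch: extracting dense DINOv2 patch embeddings via torch.hub.
# "example.jpg" and the 518px resolution are placeholders.
import torch
from PIL import Image
from torchvision import transforms

model = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14")
model.eval()

# Standard ImageNet normalization; the input side length should be a
# multiple of the 14-pixel patch size.
preprocess = transforms.Compose([
    transforms.Resize((518, 518)),
    transforms.ToTensor(),
    transforms.Normalize(mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225)),
])
image = preprocess(Image.open("example.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    out = model.forward_features(image)
    patch_tokens = out["x_norm_patchtokens"]  # (1, 37*37, 384) for ViT-S/14 at 518px

print(patch_tokens.shape)  # these per-patch embeddings are what a segmentation head would consume
```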
STEGO, an unsupervised semantic segmentation model, used DINO v1. cc @mhamilton723
I have created this repo ( https://github.com/itsprakhar/Downstream-Dinov2 ) where I am writing code for using DINOv2 for downstream tasks such as segmentation and classification. You can take a look, create an issue, or help improve it :)
@itsprakhar
@innat-asj, the pretraining does not require labels, but finetuning for downstream tasks does. However, the number of training samples required is much smaller. The finetuning is a kind of "few-shot finetuning": you need some examples because that's how you tell the model what you really want!
I probably missed whether this is also how it is done in the paper for segmentation and depth estimation. Because even if only a few samples are needed, that approach would be understood as semi-supervised. Since DINO is meant to be self-supervised, I was wondering whether we have to fine-tune for downstream tasks using a target signal, or whether a contrastive loss could be used instead.
Hi @innat-asj, DINO (and DINOv2) are self-supervised pretraining methods. Their goal is to create a pretrained vision encoder using only unlabeled data. This model can then output good embeddings that represent images. They are not classification, segmentation, or depth models; they are just pretrained encoders. You can, however, build a segmentation model using DINOv2 by adding a segmentation / depth / classification head and training the head. We show in the paper that the head can be extremely small (just a linear layer) and be trained on very few samples (e.g. ~1k depth images for NYUv2) while still performing competitively, because the encoder outputs good representations. If you are looking for unsupervised segmentation, [STEGO] is a method that leverages DINO to do that. [STEGO]: https://arxiv.org/abs/2203.08414
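As a rough illustration of the "frozen encoder + small head" setup described above, here is a sketch of a linear segmentation head on top of frozen DINOv2 patch tokens. This is not the paper's evaluation code; the class name `LinearSegHead`, the hyperparameters, and the ViT-S/14 choice are assumptions.

```python
# Sketch: linear segmentation head on a frozen DINOv2 encoder (illustrative only).
import torch
import torch.nn as nn
import torch.nn.functional as F

class LinearSegHead(nn.Module):
    """Per-patch linear classifier; logits are upsampled back to the input resolution."""

    def __init__(self, backbone, embed_dim=384, num_classes=21, patch_size=14):
        super().__init__()
        self.backbone = backbone
        self.patch_size = patch_size
        # A 1x1 conv over the patch grid is equivalent to a per-patch linear layer.
        self.classifier = nn.Conv2d(embed_dim, num_classes, kernel_size=1)

    def forward(self, x):
        b, _, h, w = x.shape
        with torch.no_grad():  # encoder stays frozen; only the 1x1 conv is trained
            tokens = self.backbone.forward_features(x)["x_norm_patchtokens"]  # (B, N, C)
        gh, gw = h // self.patch_size, w // self.patch_size
        feat = tokens.permute(0, 2, 1).reshape(b, -1, gh, gw)  # (B, C, gh, gw)
        logits = self.classifier(feat)
        return F.interpolate(logits, size=(h, w), mode="bilinear", align_corners=False)

backbone = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14").eval()
head = LinearSegHead(backbone, num_classes=21)  # e.g. 21 classes for Pascal VOC
# Train only head.classifier with a pixel-wise cross-entropy loss against ground-truth
# masks, e.g. torch.optim.AdamW(head.classifier.parameters(), lr=1e-3).
```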
@TimDarcet
Has anyone managed to reproduce the segmentation results (82.5 mIoU) on the Pascal VOC 2012 dataset?
How can I get the semantic segmentation documentation and training code?