Thanks for your work and open-source code! I have some questions:
1. From my perspective, the main difference between TBD-Baseline and ADA-Track is whether the track queries and detection queries perform self-attention jointly in the shared decoder layer. This mainly affects detection performance, since it fuses temporal information, and thereby improves the MOTA metric indirectly.
2. As we know, MUTR3D is based on MOTRv1. Did you try the tricks of MOTRv3, such as better assignment for enhancing detection performance, on MUTR3D?
3. Regarding the DETR3D/PETR detector, have you tested whether adding association layers has any impact on detection performance?
I'd appreciate it if you could answer the above questions. @dsx0511
JingweiZhang12 changed the title from "Some questions about TBD-Baseline" to "Some questions about TBD-Baseline and detection/tracking performance" on Aug 9, 2024
Therefore, the difference between ADA-Track and TBD-Baseline does not lie only in self-attention. The architecture also changes association performance: the learned association modules in ADA-Track fuse detection information with previous association results layer by layer, yielding a mutual optimization of both tasks. In contrast, the association layers of TBD-Baseline cannot influence the detection layers in the forward pass.
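To make the data-flow difference concrete, here is a toy sketch in plain Python. It is not the ADA-Track or TBD-Baseline code: the function names (`detect`, `decoupled`, `alternating`), the single-head softmax attention, and the additive feedback step are all assumptions chosen only to illustrate that association can reach the detection queries in one flow but not in the other.

```python
import math

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def attention_weights(queries, keys):
    """Row-wise softmax over scaled dot-product scores."""
    scale = math.sqrt(len(queries[0]))
    weights = []
    for row in matmul(queries, transpose(keys)):
        top = max(row)  # subtract the max for numerical stability
        exps = [math.exp((s - top) / scale) for s in row]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

def detect(det_q, feats):
    """Hypothetical detection layer: detection queries attend to image features."""
    return add(det_q, matmul(attention_weights(det_q, feats), feats))

def decoupled(track_q, det_q, feats, num_layers=2):
    """Decoupled flow: all detection layers run first, association runs afterwards."""
    for _ in range(num_layers):
        det_q = detect(det_q, feats)
    for _ in range(num_layers):
        aff = attention_weights(track_q, det_q)     # track-to-detection affinity
        track_q = add(track_q, matmul(aff, det_q))  # only the tracks are updated
    return det_q, aff

def alternating(track_q, det_q, feats, num_layers=2):
    """Alternating flow: each layer detects, associates, then feeds association back."""
    for _ in range(num_layers):
        det_q = detect(det_q, feats)
        aff = attention_weights(track_q, det_q)
        track_q = add(track_q, matmul(aff, det_q))
        # Assumed feedback step: association results update the detection queries.
        det_q = add(det_q, matmul(transpose(aff), track_q))
    return det_q, aff

feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
det = [[0.5, -0.5], [0.2, 0.8]]
tracks_a, tracks_b = [[1.0, 0.0]], [[0.0, 1.0]]

# Decoupled: detection output is identical regardless of the track queries.
same = decoupled(tracks_a, det, feats)[0] == decoupled(tracks_b, det, feats)[0]
# Alternating: detection output changes when the track queries change.
differs = alternating(tracks_a, det, feats)[0] != alternating(tracks_b, det, feats)[0]
```

In this sketch, swapping the track queries leaves the detection output of `decoupled` untouched, while the detection output of `alternating` changes, which mirrors the point that association can only shape detection in the forward pass when the two are interleaved.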
2/3. We have not tried these yet, but thank you for the suggestions. They are very valuable for our future work!
Hopefully, the response is not too late for you. And I'm looking forward to further discussion!