Skip to content

ZEROICEWANG/MEPNet

Repository files navigation

[MEPNet: Mask Prior-distribution Interaction and Edge Probability Estimation for Salient Object Detection]

by XX

Introduction

Salient Object Detection (SOD) has been researched extensively and achieved impressive performance. However, current methods still present defective results while facing complex scenes. Because the deceptive background hinders the methods from distinguishing the salient subject and presenting discriminative edges. To address this issue, we propose a method named MEPNet to realize two tasks: interacting the mask prior-distribution with multi-scale feature to suppress the background disturbance; and estimating the edge probability of salient objects to boost the detail of decoding results. The proposed method MEPNet is mainly composed of four kinds of modules, including Lite Receptive Filed Block (RFB-Lite) module, Prior Query (PQ) module, Full-scale sub-Decoder (FD) module, and Edge Auxiliary (EA) module. The RFB-Lite adopts the multi-scale convolution with grouped branches to efficiently reduce channel redundancy and enhance the diversity of semantics. The PQ introduces the mask prior-distribution into the fused feature by multi-head cross-attention. To solve the conflict between the number of attention heads and the computational complexity in cross-attention, Multi-head L-Cross Attention (MLAC) is proposed to self-weight the feature while calculating the attention score matrix and global attention. The FD realizes the bi-direction decoding with feature pyramid network (PFN) and reversed feature pyramid network (FPN-R). Three FDs adopted in MEPNet present a multi-basis decoding to fully utilize multi-scale features. The alternating use of PQ and FD ensures the suppression of background disturbance. The EA, as the last part of MEPNet, boosts the decoding results with edge filter estimating the edge probability and edge refine correcting the up-sampled decoding results. The SOD experimental results on DUTS-TE, HKU-IS, PASCAL-S, ECSSD, and DUT-OMRON datasets demonstrate that the proposed MEPNet is more robust under different complex scenes when compared to some state-of-the-art (SOTA) methods.

Prerequisites

Clone repository

git clone https://github.com/ZEROICEWANG/MEPNet.git
cd MEPNet/

Download dataset

Download the following datasets and unzip them into ../SOD_Data folder

Download model

  • If you want to test the performance of MEPNet, please download the model(Baidu Google) into models/RES_Model folder, and download the pretrained model (Baidu Google) into models/pretrained/prior_query2_channel_last_L_attention_rm_SA/pretrain folder.

Training

    python3 train.py # or using bash cmd_train.sh

Testing

    python3 predict.py # or using bash cmd.sh
  • After testing, saliency maps of PASCAL-S, ECSSD, HKU-IS, DUT-OMRON, DUTS-TE will be saved in predict_result/ folder.

Saliency maps & Pre-Trained model & Trained model

full source code will be released soon

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published