by XX
Fully convolutional neural (FCN) networks based on encoder-decoder structures are adopted for salient object detection (SOD) and achieve state-of-the-art performance. However, most of the previous works aggregate multilevel features layer by layer and ignore long-range dependencies of spatial context, leading to imbalanced feature processes and context separation. In addition, the burry edges obtained from previous SOD methods inevitably influence the detection accuracy. To handle these problems, this study proposes an effective recursion-based SOD network R-Net adopting a cascade structure, which contains a recursive decoder module (RDM), a long-range dependency module (LRDM), and an edge refinement module (ERM). The RDM can scale different layer features to the same size and realize multiple feature fusion while recovering data layer by layer. The LRDM bridges the encoder and decoder by weighting multilevel features to establish the long-range feature dependency of the spatial context. And the ERM attached to the decoder introduces the shallow reference feature to refine the burry edges for obtaining delicate detection results. The SOD experimental results on DUTS-TE, HKU-IS, PASCAL-S, ECSSD, and DUT-OMRON datasets demonstrate that the proposed R-Net is more robust under different complex scenes compared with some state-of-the-art methods.
git clone [email protected]:ZEROICEWANG/R-Net.git
cd R-Net/
Download the following datasets and unzip them into data
folder
- If you want to test the performance of R-Net, please download the model(Baidu Google) into
models/R-Model
folder
python3 train.py
python3 predict.py
- After testing, saliency maps of
PASCAL-S
,ECSSD
,HKU-IS
,DUT-OMRON
,DUTS-TE
will be saved inpredict_result/R-Net/
folder.