End-to-End Object Detection with Enhanced Positive Sample Filter
Discarding Non-Maximum Suppression (NMS) post-processing to achieve fully end-to-end object detection is a recent research focus. Previous work has shown that a one-to-one label assignment strategy makes it possible to eliminate NMS during inference. However, this strategy can also produce multiple high-scoring predictions for the same instance because label assignment is inconsistent during training. How to adaptively identify exactly one positive sample as the final prediction for each Ground-Truth instance therefore remains an important problem. In this paper, we propose an Enhanced Positive Sample Filter (EPSF) that selects the single positive sample for each Ground-Truth instance and lowers the confidence of the remaining negative samples. This is mainly achieved with two components: a Dual-stream Feature Enhancement module (DsFE) and a Disentangled Max Pooling Filter (DeMF). DsFE makes full use of representations trained with different targets to provide rich cues for positive sample selection, while DeMF enhances feature discriminability in potential foreground regions with disentangled pooling. With the proposed methods, our end-to-end detector achieves better performance than existing NMS-free object detectors on the COCO, PASCAL VOC, CrowdHuman and Caltech datasets.
Media type: | E-article |
---|---|
Year of publication: | 2023 |
Published: | 2023 |
Contained in: | To the full record - volume:13 |
Contained in: | Applied Sciences - 13(2023), 3, p 1232 |
Language: | English |
Contributors: | Xiaolin Song [author] |
Links: | doi.org [free of charge] |
doi: | 10.3390/app13031232 |
funding: | |
Funding institution / Project title: | |
PPN (catalog ID): | DOAJ080691307 |
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | DOAJ080691307 | ||
003 | DE-627 | ||
005 | 20240413070819.0 | ||
007 | cr uuu---uuuuu | ||
008 | 230310s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3390/app13031232 |2 doi | |
035 | |a (DE-627)DOAJ080691307 | ||
035 | |a (DE-599)DOAJcb37a649f8db4f95b97f7a111f768263 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
050 | 0 | |a TA1-2040 | |
050 | 0 | |a QH301-705.5 | |
050 | 0 | |a QC1-999 | |
050 | 0 | |a QD1-999 | |
100 | 0 | |a Xiaolin Song |e verfasserin |4 aut | |
245 | 1 | 0 | |a End-to-End Object Detection with Enhanced Positive Sample Filter |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Discarding Non-Maximum Suppression (NMS) post-processing and realizing fully end-to-end object detection is a recent research focus. Previous works have proved that the one-to-one label assignment strategy provides the chance to eliminate NMS during inference. However, this strategy might also result in multiple predictions with high scores due to the inconsistency of label assignment during training. Thus, how to adaptively identify only one positive sample as a final prediction for each Ground-Truth instance remains important. In this paper, we propose an Enhanced Positive Sample Filter (EPSF) to filter out the single positive sample for each Ground-Truth instance and lower the confidence of other negative samples. This is mainly achieved with two components: a Dual-stream Feature Enhancement module (DsFE) and a Disentangled Max Pooling Filter (DeMF). DsFE makes full use of representations trained with different targets so as to provide rich information clues for positive sample selection, while DeMF enhances the feature discriminability in potential foreground regions with disentangled pooling. With the proposed methods, our end-to-end detector achieves better performance than existing NMS-free object detectors on COCO, PASCAL VOC, CrowdHuman and Caltech datasets. | ||
650 | 4 | |a end-to-end object detection | |
650 | 4 | |a Enhanced Positive Sample Filter | |
650 | 4 | |a Dual-stream Feature Enhancement | |
650 | 4 | |a Disentangled Max Pooling Filter | |
653 | 0 | |a Technology | |
653 | 0 | |a T | |
653 | 0 | |a Engineering (General). Civil engineering (General) | |
653 | 0 | |a Biology (General) | |
653 | 0 | |a Physics | |
653 | 0 | |a Chemistry | |
700 | 0 | |a Binghui Chen |e verfasserin |4 aut | |
700 | 0 | |a Pengyu Li |e verfasserin |4 aut | |
700 | 0 | |a Biao Wang |e verfasserin |4 aut | |
700 | 0 | |a Honggang Zhang |e verfasserin |4 aut | |
773 | 0 | 8 | |i In |t Applied Sciences |d MDPI AG, 2012 |g 13(2023), 3, p 1232 |w (DE-627)DOAJ000144045 |x 20763417 |7 nnns |
773 | 1 | 8 | |g volume:13 |g year:2023 |g number:3, p 1232 |
856 | 4 | 0 | |u https://doi.org/10.3390/app13031232 |z kostenfrei |
856 | 4 | 0 | |u https://doaj.org/article/cb37a649f8db4f95b97f7a111f768263 |z kostenfrei |
856 | 4 | 0 | |u https://www.mdpi.com/2076-3417/13/3/1232 |z kostenfrei |
856 | 4 | 2 | |u https://doaj.org/toc/2076-3417 |y Journal toc |z kostenfrei |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_DOAJ | ||
951 | |a AR | ||
952 | |d 13 |j 2023 |e 3, p 1232 |