An Encoder-Decoder Architecture within a Classical Signal-Processing Framework for Real-Time Barcode Segmentation
In this work, two methods are proposed for solving the problem of one-dimensional barcode segmentation in images, with an emphasis on augmented reality (AR) applications. These methods take the partial discrete Radon transform as a building block. The first proposed method uses overlapping tiles for obtaining good angle precision while maintaining good spatial precision. The second one uses an encoder-decoder structure inspired by state-of-the-art convolutional neural networks for segmentation while maintaining a classical processing framework, thus not requiring training. It is shown that the second method's processing time is lower than the video acquisition time with a 1024 × 1024 input on a CPU, which had not been previously achieved. The accuracy it obtained on datasets widely used by the scientific community was almost on par with that obtained using the most-recent state-of-the-art methods using deep learning. Beyond the challenges of those datasets, the method proposed is particularly well suited to image sequences taken with short exposure and exhibiting motion blur and lens blur, which are expected in a real-world AR scenario. Two implementations of the proposed methods are made available to the scientific community: one for easy prototyping and one optimised for parallel implementation, which can be run on desktop and mobile phone CPUs.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:23 |
---|---|
Enthalten in: |
Sensors (Basel, Switzerland) - 23(2023), 13 vom: 03. Juli |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Gómez-Cárdenes, Óscar [VerfasserIn] |
---|
Links: |
---|
Themen: |
Barcodes |
---|
Anmerkungen: |
Date Completed 17.07.2023 Date Revised 18.07.2023 published: Electronic Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.3390/s23136109 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM359487041 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM359487041 | ||
003 | DE-627 | ||
005 | 20231226081000.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3390/s23136109 |2 doi | |
028 | 5 | 2 | |a pubmed24n1198.xml |
035 | |a (DE-627)NLM359487041 | ||
035 | |a (NLM)37447960 | ||
035 | |a (PII)6109 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Gómez-Cárdenes, Óscar |e verfasserin |4 aut | |
245 | 1 | 3 | |a An Encoder-Decoder Architecture within a Classical Signal-Processing Framework for Real-Time Barcode Segmentation |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 17.07.2023 | ||
500 | |a Date Revised 18.07.2023 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a In this work, two methods are proposed for solving the problem of one-dimensional barcode segmentation in images, with an emphasis on augmented reality (AR) applications. These methods take the partial discrete Radon transform as a building block. The first proposed method uses overlapping tiles for obtaining good angle precision while maintaining good spatial precision. The second one uses an encoder-decoder structure inspired by state-of-the-art convolutional neural networks for segmentation while maintaining a classical processing framework, thus not requiring training. It is shown that the second method's processing time is lower than the video acquisition time with a 1024 × 1024 input on a CPU, which had not been previously achieved. The accuracy it obtained on datasets widely used by the scientific community was almost on par with that obtained using the most-recent state-of-the-art methods using deep learning. Beyond the challenges of those datasets, the method proposed is particularly well suited to image sequences taken with short exposure and exhibiting motion blur and lens blur, which are expected in a real-world AR scenario. Two implementations of the proposed methods are made available to the scientific community: one for easy prototyping and one optimised for parallel implementation, which can be run on desktop and mobile phone CPUs | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Radon transform | |
650 | 4 | |a barcodes | |
650 | 4 | |a classical signal processing | |
650 | 4 | |a encoder–decoder | |
650 | 4 | |a multiscale DRT | |
650 | 4 | |a pixelwise segmentation | |
650 | 4 | |a scale-space methods | |
700 | 1 | |a Marichal-Hernández, José Gil |e verfasserin |4 aut | |
700 | 1 | |a Son, Jung-Young |e verfasserin |4 aut | |
700 | 1 | |a Pérez Jiménez, Rafael |e verfasserin |4 aut | |
700 | 1 | |a Rodríguez-Ramos, José Manuel |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Sensors (Basel, Switzerland) |d 2007 |g 23(2023), 13 vom: 03. Juli |w (DE-627)NLM187985170 |x 1424-8220 |7 nnns |
773 | 1 | 8 | |g volume:23 |g year:2023 |g number:13 |g day:03 |g month:07 |
856 | 4 | 0 | |u http://dx.doi.org/10.3390/s23136109 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 23 |j 2023 |e 13 |b 03 |c 07 |