Study on Text Retrieval Based on Pre-training and Deep Hash
Aiming at the problem of low retrieval efficiency and accuracy in text retrieval,a retrieval model based on pre-trained language model and deep hash method is proposed.Firstly,the prior knowledge of text contained in the pre-trained language model is introduced by transfer learning,and then the input is transformed into high-dimensional vector representation by feature extraction.A hash learning layer is added to the back end of the whole model to fine tune the parameters of the model by designing specific optimization objectives,so as to dynamically learn the hash function and the unique hash representation of each input in the training.Experimental results show that the retrieval accuracy of this method is at least 21.70% and 21.38% higher than that of other benchmark models in top-5 and top-10,respectively.The introduction of hash code makes the model improve the retrieval speed by 40 times under the premise of only losing 4.78% accuracy.Therefore,this method can significantly improve the retrieval accuracy and efficiency,and has a potential application prospect in the field of text retrieval..
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2021 |
---|---|
Erschienen: |
2021 |
Enthalten in: |
Zur Gesamtaufnahme - volume:48 |
---|---|
Enthalten in: |
Jisuanji kexue - 48(2021), 11, Seite 300-306 |
Sprache: |
Chinesisch |
---|
Beteiligte Personen: |
ZOU Ao, HAO Wen-ning, JIN Da-wei, CHEN Gang, TIAN Yuan [VerfasserIn] |
---|
Links: |
doi.org [kostenfrei] |
---|
Themen: |
Computer software |
---|
doi: |
10.11896/jsjkx.210300266 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
DOAJ075344475 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | DOAJ075344475 | ||
003 | DE-627 | ||
005 | 20230502152741.0 | ||
007 | cr uuu---uuuuu | ||
008 | 230228s2021 xx |||||o 00| ||chi c | ||
024 | 7 | |a 10.11896/jsjkx.210300266 |2 doi | |
035 | |a (DE-627)DOAJ075344475 | ||
035 | |a (DE-599)DOAJc04c4058a58a4d30a2a95e0e2d2a561a | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a chi | ||
050 | 0 | |a QA76.75-76.765 | |
050 | 0 | |a T1-995 | |
100 | 0 | |a ZOU Ao, HAO Wen-ning, JIN Da-wei, CHEN Gang, TIAN Yuan |e verfasserin |4 aut | |
245 | 1 | 0 | |a Study on Text Retrieval Based on Pre-training and Deep Hash |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Aiming at the problem of low retrieval efficiency and accuracy in text retrieval,a retrieval model based on pre-trained language model and deep hash method is proposed.Firstly,the prior knowledge of text contained in the pre-trained language model is introduced by transfer learning,and then the input is transformed into high-dimensional vector representation by feature extraction.A hash learning layer is added to the back end of the whole model to fine tune the parameters of the model by designing specific optimization objectives,so as to dynamically learn the hash function and the unique hash representation of each input in the training.Experimental results show that the retrieval accuracy of this method is at least 21.70% and 21.38% higher than that of other benchmark models in top-5 and top-10,respectively.The introduction of hash code makes the model improve the retrieval speed by 40 times under the premise of only losing 4.78% accuracy.Therefore,this method can significantly improve the retrieval accuracy and efficiency,and has a potential application prospect in the field of text retrieval. | ||
650 | 4 | |a deep learning|similarity retrieval|pre-trained language model|deep hash | |
653 | 0 | |a Computer software | |
653 | 0 | |a Technology (General) | |
773 | 0 | 8 | |i In |t Jisuanji kexue |d Editorial office of Computer Science, 2021 |g 48(2021), 11, Seite 300-306 |w (DE-627)DOAJ078619254 |x 1002137X |7 nnns |
773 | 1 | 8 | |g volume:48 |g year:2021 |g number:11 |g pages:300-306 |
856 | 4 | 0 | |u https://doi.org/10.11896/jsjkx.210300266 |z kostenfrei |
856 | 4 | 0 | |u https://doaj.org/article/c04c4058a58a4d30a2a95e0e2d2a561a |z kostenfrei |
856 | 4 | 0 | |u https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-11-300.pdf |z kostenfrei |
856 | 4 | 2 | |u https://doaj.org/toc/1002-137X |y Journal toc |z kostenfrei |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_DOAJ | ||
912 | |a SSG-OLC-PHA | ||
951 | |a AR | ||
952 | |d 48 |j 2021 |e 11 |h 300-306 |