Publication details - Study on Text Retrieval Based on Pre-training and Deep Hash

Study on Text Retrieval Based on Pre-training and Deep Hash

To address low retrieval efficiency and accuracy in text retrieval, a retrieval model based on a pre-trained language model and a deep hash method is proposed. First, the prior knowledge of text contained in the pre-trained language model is introduced through transfer learning, and the input is then transformed into a high-dimensional vector representation by feature extraction. A hash learning layer is added at the back end of the model, and the model parameters are fine-tuned with specifically designed optimization objectives, so that the hash function and a unique hash representation of each input are learned dynamically during training. Experimental results show that the retrieval accuracy of this method is at least 21.70% and 21.38% higher than that of the other benchmark models at top-5 and top-10, respectively. Introducing hash codes speeds up retrieval by a factor of 40 at the cost of only 4.78% accuracy. The method can therefore significantly improve retrieval accuracy and efficiency, and has potential applications in the field of text retrieval.
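The abstract does not give the loss function, the network architecture, or the code length, so the following is only a minimal sketch of the general idea behind hash-based retrieval, not the authors' method. It assumes the hash layer outputs a real-valued vector per text, binarizes it by sign, and ranks candidates by Hamming distance. The helper names (`to_hash_code`, `hamming_distance`, `top_k`) and the toy vectors are hypothetical.

```python
from typing import List, Sequence, Tuple


def to_hash_code(vector: Sequence[float]) -> int:
    """Binarize a real-valued vector by sign and pack the bits into an int.

    Assumption: positive components map to bit 1, all others to bit 0.
    """
    code = 0
    for value in vector:
        code = (code << 1) | (1 if value > 0 else 0)
    return code


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two hash codes differ."""
    return bin(a ^ b).count("1")


def top_k(
    query: Sequence[float], corpus: List[Sequence[float]], k: int
) -> List[Tuple[int, int]]:
    """Return (corpus index, Hamming distance) for the k closest codes."""
    query_code = to_hash_code(query)
    scored = [
        (hamming_distance(query_code, to_hash_code(vector)), index)
        for index, vector in enumerate(corpus)
    ]
    scored.sort()  # smallest distance first, ties broken by index
    return [(index, distance) for distance, index in scored[:k]]


# Toy stand-ins for hash-layer outputs (hypothetical values).
corpus = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.7, -0.6, -0.3, -0.2],
]
query = [0.8, -0.1, 0.5, -0.9]
result = top_k(query, corpus, k=2)
print(result)
```

Comparing compact binary codes with XOR and a bit count is much cheaper than computing similarities between high-dimensional float vectors, which is one plausible reading of where the reported speed-up comes from. Binarization also discards information, which fits the small accuracy loss the abstract reports.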

Media type:	E-article

Year of publication:	2021
Published:	2021

Contained in:	To the parent record - volume:48
Contained in:	Jisuanji kexue - 48(2021), 11, pages 300-306

Language:	Chinese

Contributors:	ZOU Ao, HAO Wen-ning, JIN Da-wei, CHEN Gang, TIAN Yuan [author]

Links:	doi.org [free access] doaj.org [free access] www.jsjkx.com [free access] Journal toc [free access]

Subjects:	Computer software; Deep learning; similarity retrieval; pre-trained language model; deep hash; Technology (General)

doi:	10.11896/jsjkx.210300266

PPN (catalog ID):	DOAJ075344475

Internal format


LEADER	01000caa a22002652 4500
001	DOAJ075344475
003	DE-627
005	20230502152741.0
007	cr uuu---uuuuu
008	230228s2021 xx \|\|\|\|\|o 00\| \|\|chi c
024	7		\|a 10.11896/jsjkx.210300266 \|2 doi
035			\|a (DE-627)DOAJ075344475
035			\|a (DE-599)DOAJc04c4058a58a4d30a2a95e0e2d2a561a
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a chi
050		0	\|a QA76.75-76.765
050		0	\|a T1-995
100	0		\|a ZOU Ao, HAO Wen-ning, JIN Da-wei, CHEN Gang, TIAN Yuan \|e verfasserin \|4 aut
245	1	0	\|a Study on Text Retrieval Based on Pre-training and Deep Hash
264		1	\|c 2021
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
520			\|a Aiming at the problem of low retrieval efficiency and accuracy in text retrieval,a retrieval model based on pre-trained language model and deep hash method is proposed.Firstly,the prior knowledge of text contained in the pre-trained language model is introduced by transfer learning,and then the input is transformed into high-dimensional vector representation by feature extraction.A hash learning layer is added to the back end of the whole model to fine tune the parameters of the model by designing specific optimization objectives,so as to dynamically learn the hash function and the unique hash representation of each input in the training.Experimental results show that the retrieval accuracy of this method is at least 21.70% and 21.38% higher than that of other benchmark models in top-5 and top-10,respectively.The introduction of hash code makes the model improve the retrieval speed by 40 times under the premise of only losing 4.78% accuracy.Therefore,this method can significantly improve the retrieval accuracy and efficiency,and has a potential application prospect in the field of text retrieval.
650		4	\|a deep learning\|similarity retrieval\|pre-trained language model\|deep hash
653		0	\|a Computer software
653		0	\|a Technology (General)
773	0	8	\|i In \|t Jisuanji kexue \|d Editorial office of Computer Science, 2021 \|g 48(2021), 11, Seite 300-306 \|w (DE-627)DOAJ078619254 \|x 1002137X \|7 nnns
773	1	8	\|g volume:48 \|g year:2021 \|g number:11 \|g pages:300-306
856	4	0	\|u https://doi.org/10.11896/jsjkx.210300266 \|z kostenfrei
856	4	0	\|u https://doaj.org/article/c04c4058a58a4d30a2a95e0e2d2a561a \|z kostenfrei
856	4	0	\|u https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-11-300.pdf \|z kostenfrei
856	4	2	\|u https://doaj.org/toc/1002-137X \|y Journal toc \|z kostenfrei
912			\|a GBV_USEFLAG_A
912			\|a GBV_DOAJ
912			\|a SSG-OLC-PHA
951			\|a AR
952			\|d 48 \|j 2021 \|e 11 \|h 300-306
