Research on named entity recognition method of marine natural products based on attention mechanism
Copyright © 2023 Ma, Yu, Gao, Wei, Xia, Wang and Liu.
Marine natural product (MNP) entity property information is the basis of marine drug development, and it can be obtained from the original literature. However, traditional methods require extensive manual annotation, their models are inaccurate and slow, and they do not handle inconsistent lexical contexts well. To address these problems, this study proposes a named entity recognition method based on the attention mechanism, the inflated (that is, dilated) convolutional neural network (IDCNN), and the conditional random field (CRF). The method combines the attention mechanism, which uses the lexical properties of words to apply attention weights to the extracted features, with the ability of the inflated convolutional neural network to parallelize operations and to retain long- and short-term memory, and with strong learning ability. The resulting named entity recognition model automatically recognizes entity information in the MNP domain literature. Experiments demonstrate that the proposed model correctly identifies entity information in unstructured chapter-level literature and outperforms the control model on several metrics. In addition, we construct an unstructured text dataset related to MNPs from an open-source dataset, which can be used for research and development in resource-scarce scenarios.
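As an illustration of the inflated (dilated) convolution named in the abstract, the following is a minimal sketch in plain Python, not the authors' implementation. The kernel width of 3 and the dilation rates 1, 1, 2 are assumptions chosen for the example; the abstract does not state them. The function names `dilated_conv1d` and `receptive_field` are hypothetical.

```python
def dilated_conv1d(seq, kernel, dilation):
    """Same-length 1-D dilated convolution (cross-correlation) with zero padding.

    Each output position t sums kernel[j] * seq[t + (j - half) * dilation],
    skipping taps that fall outside the sequence.
    """
    half = len(kernel) // 2
    out = []
    for t in range(len(seq)):
        total = 0.0
        for j, w in enumerate(kernel):
            idx = t + (j - half) * dilation
            if 0 <= idx < len(seq):
                total += w * seq[idx]
        out.append(total)
    return out


def receptive_field(kernel_width, dilations):
    """Number of input positions one output position can see after stacking layers."""
    return 1 + sum((kernel_width - 1) * d for d in dilations)


if __name__ == "__main__":
    tokens = [1.0, 2.0, 3.0, 4.0, 5.0]      # stand-in for per-token features
    kernel = [1.0, 1.0, 1.0]                # assumed width-3 kernel
    print(dilated_conv1d(tokens, kernel, dilation=2))
    # Assumed dilation schedule 1, 1, 2 (illustrative only).
    print(receptive_field(3, [1, 1, 2]))
```

Every output position is computed independently of the others, which is why such layers parallelize across the sequence, while stacking layers with growing dilation widens the context each token sees without adding many layers.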
Media type: | E-article |
---|---|
Year of publication: | 2023 |
Published: | 2023 |
Contained in: | Frontiers in chemistry, volume 11 (2023), day 21, page 958002 |
Language: | English |
Contributors: | Ma, Xiaodong [author] |
Topics: | Attention mechanism |
Notes: | Date Revised 28.02.2023; published: Electronic-eCollection; Citation Status PubMed-not-MEDLINE |
doi: | 10.3389/fchem.2023.958002 |
PPN (catalog ID): | NLM353529818 |
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM353529818 | ||
003 | DE-627 | ||
005 | 20231226060219.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3389/fchem.2023.958002 |2 doi | |
028 | 5 | 2 | |a pubmed24n1178.xml |
035 | |a (DE-627)NLM353529818 | ||
035 | |a (NLM)36846857 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Ma, Xiaodong |e verfasserin |4 aut | |
245 | 1 | 0 | |a Research on named entity recognition method of marine natural products based on attention mechanism |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 28.02.2023 | ||
500 | |a published: Electronic-eCollection | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a Copyright © 2023 Ma, Yu, Gao, Wei, Xia, Wang and Liu. | ||
520 | |a Marine natural product (MNP) entity property information is the basis of marine drug development, and this entity property information can be obtained from the original literature. However, the traditional methods require several manual annotations, the accuracy of the model is low and slow, and the problem of inconsistent lexical contexts cannot be solved well. In order to solve the aforementioned problems, this study proposes a named entity recognition method based on the attention mechanism, inflated convolutional neural network (IDCNN), and conditional random field (CRF), combining the attention mechanism that can use the lexicality of words to make attention-weighted mentions of the extracted features, the ability of the inflated convolutional neural network to parallelize operations and long- and short-term memory, and the excellent learning ability. A named entity recognition algorithm model is developed for the automatic recognition of entity information in the MNP domain literature. Experiments demonstrate that the proposed model can properly identify entity information from the unstructured chapter-level literature and outperform the control model in several metrics. In addition, we construct an unstructured text dataset related to MNPs from an open-source dataset, which can be used for the research and development of resource scarcity scenarios | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a attention mechanism | |
650 | 4 | |a conditional random field | |
650 | 4 | |a inflated convolutional neural network | |
650 | 4 | |a marine natural products | |
650 | 4 | |a named entity recognition | |
700 | 1 | |a Yu, Rilei |e verfasserin |4 aut | |
700 | 1 | |a Gao, Chunxiao |e verfasserin |4 aut | |
700 | 1 | |a Wei, Zhiqiang |e verfasserin |4 aut | |
700 | 1 | |a Xia, Yimin |e verfasserin |4 aut | |
700 | 1 | |a Wang, Xiaowei |e verfasserin |4 aut | |
700 | 1 | |a Liu, Hao |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Frontiers in chemistry |d 2013 |g 11(2023) vom: 21., Seite 958002 |w (DE-627)NLM237900548 |x 2296-2646 |7 nnns |
773 | 1 | 8 | |g volume:11 |g year:2023 |g day:21 |g pages:958002 |
856 | 4 | 0 | |u http://dx.doi.org/10.3389/fchem.2023.958002 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 11 |j 2023 |b 21 |h 958002 |