Species Classification for Neuroscience Literature Based on Span of Interest Using Sequence-to-Sequence Learning Model

Copyright © 2020 Zhu, Zeng, Wang and Huangfu..

Large-scale neuroscience literature call for effective methods to mine the knowledge from species perspective to link the brain and neuroscience communities, neurorobotics, computing devices, and AI research communities. Structured knowledge can motivate researchers to better understand the functionality and structure of the brain and link the related resources and components. However, the abstracts of massive scientific works do not explicitly mention the species. Therefore, in addition to dictionary-based methods, we need to mine species using cognitive computing models that are more like the human reading process, and these methods can take advantage of the rich information in the literature. We also enable the model to automatically distinguish whether the mentioned species is the main research subject. Distinguishing the two situations can generate value at different levels of knowledge management. We propose SpecExplorer project which is used to explore the knowledge associations of different species for brain and neuroscience. This project frees humans from the tedious task of classifying neuroscience literature by species. Species classification task belongs to the multi-label classification which is more complex than the single-label classification due to the correlation between labels. To resolve this problem, we present the sequence-to-sequence classification framework to adaptively assign multiple species to the literature. To model the structure information of documents, we propose the hierarchical attentive decoding (HAD) to extract span of interest (SOI) for predicting each species. We create three datasets from PubMed and PMC corpora. We present two versions of annotation criteria (mention-based annotation and semantic-based annotation) for species research. Experiments demonstrate that our approach achieves improvements in the final results. Finally, we perform species-based analysis of brain diseases, brain cognitive functions, and proteins related to the hippocampus and provide potential research directions for certain species.

Medienart:

E-Artikel

Erscheinungsjahr:

2020

Erschienen:

2020

Enthalten in:

Zur Gesamtaufnahme - volume:14

Enthalten in:

Frontiers in human neuroscience - 14(2020) vom: 15., Seite 128

Sprache:

Englisch

Beteiligte Personen:

Zhu, Hongyin [VerfasserIn]
Zeng, Yi [VerfasserIn]
Wang, Dongsheng [VerfasserIn]
Huangfu, Cunqing [VerfasserIn]

Links:

Volltext

Themen:

Brain science
Cognitive computing
Corpus annotation
Journal Article
Linked brain data
Multi-label classification
Neuroscience
PubMed

Anmerkungen:

Date Revised 28.03.2024

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.3389/fnhum.2020.00128

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM30956364X