Systematic Protein Prioritization for Targeted Proteomics Studies through Literature Mining
There are more than 3.7 million published articles on the biological functions or disease implications of proteins, constituting an important resource of proteomics knowledge. However, it is difficult to summarize the millions of proteomics findings in the literature manually and quantify their relevance to the biology and diseases of interest. We developed a fully automated bioinformatics framework to identify and prioritize proteins associated with any biological entity. We used the 22 targeted areas of the Biology/Disease-driven (B/D)-Human Proteome Project (HPP) as examples, prioritized the relevant proteins through their Protein Universal Reference Publication-Originated Search Engine (PURPOSE) scores, validated the relevance of the score by comparing the protein prioritization results with a curated database, computed the scores of proteins across the topics of B/D-HPP, and characterized the top proteins in the common model organisms. We further extended the bioinformatics workflow to identify the relevant proteins in all organ systems and human diseases and deployed a cloud-based tool to prioritize proteins related to any custom search terms in real time. Our tool can facilitate the prioritization of proteins for any organ system or disease of interest and can contribute to the development of targeted proteomic studies for precision medicine.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2018 |
---|---|
Erschienen: |
2018 |
Enthalten in: |
Zur Gesamtaufnahme - volume:17 |
---|---|
Enthalten in: |
Journal of proteome research - 17(2018), 4 vom: 06. Apr., Seite 1383-1396 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Yu, Kun-Hsing [VerfasserIn] |
---|
Links: |
---|
Anmerkungen: |
Date Completed 03.06.2019 Date Revised 15.04.2023 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1021/acs.jproteome.7b00772 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM281623600 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM281623600 | ||
003 | DE-627 | ||
005 | 20231225032046.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2018 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1021/acs.jproteome.7b00772 |2 doi | |
028 | 5 | 2 | |a pubmed24n0938.xml |
035 | |a (DE-627)NLM281623600 | ||
035 | |a (NLM)29505266 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Yu, Kun-Hsing |e verfasserin |4 aut | |
245 | 1 | 0 | |a Systematic Protein Prioritization for Targeted Proteomics Studies through Literature Mining |
264 | 1 | |c 2018 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 03.06.2019 | ||
500 | |a Date Revised 15.04.2023 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a There are more than 3.7 million published articles on the biological functions or disease implications of proteins, constituting an important resource of proteomics knowledge. However, it is difficult to summarize the millions of proteomics findings in the literature manually and quantify their relevance to the biology and diseases of interest. We developed a fully automated bioinformatics framework to identify and prioritize proteins associated with any biological entity. We used the 22 targeted areas of the Biology/Disease-driven (B/D)-Human Proteome Project (HPP) as examples, prioritized the relevant proteins through their Protein Universal Reference Publication-Originated Search Engine (PURPOSE) scores, validated the relevance of the score by comparing the protein prioritization results with a curated database, computed the scores of proteins across the topics of B/D-HPP, and characterized the top proteins in the common model organisms. We further extended the bioinformatics workflow to identify the relevant proteins in all organ systems and human diseases and deployed a cloud-based tool to prioritize proteins related to any custom search terms in real time. Our tool can facilitate the prioritization of proteins for any organ system or disease of interest and can contribute to the development of targeted proteomic studies for precision medicine | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Research Support, U.S. Gov't, Non-P.H.S. | |
650 | 4 | |a Human Proteome Project | |
650 | 4 | |a bioinformatics | |
650 | 4 | |a information retrieval | |
650 | 4 | |a literature mining | |
650 | 4 | |a proteomics | |
700 | 1 | |a Lee, Tsung-Lu Michael |e verfasserin |4 aut | |
700 | 1 | |a Wang, Chi-Shiang |e verfasserin |4 aut | |
700 | 1 | |a Chen, Yu-Ju |e verfasserin |4 aut | |
700 | 1 | |a Ré, Christopher |e verfasserin |4 aut | |
700 | 1 | |a Kou, Samuel C |e verfasserin |4 aut | |
700 | 1 | |a Chiang, Jung-Hsien |e verfasserin |4 aut | |
700 | 1 | |a Kohane, Isaac S |e verfasserin |4 aut | |
700 | 1 | |a Snyder, Michael |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of proteome research |d 2002 |g 17(2018), 4 vom: 06. Apr., Seite 1383-1396 |w (DE-627)NLM124173470 |x 1535-3907 |7 nnns |
773 | 1 | 8 | |g volume:17 |g year:2018 |g number:4 |g day:06 |g month:04 |g pages:1383-1396 |
856 | 4 | 0 | |u http://dx.doi.org/10.1021/acs.jproteome.7b00772 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 17 |j 2018 |e 4 |b 06 |c 04 |h 1383-1396 |