Genetic algorithm-based semisupervised convolutional neural network for real-time monitoring of Escherichia coli fermentation of recombinant protein production using a Raman sensor
© 2024 Wiley Periodicals LLC..
As a non-destructive sensing technique, Raman spectroscopy is often combined with regression models for real-time detection of key components in microbial cultivation processes. However, achieving accurate model predictions often requires a large amount of offline measurement data for training, which is both time-consuming and labor-intensive. In order to overcome the limitations of traditional models that rely on large datasets and complex spectral preprocessing, in addition to the difficulty of training models with limited samples, we have explored a genetic algorithm-based semi-supervised convolutional neural network (GA-SCNN). GA-SCNN integrates unsupervised process spectral labeling, feature extraction, regression prediction, and transfer learning. Using only an extremely small number of offline samples of the target protein, this framework can accurately predict protein concentration, which represents a significant challenge for other models. The effectiveness of the framework has been validated in a system of Escherichia coli expressing recombinant ProA5M protein. By utilizing the labeling technique of this framework, the available dataset for glucose, lactate, ammonium ions, and optical density at 600 nm (OD600) has been expanded from 52 samples to 1302 samples. Furthermore, by introducing a small component of offline detection data for recombinant proteins into the OD600 model through transfer learning, a model for target protein detection has been retrained, providing a new direction for the development of associated models. Comparative analysis with traditional algorithms demonstrates that the GA-SCNN framework exhibits good adaptability when there is no complex spectral preprocessing. Cross-validation results confirm the robustness and high accuracy of the framework, with the predicted values of the model highly consistent with the offline measurement results.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:121 |
---|---|
Enthalten in: |
Biotechnology and bioengineering - 121(2024), 5 vom: 12. Apr., Seite 1583-1595 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Liu, Yuan [VerfasserIn] |
---|
Links: |
---|
Themen: |
Convolutional neural network |
---|
Anmerkungen: |
Date Completed 15.04.2024 Date Revised 15.04.2024 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1002/bit.28661 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM367378558 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM367378558 | ||
003 | DE-627 | ||
005 | 20240415233124.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240122s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1002/bit.28661 |2 doi | |
028 | 5 | 2 | |a pubmed24n1376.xml |
035 | |a (DE-627)NLM367378558 | ||
035 | |a (NLM)38247359 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Liu, Yuan |e verfasserin |4 aut | |
245 | 1 | 0 | |a Genetic algorithm-based semisupervised convolutional neural network for real-time monitoring of Escherichia coli fermentation of recombinant protein production using a Raman sensor |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 15.04.2024 | ||
500 | |a Date Revised 15.04.2024 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2024 Wiley Periodicals LLC. | ||
520 | |a As a non-destructive sensing technique, Raman spectroscopy is often combined with regression models for real-time detection of key components in microbial cultivation processes. However, achieving accurate model predictions often requires a large amount of offline measurement data for training, which is both time-consuming and labor-intensive. In order to overcome the limitations of traditional models that rely on large datasets and complex spectral preprocessing, in addition to the difficulty of training models with limited samples, we have explored a genetic algorithm-based semi-supervised convolutional neural network (GA-SCNN). GA-SCNN integrates unsupervised process spectral labeling, feature extraction, regression prediction, and transfer learning. Using only an extremely small number of offline samples of the target protein, this framework can accurately predict protein concentration, which represents a significant challenge for other models. The effectiveness of the framework has been validated in a system of Escherichia coli expressing recombinant ProA5M protein. By utilizing the labeling technique of this framework, the available dataset for glucose, lactate, ammonium ions, and optical density at 600 nm (OD600) has been expanded from 52 samples to 1302 samples. Furthermore, by introducing a small component of offline detection data for recombinant proteins into the OD600 model through transfer learning, a model for target protein detection has been retrained, providing a new direction for the development of associated models. Comparative analysis with traditional algorithms demonstrates that the GA-SCNN framework exhibits good adaptability when there is no complex spectral preprocessing. Cross-validation results confirm the robustness and high accuracy of the framework, with the predicted values of the model highly consistent with the offline measurement results | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Escherichia coli | |
650 | 4 | |a Raman spectroscopy | |
650 | 4 | |a convolutional neural network | |
650 | 4 | |a genetic algorithm | |
650 | 4 | |a recombinant protein expression | |
650 | 4 | |a transfer learning | |
650 | 7 | |a Recombinant Proteins |2 NLM | |
700 | 1 | |a Zhou, Xiaotian |e verfasserin |4 aut | |
700 | 1 | |a Wang, Teng |e verfasserin |4 aut | |
700 | 1 | |a Luo, An |e verfasserin |4 aut | |
700 | 1 | |a Jia, Zhaojun |e verfasserin |4 aut | |
700 | 1 | |a Pan, Xingquan |e verfasserin |4 aut | |
700 | 1 | |a Cai, Weiqi |e verfasserin |4 aut | |
700 | 1 | |a Sun, Mengge |e verfasserin |4 aut | |
700 | 1 | |a Wang, Xuezhong |e verfasserin |4 aut | |
700 | 1 | |a Wen, Zhenguo |e verfasserin |4 aut | |
700 | 1 | |a Zhou, Guangzheng |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Biotechnology and bioengineering |d 1976 |g 121(2024), 5 vom: 12. Apr., Seite 1583-1595 |w (DE-627)NLM000014702 |x 1097-0290 |7 nnns |
773 | 1 | 8 | |g volume:121 |g year:2024 |g number:5 |g day:12 |g month:04 |g pages:1583-1595 |
856 | 4 | 0 | |u http://dx.doi.org/10.1002/bit.28661 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 121 |j 2024 |e 5 |b 12 |c 04 |h 1583-1595 |