Elucidating an Atmospheric Brown Carbon Species-Toward Supplanting Chemical Intuition with Exhaustive Enumeration and Machine Learning
Brown carbon (BrC) is involved in atmospheric light absorption and climate forcing and can cause adverse health effects. Understanding the formation mechanisms and molecular structure of BrC is of key importance in developing strategies to control its environmental and health impacts. Structure determination of BrC is challenging, due to the lack of experiments providing molecular fingerprints and the sheer number of molecular candidates with identical mass. Suggestions based on chemical intuition are prone to errors because of their inherent bias. We present an unbiased algorithm, using graph-based molecule generation and machine learning, which can identify all molecular structures of compounds involved in biomass burning and the composition of BrC. We apply this algorithm to C12H12O7, a light-absorbing "test case" molecule identified in chamber experiments on the aqueous photo-oxidation of syringol, a prevalent marker in wood smoke. Of the 260 million molecular graphs, the algorithm leaves only 36,518 (0.01%) as viable candidates matching the spectrum. Although no unique molecular structure is obtained from only a chemical formula and a UV/vis absorption spectrum, we discuss further reduction strategies and their efficacy. With additional data, the method can potentially more rapidly identify isomers extracted from lab and field aerosol particles without introducing human bias.

| Field | Value |
|---|---|
| Media type | E-article |
| Year of publication | 2021 |
| Published | 2021 |
| Contained in | To the parent record - volume:55 |
| Contained in | Environmental science & technology, 55 (2021), no. 12, 15 June, pages 8447-8457 |
| Language | English |
| Contributors | Tapavicza, Enrico [author] |
| Notes | Date Completed 01.07.2021; Date Revised 31.12.2021; published: Print-Electronic; Citation Status MEDLINE |
| DOI | 10.1021/acs.est.1c00885 |
| PPN (catalog ID) | NLM326292268 |

LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM326292268 | ||
003 | DE-627 | ||
005 | 20231225194258.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2021 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1021/acs.est.1c00885 |2 doi | |
028 | 5 | 2 | |a pubmed24n1087.xml |
035 | |a (DE-627)NLM326292268 | ||
035 | |a (NLM)34080853 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Tapavicza, Enrico |e verfasserin |4 aut | |
245 | 1 | 0 | |a Elucidating an Atmospheric Brown Carbon Species-Toward Supplanting Chemical Intuition with Exhaustive Enumeration and Machine Learning |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 01.07.2021 | ||
500 | |a Date Revised 31.12.2021 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Brown carbon (BrC) is involved in atmospheric light absorption and climate forcing and can cause adverse health effects. Understanding the formation mechanisms and molecular structure of BrC is of key importance in developing strategies to control its environment and health impact. Structure determination of BrC is challenging, due to the lack of experiments providing molecular fingerprints and the sheer number of molecular candidates with identical mass. Suggestions based on chemical intuition are prone to errors due to the inherent bias. We present an unbiased algorithm, using graph-based molecule generation and machine learning, which can identify all molecular structures of compounds involved in biomass burning and the composition of BrC. We apply this algorithm to C12H12O7, a light-absorbing "test case" molecule identified in chamber experiments on the aqueous photo-oxidation of syringol, a prevalent marker in wood smoke. Of the 260 million molecular graphs, the algorithm leaves only 36,518 (0.01%) as viable candidates matching the spectrum. Although no unique molecular structure is obtained from only a chemical formula and a UV/vis absorption spectrum, we discuss further reduction strategies and their efficacy. With additional data, the method can potentially more rapidly identify isomers extracted from lab and field aerosol particles without introducing human bias | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Research Support, U.S. Gov't, Non-P.H.S. | |
650 | 4 | |a biomass burning | |
650 | 4 | |a chemical diversity | |
650 | 4 | |a chemical space | |
650 | 4 | |a light absorption | |
650 | 4 | |a oligomers | |
650 | 4 | |a structure determination | |
650 | 7 | |a Aerosols |2 NLM | |
650 | 7 | |a Carbon |2 NLM | |
650 | 7 | |a 7440-44-0 |2 NLM | |
700 | 1 | |a von Rudorff, Guido Falk |e verfasserin |4 aut | |
700 | 1 | |a De Haan, David O |e verfasserin |4 aut | |
700 | 1 | |a Contin, Mario |e verfasserin |4 aut | |
700 | 1 | |a George, Christian |e verfasserin |4 aut | |
700 | 1 | |a Riva, Matthieu |e verfasserin |4 aut | |
700 | 1 | |a von Lilienfeld, O Anatole |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Environmental science & technology |d 1967 |g 55(2021), 12 vom: 15. Juni, Seite 8447-8457 |w (DE-627)NLM112374735 |x 1520-5851 |7 nnns |
773 | 1 | 8 | |g volume:55 |g year:2021 |g number:12 |g day:15 |g month:06 |g pages:8447-8457 |
856 | 4 | 0 | |u http://dx.doi.org/10.1021/acs.est.1c00885 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 55 |j 2021 |e 12 |b 15 |c 06 |h 8447-8457 |