A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks
Real-world systems are often formulated as constrained optimization problems. Techniques to incorporate constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about the successful incorporation of constraints into the generated model. This paper describes in detail the two-stage training method for Neural ODEs, a simple, effective, and penalty parameter-free approach to model constrained systems. In this approach the constrained optimization problem is rewritten as two unconstrained sub-problems that are solved in two stages. The first stage aims at finding feasible NN parameters by minimizing a measure of constraints violation. The second stage aims to find the optimal NN parameters by minimizing the loss function while keeping inside the feasible region. We experimentally demonstrate that our method produces models that satisfy the constraints and also improves their predictive performance. Thus, ensuring compliance with critical system properties and also contributing to reducing data quantity requirements. Furthermore, we show that the proposed method improves the convergence to an optimal solution and improves the explainability of Neural ODE models. Our proposed two-stage training method can be used with any NN architectures..
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
arXiv.org - (2024) vom: 05. März Zur Gesamtaufnahme - year:2024 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Coelho, C. [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
000 |
---|
Förderinstitution / Projekttitel: |
|
---|
PPN (Katalog-ID): |
XCH042798701 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | XCH042798701 | ||
003 | DE-627 | ||
005 | 20240306114522.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240306s2024 xx |||||o 00| ||eng c | ||
035 | |a (DE-627)XCH042798701 | ||
035 | |a (chemrXiv)2403.02730 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Coelho, C. |e verfasserin |4 aut | |
245 | 1 | 0 | |a A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Real-world systems are often formulated as constrained optimization problems. Techniques to incorporate constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about the successful incorporation of constraints into the generated model. This paper describes in detail the two-stage training method for Neural ODEs, a simple, effective, and penalty parameter-free approach to model constrained systems. In this approach the constrained optimization problem is rewritten as two unconstrained sub-problems that are solved in two stages. The first stage aims at finding feasible NN parameters by minimizing a measure of constraints violation. The second stage aims to find the optimal NN parameters by minimizing the loss function while keeping inside the feasible region. We experimentally demonstrate that our method produces models that satisfy the constraints and also improves their predictive performance. Thus, ensuring compliance with critical system properties and also contributing to reducing data quantity requirements. Furthermore, we show that the proposed method improves the convergence to an optimal solution and improves the explainability of Neural ODE models. Our proposed two-stage training method can be used with any NN architectures. | ||
650 | 4 | |a Computer Science - Machine Learning |7 (dpeaa)DE-84 | |
650 | 4 | |a Computer Science - Computational Engineering; Finance; and Science |7 (dpeaa)DE-84 | |
650 | 4 | |a Mathematics - Optimization and Control |7 (dpeaa)DE-84 | |
650 | 4 | |a 000 |7 (dpeaa)DE-84 | |
650 | 4 | |a 510 |7 (dpeaa)DE-84 | |
700 | 1 | |a Costa, M. Fernanda P. |4 aut | |
700 | 1 | |a Ferrás, L. L. |4 aut | |
773 | 0 | 8 | |i Enthalten in |t arXiv.org |g (2024) vom: 05. März |
773 | 1 | 8 | |g year:2024 |g day:05 |g month:03 |
856 | 4 | 0 | |u https://arxiv.org/abs/2403.02730 |z kostenfrei |3 Volltext |
912 | |a GBV_XCH | ||
951 | |a AR | ||
952 | |j 2024 |b 05 |c 03 |