Publication details - A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

Real-world systems are often formulated as constrained optimization problems. Techniques for incorporating constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about whether the constraints are successfully incorporated into the generated model. This paper describes in detail the two-stage training method for Neural ODEs, a simple, effective, and penalty-parameter-free approach to modeling constrained systems. In this approach, the constrained optimization problem is rewritten as two unconstrained sub-problems that are solved in two stages. The first stage aims to find feasible NN parameters by minimizing a measure of constraint violation. The second stage aims to find the optimal NN parameters by minimizing the loss function while staying inside the feasible region. We experimentally demonstrate that our method produces models that satisfy the constraints and also improves their predictive performance, thus ensuring compliance with critical system properties and helping to reduce data quantity requirements. Furthermore, we show that the proposed method improves convergence to an optimal solution and improves the explainability of Neural ODE models. Our proposed two-stage training method can be used with any NN architecture.
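
The two stages described in the abstract can be illustrated with a minimal sketch on a toy problem. This is not the authors' implementation: the two-parameter quadratic loss, the single linear constraint `a + b <= 2`, the step sizes, and all function names are assumptions chosen for illustration. In particular, the abstract does not say how the second stage stays inside the feasible region; the sketch assumes a simple rule that halves the step until the candidate is both feasible and improving.

```python
# Toy illustration of a two-stage scheme (all names and numbers are hypothetical).
# Parameters theta = (a, b); constraint: a + b <= 2.

def loss(theta):
    a, b = theta
    return (a - 2.0) ** 2 + (b - 1.0) ** 2

def loss_grad(theta):
    a, b = theta
    return (2.0 * (a - 2.0), 2.0 * (b - 1.0))

def violation(theta):
    # Squared amount by which the constraint a + b <= 2 is violated.
    a, b = theta
    return max(0.0, a + b - 2.0) ** 2

def violation_grad(theta):
    a, b = theta
    g = 2.0 * max(0.0, a + b - 2.0)
    return (g, g)

def step(theta, grad, lr):
    return tuple(t - lr * g for t, g in zip(theta, grad))

def stage_one(theta, lr=0.3, max_iters=100):
    # Stage 1: minimize the constraint violation only, ignoring the loss.
    for _ in range(max_iters):
        if violation(theta) == 0.0:
            break
        theta = step(theta, violation_grad(theta), lr)
    return theta

def stage_two(theta, lr=0.1, max_iters=200, min_lr=1e-8):
    # Stage 2: minimize the loss, accepting a step only if it stays feasible
    # and lowers the loss (assumed acceptance rule, not taken from the paper).
    for _ in range(max_iters):
        grad = loss_grad(theta)
        trial_lr = lr
        while trial_lr >= min_lr:
            candidate = step(theta, grad, trial_lr)
            if violation(candidate) == 0.0 and loss(candidate) < loss(theta):
                theta = candidate
                break
            trial_lr *= 0.5
        else:
            break  # no feasible improving step was found
    return theta

theta_start = (3.0, 3.0)                  # infeasible starting point
theta_feasible = stage_one(theta_start)   # feasible, but not yet optimal
theta_final = stage_two(theta_feasible)   # lower loss, still feasible
```

Neither stage uses a penalty weight: the first objective is the violation alone and the second is the loss alone, which is the property the abstract calls penalty-parameter-free. The acceptance rule used here can stall near the constraint boundary, so the final point is not guaranteed to be the constrained optimum.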

Media type:	Preprint

Year of publication:	2024
Published:	2024

Contained in:	arXiv.org - (2024), 5 March - year:2024

Language:	English

Contributors:	Coelho, C. [author]; Costa, M. Fernanda P. [author]; Ferrás, L. L. [author]

Links:	Full text [free of charge]

Subjects:	000; 510; Computer Science - Computational Engineering, Finance, and Science; Computer Science - Machine Learning; Mathematics - Optimization and Control

Funding institution / Project title:

PPN (catalog ID):	XCH042798701

Internal format


LEADER	01000naa a22002652 4500
001	XCH042798701
003	DE-627
005	20240306114522.0
007	cr uuu---uuuuu
008	240306s2024 xx |||||o 00| ||eng c
035			|a (DE-627)XCH042798701
035			|a (chemrXiv)2403.02730
040			|a DE-627 |b ger |c DE-627 |e rakwb
041			|a eng
100	1		|a Coelho, C. |e verfasserin |4 aut
245	1	0	|a A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks
264		1	|c 2024
336			|a Text |b txt |2 rdacontent
337			|a Computermedien |b c |2 rdamedia
338			|a Online-Ressource |b cr |2 rdacarrier
520			|a Real-world systems are often formulated as constrained optimization problems. Techniques to incorporate constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about the successful incorporation of constraints into the generated model. This paper describes in detail the two-stage training method for Neural ODEs, a simple, effective, and penalty parameter-free approach to model constrained systems. In this approach the constrained optimization problem is rewritten as two unconstrained sub-problems that are solved in two stages. The first stage aims at finding feasible NN parameters by minimizing a measure of constraints violation. The second stage aims to find the optimal NN parameters by minimizing the loss function while keeping inside the feasible region. We experimentally demonstrate that our method produces models that satisfy the constraints and also improves their predictive performance. Thus, ensuring compliance with critical system properties and also contributing to reducing data quantity requirements. Furthermore, we show that the proposed method improves the convergence to an optimal solution and improves the explainability of Neural ODE models. Our proposed two-stage training method can be used with any NN architectures.
650		4	|a Computer Science - Machine Learning |7 (dpeaa)DE-84
650		4	|a Computer Science - Computational Engineering; Finance; and Science |7 (dpeaa)DE-84
650		4	|a Mathematics - Optimization and Control |7 (dpeaa)DE-84
650		4	|a 000 |7 (dpeaa)DE-84
650		4	|a 510 |7 (dpeaa)DE-84
700	1		|a Costa, M. Fernanda P. |4 aut
700	1		|a Ferrás, L. L. |4 aut
773	0	8	|i Enthalten in |t arXiv.org |g (2024) vom: 05. März
773	1	8	|g year:2024 |g day:05 |g month:03
856	4	0	|u https://arxiv.org/abs/2403.02730 |z kostenfrei |3 Volltext
912			|a GBV_XCH
951			|a AR
952			|j 2024 |b 05 |c 03
