Artikel
Stochastic mixed model sequencing with multiple stations using reinforcement learning and probability quantiles
In this study, we propose a reinforcement learning (RL) approach for minimizing the number of work overload situations in the mixed model sequencing (MMS) problem with stochastic processing times. The learning environment simulates stochastic processing times and penalizes work overloads with negative rewards. To account for the stochastic component of the problem, we implement a state representation that specifies whether work overloads will occur if the processing times are equal to their respective 25%, 50%, and 75% probability quantiles. Thereby, the RL agent is guided toward minimizing the number of overload situations while being provided with statistical information about how fluctuations in processing times affect the solution quality. To the best of our knowledge, this study is the first to consider the stochastic problem variation with a minimization of overload situations.
- Sprache
-
Englisch
- Erschienen in
-
Journal: OR Spectrum ; ISSN: 1436-6304 ; Volume: 44 ; Year: 2021 ; Issue: 1 ; Pages: 29-56 ; Berlin, Heidelberg: Springer
Mixed model sequencing
Reinforcement learning
Metaheuristics
Combinatorial optimization
Lutz, Bernhard
Neumann, Dirk
- DOI
-
doi:10.1007/s00291-021-00652-x
- Letzte Aktualisierung
-
20.09.2024, 08:24 MESZ
Datenpartner
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.
Objekttyp
- Artikel
Beteiligte
- Brammer, Janis
- Lutz, Bernhard
- Neumann, Dirk
- Springer
Entstanden
- 2021