Hybrid-Line Remanufacturing Process Optimization for Multi-Type Factories with Twin Delayed Deep Deterministic Policy Gradient Algorithm

Authors

  • Jinlei Gu Author
  • Yujie Feng Author
  • Vlad Veksler Author

Keywords:

Multi-objective optimization, reinforcement learning, twin delayed deep deterministic policy gradient algorithm, remanufacturing process

Abstract

In response to the growing complexity of remanufacturing systems, this work investigates a novel Multi-objective Hybrid-line Multi-type Factory Remanufacturing Optimization Problem. The proposed model takes into account the disassembly technologies used by different factories, the selection of heterogeneous disassembly lines, and task-related constraints such as precedence and conflicts. The goal is to assign end-of-life products to appropriate disassembly factories and schedule tasks on optimal lines to achieve high scalability and efficiency in large-scale dynamic environments. To solve this problem, we formulate a multi-objective mixed-integer programming model that simultaneously maximizes overall profit and minimizes factory cycle time. The model is validated using a commercial solver to ensure feasibility and correctness. Due to the dynamic and sequential nature of the problem, we also employ the Twin Delayed Deep Deterministic Policy Gradient Algorithm (TD3), TD3 for short,  to learn optimal strategies through interaction with the environment. Experimental studies in various benchmark cases show that TD3 significantly outperforms baseline reinforcement learning algorithms such as Deep Deterministic Policy Gradient (DDPG), Soft Actor-Critic (SAC), and Advantage Actor-Critic (A2C) in both convergence stability and solution quality. TD3 also demonstrates superior capability in approximating Pareto-optimal solutions, which makes it suitable for real-world remanufacturing scenarios.

Downloads

Download data is not yet available.

Downloads

Published

2026-04-06

How to Cite

[1]
J. Gu, Yujie Feng, and Vlad Veksler, “Hybrid-Line Remanufacturing Process Optimization for Multi-Type Factories with Twin Delayed Deep Deterministic Policy Gradient Algorithm”, IJAIGM, vol. 2, no. 1, Apr. 2026, Accessed: Apr. 16, 2026. [Online]. Available: https://hopeembark.org/index.php/IJGMAI/article/view/75

Similar Articles

1-10 of 22

You may also start an advanced similarity search for this article.