Modern Manufacturing Engineering ›› 2025, Vol. 538 ›› Issue (7): 20-30.doi: 10.16731/j.cnki.1671-3133.2025.07.003

Previous Articles     Next Articles

Research on reinforcement learning decision algorithm for multi-objective dynamic job shop scheduling

ZHANG Ningning1, WAN Weibing1, ZHANG Mengxiao1, ZHAO Yuming2   

  1. 1 School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China;
    2 The Department of Automation,Shanghai Jiao Tong University, Shanghai 201100, China
  • Received:2024-06-11 Online:2025-07-18 Published:2025-08-04

Abstract: To address the multi-objective dynamic job shop scheduling problem and meet the real-time scheduling needs of manufacturing workshops in environments with variable scales, a method combining Proximal Policy Optimization (PPO) with GoogLeNet, named GLN-PPO, is proposed. This method constructs the state space of the scheduling problem using multidimensional matrices, designs an action space based on various priority rules, and devises a multi-objective reward function. To verify the effectiveness of the proposed algorithm, it is trained and tested in three environments: a static public environment based on common benchmark problems, a static real environment based on actual cases, and a dynamic real environment. Experimental results show that compared to genetic algorithms, GLN-PPO can provide high-quality scheduling results, meet the real-time scheduling requirements of enterprises, and adapt flexibly to environments with variable scales.

Key words: deep reinforcement learning, job shop scheduling, GoogLeNet, Proximal Policy Optimization (PPO)

CLC Number: 

Copyright © Modern Manufacturing Engineering, All Rights Reserved.
Tel: 010-67126028 E-mail: 2645173083@qq.com
Powered by Beijing Magtech Co. Ltd