Hybrid flow shop dynamic scheduling method based on improved random forest and reinforcement learning (IRF-RL)*

doi:10.16731/j.cnki.1671-3133.2024.11.004

Modern Manufacturing Engineering, 2024, Vol. 530, Issue 11: 26-36.

• Advanced Manufacturing System Management and Operation •

ZHANG Mengjie¹, YANG Xiaoying²,³, LI Bo²

1 Business School, Henan University of Science and Technology, Luoyang 471000, China;
2 School of Mechatronics Engineering, Henan University of Science and Technology, Luoyang 471003, China;
3 Henan Provincial Collaborative Innovation Center for Advanced Manufacturing of Machinery and Equipment, Luoyang 471003, China

Received: 2024-01-02 Online: 2024-11-18 Published: 2024-11-29
About the authors: ZHANG Mengjie, master's student; main research interests: intelligent algorithms and shop scheduling. E-mail: qwaszx42669287@163.com. YANG Xiaoying, doctoral supervisor; main research interests: industrial engineering and intelligent manufacturing. E-mail: lyyxy@haust.edu.cn. LI Bo, master's student; main research interest: intelligent optimization algorithms. E-mail: 18730316960@163.com
Supported by:
*National Key Research and Development Program of China (2018YFB1701205); enterprise-commissioned project (HX20221116)

Abstract

Abstract: To meet the production demands of hybrid flow shops, a two-stage dynamic scheduling method based on machine learning is proposed. In the offline mining stage, an improved random forest algorithm is applied to historical data to build a knowledge mapping network from the production state of the manufacturing system to the optimal scheduling rule. The valuable scheduling rules mined in this way are used for online decision-making, which skips the warm-up phase, improves scheduling efficiency, and in turn optimizes the scheduling scheme. In the online scheduling stage, a reinforcement learning algorithm analyzes and trains on real-time shop-floor state data and optimizes strategy selection as the system state changes dynamically, giving the method an adaptive and fast response to disturbance events. Simulation experiments verify that the two-stage dynamic scheduling method combining data mining and reinforcement learning is feasible and effective, and that it can make full use of manufacturing data while scheduling the manufacturing execution process online.

Key words: hybrid flow shop, dynamic scheduling, reinforcement learning, improved random forest, data-driven
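
The abstract gives no implementation details, so the following is only a minimal sketch of how an offline-mined rule prior could warm-start an online learner. Everything in it is an assumption made for illustration: the rule names (`SPT`, `EDD`, `FIFO`), the discretized state labels, the vote shares standing in for a random forest's output, the tabular Q-learning update, and the values of `alpha`, `gamma`, and `bonus`. It is not the authors' IRF-RL algorithm.

```python
import random
from collections import defaultdict

# Hypothetical dispatching rules; the abstract does not list the rule set.
RULES = ["SPT", "EDD", "FIFO"]


def warm_start_q(offline_votes, bonus=1.0):
    """Seed a Q-table from offline knowledge.

    offline_votes maps a discretized shop state to {rule: vote share}.
    The shares stand in for what an offline random forest might output
    (assumption); unseen states default to all-zero Q-values.
    """
    q = defaultdict(lambda: {r: 0.0 for r in RULES})
    for state, votes in offline_votes.items():
        for rule, share in votes.items():
            q[state][rule] = bonus * share
    return q


def choose_rule(q, state, epsilon, rng):
    """Epsilon-greedy selection of a dispatching rule for the current state."""
    if rng.random() < epsilon:
        return rng.choice(RULES)
    return max(RULES, key=lambda r: q[state][r])


def q_update(q, state, rule, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: Q <- Q + alpha * (target - Q)."""
    target = reward + gamma * max(q[next_state].values())
    q[state][rule] += alpha * (target - q[state][rule])
    return q[state][rule]


if __name__ == "__main__":
    rng = random.Random(0)
    # Made-up vote shares for one made-up state label.
    votes = {"high_load": {"SPT": 0.7, "EDD": 0.2, "FIFO": 0.1}}
    q = warm_start_q(votes)
    rule = choose_rule(q, "high_load", epsilon=0.0, rng=rng)
    q_update(q, "high_load", rule, reward=1.0, next_state="low_load")
    print(rule, round(q["high_load"][rule], 3))
```

Seeding the table means the learner starts from the offline preference instead of from zeros, which is one plausible reading of "skipping the warm-up phase"; the online updates then adjust those values as disturbances change the observed rewards.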

CLC number: TP18

ZHANG Mengjie, YANG Xiaoying, LI Bo. Hybrid flow shop dynamic scheduling method based on improved random forest and reinforcement learning[J]. Modern Manufacturing Engineering, 2024, 530(11): 26-36.
