基于强化学习的多智能体作业车间调度问题求解方法

A reinforcement learning-based approach for solving multi-agent job shop scheduling problem

International Journal of Production Research · 2024

被引 12

ABS 3

Zhuoran Dong
Tao Ren 通讯
Fang Qi
Jiacheng Weng
Danyu Bai 通讯
Jie Yang

中文导读

将晶圆加工建模为多智能体作业车间调度问题，提出两种深度强化学习方法（GTN和GTN-DABC）分别应对动态和静态调度场景，实验表明优于现有启发式算法。

Abstract

Wafer processing is the most expensive, time-consuming, and complex stage in semiconductor manufacturing. It varies significantly based on orders of customers (agents). Optimising the wafer processing flow in a multi-agent scenario can meet customised requirements, speed up delivery, and reduce costs. This work models wafer processing as a multi-agent job shop scheduling problem (MAJSP) with release dates. The objective is to minimise the total weighted makespan of agents. To address both dynamic and static scheduling scenarios in the MAJSP context, two deep reinforcement learning-based (DRL) methods are proposed. In a dynamic scheduling scenario, the statuses of orders and production resources can change at any moment. A DRL method called Graph Transformer Network (GTN) is proposed to rapidly generate high-quality solutions. In a static scheduling scenario, the production plan can be formulated based on predetermined demand and resource conditions. A novel hybrid method (GTN-DABC) that combines GTN with the discrete artificial bee colony algorithm (DABC) is proposed to provide high-quality production plans for manufacturers within an acceptable computation time. Experimental results demonstrate that the proposed GTN outperforms existing heuristics, and the well-designed GTN-DABC is more competitive than other meta-heuristics.

半导体制造作业车间调度强化学习多智能体系统生产调度优化

阅读原文 ↗