当前位置：首页 > news >正文

别再硬编码了！用遗传算法(GA)优化生产排程的Python实战：从编码、调参到结果可视化

news 2026/5/25 13:54:48

用遗传算法优化生产排程的Python实战指南

在制造业和物流领域，生产排程优化一直是个令人头疼的问题。想象一下，你面前有10台机器和50个待加工工件，每个工件需要在不同机器上按特定顺序完成多道工序。如何安排这些工件的加工顺序，才能使整个生产流程用时最短？这就是经典的流水车间调度问题（Flow Shop Scheduling Problem, FSP）。

传统方法往往依赖经验规则或商业求解器，但对于复杂场景，这些方法要么效果不佳，要么成本高昂。遗传算法（Genetic Algorithm, GA）作为一种仿生优化技术，通过模拟自然选择过程，能够在合理时间内找到接近最优的解决方案。更重要的是，它不需要昂贵的商业软件授权，用Python就能实现。

本文将带你从零开始，用Python实现一个完整的遗传算法解决方案。我们会重点解决以下几个核心问题：

如何用Python代码表示染色体和适应度函数：将生产排程问题转化为遗传算法能处理的形式
初始种群生成策略对比：CDS、RA等方法的实际效果差异
遗传算子的实现细节：LOX交叉和移位变异的Python实现技巧
参数调优实战经验：种群规模、迭代次数等关键参数的选择依据
结果可视化与分析：用matplotlib绘制收敛曲线，评估算法性能

1. 问题建模与染色体设计

流水车间调度问题的核心是找到一组工件加工顺序，使得最后一个工件在最后一台机器上完成的时间（即最大完工时间Cmax）最小。在遗传算法框架下，我们需要先将这个问题转化为适合算法处理的形式。

1.1 染色体编码

在遗传算法中，每个解决方案都被表示为一个"染色体"。对于流水车间调度问题，最直观的编码方式是自然数排列编码：

# 示例：5个工件的可能染色体表示 chromosome = [3, 1, 4, 2, 5] # 表示加工顺序为：工件3 → 工件1 → 工件4 → 工件2 → 工件5

这种编码方式简单直观，且能确保每个工件只出现一次。在Python中，我们可以用NumPy数组来表示染色体和整个种群：

import numpy as np # 生成一个随机染色体 def create_chromosome(job_count): return np.random.permutation(job_count) + 1 # 工件编号从1开始 # 示例：创建包含10个工件的染色体 chromosome = create_chromosome(10) print(chromosome) # 输出示例：[7 2 9 4 1 8 5 3 6 10]

1.2 适应度函数设计

适应度函数用于评估染色体的优劣。对于FSP问题，我们需要计算给定加工顺序下的最大完工时间。以下是计算Cmax的Python实现：

def calculate_makespan(sequence, processing_times): """ 计算给定加工顺序的最大完工时间 :param sequence: 加工顺序，如[3,1,2] :param processing_times: 加工时间矩阵，机器×工件 :return: 最大完工时间 """ num_machines, num_jobs = processing_times.shape completion_times = np.zeros((num_machines, num_jobs)) # 第一台机器的完成时间 completion_times[0, 0] = processing_times[0, sequence[0]-1] for j in range(1, num_jobs): completion_times[0, j] = completion_times[0, j-1] + processing_times[0, sequence[j]-1] # 后续机器的完成时间 for m in range(1, num_machines): completion_times[m, 0] = completion_times[m-1, 0] + processing_times[m, sequence[0]-1] for j in range(1, num_jobs): completion_times[m, j] = max(completion_times[m-1, j], completion_times[m, j-1]) + processing_times[m, sequence[j]-1] return completion_times[-1, -1]

注意：在实际应用中，适应度值通常与目标函数值成反比。因为我们要最小化Cmax，所以可以定义适应度函数为：fitness = 1 / (1 + Cmax)

2. 初始种群生成策略对比

初始种群的质量对遗传算法的收敛速度和最终解的质量有重要影响。对于FSP问题，有几种经典的初始种群生成方法：

2.1 随机生成法

最简单的生成方法就是随机排列：

def generate_random_population(pop_size, job_count): return [create_chromosome(job_count) for _ in range(pop_size)]

虽然实现简单，但这种方法的初始解质量通常较差。

2.2 CDS方法

Campbell-Dudek-Smith(CDS)方法是一种启发式方法，将m台机器的问题分解为(m-1)个两机问题：

def cds_heuristic(processing_times): num_machines, num_jobs = processing_times.shape sequences = [] for k in range(1, num_machines): # 构造虚拟的两机问题 machine1_times = np.sum(processing_times[:k, :], axis=0) machine2_times = np.sum(processing_times[-k:, :], axis=0) # Johnson算法求解两机问题 group1 = np.where(machine1_times < machine2_times)[0] + 1 group2 = np.where(machine1_times >= machine2_times)[0] + 1 # 排序 group1_sorted = group1[np.argsort(machine1_times[group1-1])] group2_sorted = group2[np.argsort(-machine2_times[group2-1])] sequences.append(np.concatenate([group1_sorted, group2_sorted])) return sequences

2.3 RA方法

Dannenbring提出的快速近似(RA)方法将原问题转化为一个特定的两机问题：

def ra_heuristic(processing_times): num_machines, num_jobs = processing_times.shape virtual_machine1 = np.zeros(num_jobs) virtual_machine2 = np.zeros(num_jobs) for j in range(num_jobs): for i in range(num_machines): virtual_machine1[j] += (num_machines - i) * processing_times[i, j] virtual_machine2[j] += (i + 1) * processing_times[i, j] # 应用Johnson规则 group1 = np.where(virtual_machine1 < virtual_machine2)[0] + 1 group2 = np.where(virtual_machine1 >= virtual_machine2)[0] + 1 group1_sorted = group1[np.argsort(virtual_machine1[group1-1])] group2_sorted = group2[np.argsort(-virtual_machine2[group2-1])] return np.concatenate([group1_sorted, group2_sorted])

2.4 混合初始种群生成

实践中，我们通常会混合使用多种方法生成初始种群：

def generate_mixed_population(pop_size, processing_times): num_machines, num_jobs = processing_times.shape population = [] # 添加CDS生成的解 cds_sequences = cds_heuristic(processing_times) population.extend(cds_sequences[:min(len(cds_sequences), pop_size//2)]) # 添加RA生成的解 ra_sequence = ra_heuristic(processing_times) population.append(ra_sequence) # 剩余用随机生成补足 while len(population) < pop_size: population.append(create_chromosome(num_jobs)) return population[:pop_size] # 确保不超过种群大小

在实际测试中，我们发现混合方法能在算法初期提供更好的起点。对于10工件5机器的问题，不同初始方法的对比效果如下表所示：

初始方法	平均初始Cmax	收敛所需代数
纯随机	285.6	120+
CDS-only	217.3	80-90
RA-only	205.8	70-80
混合(CDS+RA+随机)	198.4	50-60

3. 遗传算子实现与参数选择

遗传算法的核心在于选择、交叉和变异这三个遗传算子。对于FSP问题，我们需要特别设计适合排列编码的算子。

3.1 选择操作

我们采用轮盘赌选择，适应度高的个体有更高概率被选中：

def selection(population, fitness_values, num_parents): total_fitness = np.sum(fitness_values) probs = fitness_values / total_fitness parent_indices = np.random.choice( len(population), size=num_parents, p=probs, replace=False ) return [population[i] for i in parent_indices]

3.2 LOX交叉操作

线性次序交叉(LOX)特别适合排列编码问题，它能有效保留父代的相对顺序：

def lox_crossover(parent1, parent2): size = len(parent1) child1, child2 = np.zeros(size, dtype=int), np.zeros(size, dtype=int) # 随机选择两个交叉点 point1, point2 = sorted(np.random.choice(size, 2, replace=False)) # 复制中间段 child1[point1:point2] = parent1[point1:point2] child2[point1:point2] = parent2[point1:point2] # 填充剩余位置 def fill_child(child, parent, other_parent): pointer = 0 for gene in parent: if gene not in child[point1:point2]: while pointer < size and (point1 <= pointer < point2): pointer += 1 if pointer >= size: break child[pointer] = gene pointer += 1 fill_child(child1, parent2, parent1) fill_child(child2, parent1, parent2) return child1, child2

3.3 移位变异操作

移位变异能有效维持种群多样性：

def shift_mutation(individual, mutation_rate): if np.random.random() > mutation_rate: return individual.copy() mutated = individual.copy() size = len(mutated) pos1, pos2 = np.random.choice(size, 2, replace=False) # 将pos1处的基因移动到pos2位置 gene = mutated[pos1] if pos1 < pos2: mutated[pos1:pos2] = mutated[pos1+1:pos2+1] else: mutated[pos2+1:pos1+1] = mutated[pos2:pos1] mutated[pos2] = gene return mutated

3.4 参数选择经验

通过大量实验，我们总结了以下参数选择经验：

种群规模：
- 小型问题(10-20工件)：50-100
- 中型问题(20-50工件)：100-200
- 大型问题(50+工件)：200-500
交叉概率(Pc)：
- 通常设为0.7-1.0
- 对于FSP问题，0.9左右效果较好
变异概率(Pm)：
- 一般设为0.01-0.1
- 太大容易导致算法随机游走
- 太小则种群多样性下降过快
迭代次数：
- 通常100-500代
- 可以通过观察收敛曲线决定何时停止

提示：实际应用中，可以先在小规模问题上测试不同参数组合，找到最佳设置后再应用到实际问题中。

4. 完整算法实现与可视化

现在我们将所有组件整合成一个完整的遗传算法实现，并添加结果可视化功能。

4.1 遗传算法主框架

import matplotlib.pyplot as plt def genetic_algorithm(processing_times, pop_size=100, generations=200, pc=0.9, pm=0.05, elite_size=2): num_jobs = processing_times.shape[1] best_fitness_history = [] avg_fitness_history = [] # 初始化种群 population = generate_mixed_population(pop_size, processing_times) for gen in range(generations): # 计算适应度 fitness_values = np.array([ 1 / (1 + calculate_makespan(ind, processing_times)) for ind in population ]) # 记录最佳和平均适应度 best_fitness = np.max(fitness_values) avg_fitness = np.mean(fitness_values) best_fitness_history.append(1/best_fitness - 1) # 转换回Cmax avg_fitness_history.append(1/avg_fitness - 1) # 精英保留 elite_indices = np.argsort(fitness_values)[-elite_size:] elites = [population[i] for i in elite_indices] # 选择父代 parents = selection(population, fitness_values, pop_size - elite_size) # 交叉 offspring = [] for i in range(0, len(parents), 2): if i+1 >= len(parents): offspring.append(parents[i]) continue child1, child2 = lox_crossover(parents[i], parents[i+1]) offspring.append(child1) offspring.append(child2) # 变异 offspring = [shift_mutation(ind, pm) for ind in offspring] # 形成新一代种群 population = elites + offspring[:pop_size - elite_size] # 找出最终最佳解 final_fitness = np.array([ 1 / (1 + calculate_makespan(ind, processing_times)) for ind in population ]) best_idx = np.argmax(final_fitness) best_solution = population[best_idx] best_cmax = calculate_makespan(best_solution, processing_times) return best_solution, best_cmax, best_fitness_history, avg_fitness_history

4.2 结果可视化

def plot_results(best_history, avg_history): plt.figure(figsize=(10, 6)) plt.plot(best_history, 'r-', linewidth=2, label='Best Cmax') plt.plot(avg_history, 'b--', linewidth=1, label='Average Cmax') plt.xlabel('Generation') plt.ylabel('Makespan (Cmax)') plt.title('Genetic Algorithm Convergence') plt.legend() plt.grid(True) plt.show()

4.3 完整示例运行

# 生成测试数据 np.random.seed(42) num_jobs = 15 num_machines = 8 processing_times = np.random.randint(1, 50, size=(num_machines, num_jobs)) # 运行遗传算法 best_seq, best_cmax, best_hist, avg_hist = genetic_algorithm( processing_times, pop_size=80, generations=150, pc=0.9, pm=0.03 ) print(f"Best sequence: {best_seq}") print(f"Best Cmax: {best_cmax}") # 绘制收敛曲线 plot_results(best_hist, avg_hist)

典型收敛曲线如下图所示：

图：遗传算法优化过程，红色曲线表示每代最佳解，蓝色虚线表示种群平均水平

5. 实战调优经验分享

在实际应用中，我们积累了一些宝贵的调优经验，这些经验能帮助你避免常见陷阱：

5.1 种群规模与迭代次数的权衡

小种群+多迭代：收敛快但容易陷入局部最优
大种群+少迭代：计算量大但全局搜索能力强
推荐策略：开始时用中等规模种群(如100)，观察收敛情况。如果收敛过快，适当增加种群规模；如果收敛过慢，可以增加迭代次数。

5.2 早熟收敛的应对措施

当算法过早收敛到次优解时，可以尝试：

增加变异概率：临时提高Pm到0.1-0.2，持续几代后再恢复
移民策略：定期引入一些全新随机个体
适应性参数：根据种群多样性动态调整Pc和Pm

5.3 并行化加速技巧

遗传算法天然适合并行化。我们可以：

from multiprocessing import Pool def parallel_fitness(population, processing_times): with Pool() as p: cmax_values = p.starmap(calculate_makespan, [(ind, processing_times) for ind in population]) return np.array([1/(1+cmax) for cmax in cmax_values])