高级检索

基于深度强化学习的多塔吊布局优化研究

Research on Multi-tower Crane Layout Optimization Based on Deep Reinforcement Learning

  • 摘要: 塔吊作为大型施工现场中关键的垂直运输设备,其布置方案的合理性直接影响施工效率、成本控制与作业安全。传统布置方法主要依赖人工经验或启发式算法,难以在多重约束条件下实现全局最优解。针对这一问题,该文提出一种基于深度强化学习的多塔吊协同布置方法,采用近端策略优化(PPO)算法构建智能体与仿真环境的交互机制,将塔吊布置过程建模为马尔可夫决策过程(MDP)。通过综合考虑塔吊覆盖率、设备成本及碰撞风险等因素,设计多目标奖励函数,引导智能体实现塔吊位置的自适应、快速优化配置。试验结果表明,该方法在三种典型仿真场景下均能生成高效、安全的布置方案,未发生任何塔吊碰撞,且作业覆盖率均超过98%,展现出优异的优化性能与稳定性。该研究为建筑施工中的塔吊调度与空间规划提供了新思路,具有显著的工程应用价值和广阔的推广前景。

     

    Abstract: Tower cranes are critical vertical transportation equipment in large-scale construction sites, and the rationality of their layout directly affects construction efficiency, cost control, and operational safety. Traditional layout methods rely heavily on manual experience or heuristic algorithms, making it difficult to achieve globally optimal solutions under multiple constraints. To address this issue, this paper proposes a deep reinforcement learning-based approach for cooperative multi-tower crane layout optimization. The Proximal Policy Optimization (PPO) algorithm is employed to establish an interaction mechanism between intelligent agents and a simulated environment, modeling the crane layout process as a Markov Decision Process (MDP). A multi-objective reward function is designed by integrating key factors such as crane coverage, equipment cost, and collision risk, guiding the agent to achieve adaptive and rapid optimal configuration of crane positions. Experimental results demonstrate that the proposed method generates high-quality, collision-free layout schemes across three typical simulation scenarios, with operational coverage exceeding 98% in all cases, indicating excellent optimization performance and robustness. This study provides a novel approach for crane scheduling and spatial planning in construction projects, offering significant practical value and broad prospects for application and promotion.

     

/

返回文章
返回