基于机器学习的机场低能见度短临预报研究

展开
  • 1. 成都信息工程大学大气科学学院,四川 成都 610225
    2. 气候变化风险与气象灾害防御江西省重点实验室,江西 南昌 330096
    3. 江西省气象台,江西 南昌 330096

网络出版日期: 2025-07-22

基金资助

四川省科技教育联合基金项目(2024NSFSC1983

Nowcasting of Airport Low Visibility Based on Machine Learning

Expand
  • 1. School of Atmospheric SciencesChengdu University of Information TechnologyChengdu 610225SichuanChina
    2. Jiangxi Provincial Key Laboratory of Climate Change Risk and Meteorological Disaster PreventionNanchang 330096JiangxiChina
    3. Jiangxi Meteorological ObservatoryNanchang 330096JiangxiChina

Online published: 2025-07-22

摘要

为减少低能见度造成的航班备降和返航,利用地面观测数据和 ECMWF 高空、地面数值预报产品,基于机器学习算法建立景德镇机场当前和未来1 h低能见度短临预报模型。通过对比检验发现XG‐BoostLightGBM机器学习算法在机场低能见度短临预报中优于SVM机器学习算法,特征筛选对XG‐BoostLightGBM机器学习算法的性能都有改善;通过SHAP方法解释LightGBM机器学习模型,分析各特征对模型输出的贡献。主要结论如下:(1LightGBM XGBoost建立的机器学习模型在机场低能见度预报方面表现良好,AUC可达0. 98,对于当前低能见度和未来1 h低能见度预报的F1_score最高可达 0. 92。(2)基于机器学习特征工程要求特征相互独立原理对特征进行清洗筛选,有利于提高 XG‐Boost算法模型对未来1 h低能见度的预报准确率,而经过特征筛选的LightGBM模型在预报当前和未来1 h低能见度时比没有特征筛选的LightGBM模型漏报率更低。对当前低能见度的预报,LightGBM_24_0h模型最优,对未来 1 h低能见度的预报,XGBoost_24_1h模型最优,且特征筛选对 XGBoost算法的性能提升更大。(3)分别使用分裂次数和SHAP值分析LightGBM算法模型的特征重要性,表明在不同特征重要性准则下,机场实测相对湿度、气温、风、海平面气压和ECMWF预报的1000 hPa相对湿度、925 hPa垂直速度和散度、850 hPa散度9个特征对机场低能见度的预报更重要,且散度作为机器学习模型的输入特征可以极大提高机器学习模型的性能。(4)基于SHAP值解释特征重要性时,排名前十的特征重要性累计占比80%,说明在以雾为主的景德镇机场低能见度短临预报中LightGBM模型能根据关键预报因子输出预测结果,且在预报未来 1 h低能见度是否持续时,可重点关注 850 hPa散度、1000 hPa相对湿度、机场海平面气压和风向的变化。

本文引用格式

殷齐娥, 倪长健, 肖 安 . 基于机器学习的机场低能见度短临预报研究[J]. 高原气象, 0 : 1 . DOI: 10.7522/j.issn.1000-0534.2025.00067

Abstract

In order to reduce the rate of flight diversion and return caused by low visibilitythis study has established a short-term and nowcasting model of low visibility using ground observation data and the upper-air and surface forecast data of the ECMWF at Jingdezhen Airport based on machine learning algorithms. Comparing the evaluation indicatorsthe results find that the XGBoost and LightGBM machine learning algorithms outperform the SVM machine learning algorithm in nowcasting of the airport low visibility. A detailed comparison of the evaluation metrics was conducted both before and after feature screening in the same machine learning algorithms. The study highlights that feature screening significantly boosts the effectiveness of both models. Further‐ morethe SHAPSHapley Additive exPlanationsmethod elucidates the contribution of each feature to the LightGBM model's output. The main conclusions are as follows:(1The machine learning models established by LightGBM and XGBoost perform well in airport low visibility forecastingwith the AUC reaching up to 0. 98and the F1_score for the prediction of current low visibility and the low visibility in the next one hour can reach up to 0. 92.2Data cleaning and feature screening is conducive to improving the prediction accuracy of the XGBoost algorithm for low visibility in the next houraccording to the principle that "feature engineering in machine learning requires features to be mutually independent". Moreover the LightGBM model with feature screening has a lower false negative rate than the LightGBM model without feature selection when forecasting the current and future one-hour low visibility. For the forecast of the current low visibilitythe LightGBM_24_0h model is the best. For the forecast of low visibility in the next one hourthe XGBoost_24_1h model is the best. And feature selection has a greater improvement on the performance of the XGBoost algorithm.3The splitting times and SHAP values are used respectively to analyze the feature importance of the LightGBM algorithm mod‐ el. It shows that under different feature importance criterianine featuresnamely the measured relative humidityair temperaturewindsea level pressure at the airportand the relative humidity at 1000 hPavertical velocity and divergence at 925 hPaand divergence at 850 hPa predicted by ECMWFare more important for the pre‐ diction of low visibility at the airport. And divergenceas an input feature of the machine learning modelcan greatly improve the performance of the machine learning model.4When explaining feature importance based on SHAP valuesthe cumulative proportion of the top ten feature importance accounts for 80%. This indicates that in the nowcasting of low visibility at Jingdezhen Airport where fog is the main factorthe LightGBM model can output prediction results according to key forecast factors. And when forecasting whether the low visibility in the next one hour will continuemore attention should be paid to the changes in 850 hPa divergence1000 hPa relative humidityairport sea level pressure and wind direction.

文章导航

/