用户名: 密码: 验证码:
大数据时代下高危污染源的预警研究
详细信息    查看官网全文
摘要
环境管理重在预防,未雨绸缪从源头治污,提前预测可能引起环境污染风险和产生违法行为的高危污染源,对更加有针对性地进行污染治理具有重要意义。文中研究在大数据架构下,运用R语言构建基于SMOTE算法的随机森林分类预警模型,圈出高危污染源;针对高危污染源的污染程度问题进行主成分分析,从而应用大数据手段确定污染源的综合排名。通过科学预测高危污染源排名名单,提前预知重点监察对象,重点监督排名较前的企业,监控其排污行为,避免超标排污、违法排污。论述应用大数据分析技术创新生态环境监管模式的重大影响,分析大数据在环境管理领域应用的重大意义。
Environmental management focuses on prevention,and it takes precautions from the source of pollution.High-risk sources predicted in advance,which is important to control pertinently.This paper researches under the big data architecture.The R language is used to construct an early-warning model of random forest classification which based on SMOTE algorithm so as to circle the high-risk pollution sources.The principal component analysis is proposed to solve the problem which aims at the pollution level of high risk pollution source,and the application of big data determines the comprehensive ranking of high risk pollution source.Ranking list of high risk pollution sources by scientific prediction,people can know the important monitoring objects,which focuses on monitoring the ranking of the former enterprise,so that environmental protection department can monitor their emissions behavior and avoid excessive sewage and illegal sewage.This paper discusses the significant impact of the application of big data analysis on the innovation of the ecological environment supervision model,and analyzes the significance of the application of big data in the field of environmental management.
引文
[1]吴班.程春明.生态环境大数据应用探析[J].环境保护,2016.(1).
    [2]杨霞.吴东伟.R语言在大数据处理中的应用[J].科技资讯.2013.(23):19-20.
    [3]Robert I,Kabacoff.R语言实践[M].北京:人民邮电出版社.2013.
    [4]李欣海.随机森林模型在分类与回归分析中的应用[J].应用昆虫学报.2013.50(4);1190-1197.
    [5]Gislason P O,Benediktsson J A.Sveinsson J R.Random Forests for Land Cover Classification[J].Pattern Recognition Letters,2006.
    [6]彭国兰,林成德.随机森林在企业信用评估指标体系确定中的应用[D].厦门大学,2007:24-25.
    [7]费宇.多元统计分析——基于R[M].北京:中国人民大学出版社,2014,77-80.
    [8]HanH,WangWY,Mao B H.Borderline-SMOTE;A new over-sampling method in unbalanced data sets learning[J].Advances in Intelligent Computing,Springer,2005:878-887.
    [9]Liaw A,Wiener M Classification and Regression by randomForest[J].R News,2(3),18-22.
    [10]Tianjiang Li,Qiang Du.Abstract principal component analysis[J].Science China(Mathematics),2013(12).

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700