Research on Chinese Address Recognition Algorithms and Their Application in Hospitals
Abstract
Chinese address analysis is already widely used in local search services. It is also valuable in hospital management, particularly for analyzing where patients come from and for standardizing medical-record addresses for reporting. The technique converts an address written in natural language into information that a machine can recognize and locate. Methods based on regular-expression matching carry no semantics and recognize addresses poorly. Full-text maximum-similarity methods perform well, but they require substantial computing resources and complete reference data. Analyzing addresses with word segmentation and ideas drawn from named entity recognition yields a clear improvement in overall results.
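To make the segmentation idea concrete, here is a minimal sketch of dictionary-based address segmentation using forward maximum matching, with each matched piece labeled by its administrative level. It is not the paper's algorithm, which the abstract does not describe in detail. The gazetteer contents, the level names, and the function name `segment_address` are all hypothetical and chosen only for illustration.

```python
# Hypothetical toy gazetteer: place name -> address-element level.
# A real system would load a complete administrative-division dictionary.
GAZETTEER = {
    "北京市": "province_level",
    "海淀区": "district",
    "学院路": "road",
}
MAX_LEN = max(len(name) for name in GAZETTEER)


def segment_address(text):
    """Split an address with forward maximum matching.

    At each position the longest gazetteer entry is taken. Characters
    that match nothing are collected into a single "other" token.
    """
    tokens = []
    unmatched = ""
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for length in range(min(MAX_LEN, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if piece in GAZETTEER:
                if unmatched:
                    tokens.append((unmatched, "other"))
                    unmatched = ""
                tokens.append((piece, GAZETTEER[piece]))
                i += length
                break
        else:
            # No entry starts here: keep the character for later.
            unmatched += text[i]
            i += 1
    if unmatched:
        tokens.append((unmatched, "other"))
    return tokens


if __name__ == "__main__":
    for token, level in segment_address("北京市海淀区学院路29号"):
        print(token, level)
```

Unlike a single regular expression, this labels each element separately, so a partial address can still be resolved down to the deepest level that matched. A fuller treatment would also need to handle abbreviations, aliases, and ambiguous names, which this sketch ignores.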
