With the rapid growth of the Internet, adolescents have become a significant share of China's Internet users, and the accompanying problem of adolescent Internet addiction has drawn close attention from all sectors of society. Filtering objectionable Web pages is a major challenge in building a "green" (healthy) network environment. Most Web filtering systems work only at the URL level and do not filter by content, so once an offender changes the URL, the filter no longer works. This paper proposes a solution that combines natural language understanding with Web mining techniques and applies it to the design of a Web page filtering module, so as to achieve filtering at the Web content level.
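The gap between URL-level and content-level filtering can be illustrated with a minimal sketch. It is not the paper's method: the weighted-term score below is a deliberately simple stand-in for the natural language understanding and Web mining components, and the host names, term weights, threshold, and function names (`url_filter`, `content_filter`) are all hypothetical, chosen only for illustration.

```python
from urllib.parse import urlparse

# Hypothetical blocklist and term weights, invented for illustration only.
BLOCKED_HOSTS = {"bad.example.com"}
TERM_WEIGHTS = {"gambling": 2.0, "casino": 1.5, "violence": 1.0}


def url_filter(url: str) -> bool:
    """Return True if the page should be blocked based on its host alone."""
    return urlparse(url).hostname in BLOCKED_HOSTS


def content_filter(text: str, threshold: float = 2.5) -> bool:
    """Return True if the summed weight of flagged terms reaches the threshold."""
    words = text.lower().split()
    # Strip trailing punctuation so "bonuses." and "bonuses" match the same key.
    score = sum(TERM_WEIGHTS.get(word.strip(".,!?;:"), 0.0) for word in words)
    return score >= threshold


# The same page served from two addresses (assumed example data).
page_text = "Online casino with gambling bonuses."
old_url = "http://bad.example.com/index.html"
new_url = "http://mirror.example.net/index.html"

if __name__ == "__main__":
    print("URL filter, old address:", url_filter(old_url))
    print("URL filter, new address:", url_filter(new_url))
    print("Content filter, same text:", content_filter(page_text))
```

Under these assumptions, moving the page to a new host is enough to evade `url_filter`, while `content_filter` reaches the same decision wherever the text is served. A real system would replace the term-weight lookup with a proper text classifier.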