Thesis: | Information Extraction for Vietnamese Real-Estate Advertisements : M.A. Thesis Information Technology : 60 48 01 |
Publisher: | ĐHCN |
Date: | 2012 |
Subject: | Information technology; Advertising; Real estate; Information extraction |
Description: | 51 p. + CD-ROM. M.A. Thesis. Computer Science -- University of Engineering and Technology, Vietnam National University, Hanoi, 2012. In recent years, the real-estate market in Vietnam has grown rapidly, generating a large volume of real-estate information, especially advertisements for the buying and selling of real estate. This creates a pressing need for an information extraction system to help users deal with the increasing number of real-estate advertisements on the Internet. We propose a rule-based approach to building an information extraction system for online real-estate advertisements in Vietnamese. At the same time, we set up a process for building an annotated corpus which can be used in machine learning approaches at a later stage. Our system achieves promising results, with F-measures above 90%. Our approach is particularly suitable for under-resourced languages where an annotated corpus of a decent size is not readily available. Electronic Resources |
Type: | text |
Format: | text/pdf |