Structured Web Data Extraction: University Domain
- Abstract
In the Semantic Web, information is structured and therefore processable by machines. That vision, however, remains largely unrealized: the current web is still mostly a collection of unstructured documents. To find information, users rely on search engines such as Google to retrieve relevant documents, and then often have to read through those documents to locate the facts they need. As the volume of information on the web has exploded, this has become increasingly difficult. While Google aims to return the most relevant results, our goal is to return precise results that answer structured queries. To achieve this, we adopt an information extraction approach: we extract structured data from the unstructured web and organize it in a database that supports search. This thesis focuses on the implementation of a web information extraction system for the university domain.
- Rights Notes
Copyright © 2014 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.
- Date Created
- 2014
Items
Title | Date Uploaded | Visibility |
---|---|---|
li-structuredwebdataextractionuniversitydomain.pdf | 2023-05-04 | Public |