The data extraction using distributed crawler inside multi-agent system
Loading...
Downloads
5
Date issued
Authors
Tomala, Karel
Plucar, Jan
Dubec, Patrik
Rapant, Lukáš
Vozňák, Miroslav
Journal Title
Journal ISSN
Volume Title
Publisher
Vysoká škola báňská - Technická univerzita Ostrava
Location
Signature
Abstract
The paper discusses the use of web crawler technology. We created an application based on standard web crawler. Our application is determined for data extraction. Primarily, the application was designed to extract data using keywords from a social network Twitter. First, we created a standard crawler, which went through a predefined list of URLs and gradually download page content of each of the URLs. Page content was then parsed and important text and metadata were stored in a database. Recently, the application was modified in to the form of the multi-agent system. The system was developed in the C# language, which is used to create web applications and sites etc. Obtained data was evaluated graphically. The system was created within Indect project at the VSB-Technical University of Ostrava.
Description
Subject(s)
Citation
Advances in electrical and electronic engineering. 2013, vol. 11, no. 6, p. 455-460 : il.