The data extraction using distributed crawler inside multi-agent system

Authors

Tomala, Karel
Plucar, Jan
Dubec, Patrik
Rapant, Lukáš
Vozňák, Miroslav

Publisher

Vysoká škola báňská - Technická univerzita Ostrava

Abstract

The paper discusses the use of web crawler technology. We created an application, based on a standard web crawler, that is intended for data extraction. The application was designed primarily to extract data from the social network Twitter using keywords. First, we created a standard crawler that went through a predefined list of URLs and downloaded the page content of each URL in turn. The page content was then parsed, and the important text and metadata were stored in a database. More recently, the application was converted into a multi-agent system. The system was developed in C#, a language commonly used for web applications and sites. The obtained data were evaluated graphically. The system was created within the Indect project at the VSB-Technical University of Ostrava.
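
The standard crawler stage described in the abstract (walk a predefined URL list, download each page, parse it, store text and metadata) can be illustrated with a minimal sketch. It is written in Python rather than the authors' C#, and it is not the paper's implementation: the names `TextExtractor`, `fetch_http`, and `crawl`, the `pages` table layout, and the simple substring keyword match are all assumptions made for illustration. The multi-agent distribution and any Twitter-specific handling are not shown.

```python
import sqlite3
from html.parser import HTMLParser
from urllib.request import urlopen


class TextExtractor(HTMLParser):
    """Collects the page title and visible text, skipping script/style content."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.chunks = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text or self._skip_depth:
            return
        if self._in_title:
            self.title += text
        else:
            self.chunks.append(text)


def fetch_http(url):
    """Default downloader (hypothetical helper): returns the page body as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(urls, keywords, conn, fetch=fetch_http):
    """Visit each URL in order, keep pages whose text mentions a keyword.

    Returns the number of pages stored. The table layout is an assumption.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages "
        "(url TEXT PRIMARY KEY, title TEXT, body TEXT, matched TEXT)"
    )
    stored = 0
    for url in urls:
        try:
            html = fetch(url)
        except OSError:
            # Unreachable page: skip it and continue with the next URL.
            continue
        parser = TextExtractor()
        parser.feed(html)
        body = " ".join(parser.chunks)
        matched = [k for k in keywords if k.lower() in body.lower()]
        if not matched:
            continue
        conn.execute(
            "INSERT OR REPLACE INTO pages VALUES (?, ?, ?, ?)",
            (url, parser.title, body, ",".join(matched)),
        )
        stored += 1
    conn.commit()
    return stored
```

Passing the downloader as the `fetch` parameter keeps the parsing and storage steps independent of the network, which also suggests how the work could later be split between separate agents.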

Citation

Advances in electrical and electronic engineering. 2013, vol. 11, no. 6, p. 455-460 : il.