Konferenzbeitrag

Web Mining Framework: Croatian Patents Case Study

Patents are one of the most valuable sources of technical and commercial knowledge. Although patents are public and can be easily searched on the Web, for most countries, there is no easy way to download bulk patent data. The purpose of this paper is to create a web mining framework used to extract patent data from the Croatian State Intellectual Property Office. Even though framework was created for the purposes of extracting Croatian patents, it can be reused for other web mining cases. The architecture of the proposed framework combines the use of web crawler and big data tools, in order to provide a complete and flexible solution for building general-purpose web mining application. The biggest limitation of this framework comes from the programming knowledge required to implement it. Therefore, this framework is only available to the small number of researchers. Data extracted with web mining methods is only as good as the algorithm used to extract data. Nevertheless, the data from official sources should always be preferred to the one retrieved using web mining methods.

Language
Englisch

Bibliographic citation
In: Proceedings of the ENTRENOVA - ENTerprise REsearch InNOVAtion Conference, Rovinj, Croatia, 8-9 September 2016 ; Year: 2016 ; Pages: 478-483 ; Zagreb: IRENET - Society for Advancing Innovation and Research in Economy

Classification
Wirtschaft
Information and Internet Services; Computer Software
Intellectual Property and Intellectual Capital
Subject
web mining
data mining
crawling
patent mining

Event
Geistige Schöpfung
(who)
Popović, Goran
Event
Veröffentlichung
(who)
IRENET - Society for Advancing Innovation and Research in Economy
(where)
Zagreb
(when)
2016

Handle
Last update
10.03.2025, 11:44 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Popović, Goran
  • IRENET - Society for Advancing Innovation and Research in Economy

Time of origin

  • 2016

Other Objects (12)