The entity based search engine
Abstract
Day by day, internet is growing extremely strong. Along with this development, it bring the better life and convenient for us. We can do anything at home, just sit on the couch, open the laptop or smartphone, and enjoy online shopping, studying, reading news…. It not necessary for us to go to the market to buy stuff, the product which we want purchase online will be delivered to our home. Thanks to the internet, we can work at home, the online meeting or remote control working will be easy to done by the internet.
However, over the past two decades, the Internet, search engines, and Web users have had to deal with unstructured data, which is essentially any data that has not been organized or classified according to any sort of pre-defined data model. Thus, search engines were able to identify patterns within webpages (keywords) but were not really able to attach meaning to those pages.
Hence, search engine optimization has been developing to serve exigency of modern life. Performance and the method of search engine lead to the development of many open source tools for indexing and querying a large amount of data.
This study will focus on researching and implementing an open source tool for search engine, understanding how it works and finding an effective method to improve the performance of searching and indexing big data which was processed from Wikipedia