A web application for administrative document digitization
Abstract
Document digitization is one of the emerging trends of digitization and no more a new concept in the information science field. The digitization of documents allows retrieving of information from a paper document. The document digitization process involves conversion processing of the obtained image to extract information. The digitized document can be directly applied to searching, sorting, and storage stage. As the digitization literature is surveyed, the process of extraction of digital string from scanned paper documents plays a very important role in the document digitization process.
In this thesis, we will research and create an implementation application for processing the administrative documents, which used for management storing and searching. This application will focus on processing and digitizing the document. The method consists of the following steps: Obtaining data by using a scanner to scan administrative documents. Processing obtained data to remove noise and picking up the information, the content of the document is necessary. The application picks up the data based on the structure of documents following the criteria of the Vietnam government.