First steps toward CNN based source classification of document images shared over messaging app

dc.contributor.author Joshi, Sharad
dc.contributor.author Saxena, Suraj
dc.contributor.author Khanna, Nitin
dc.date.accessioned 2018-08-28T07:53:57Z
dc.date.available 2018-08-28T07:53:57Z
dc.date.issued 2018-08
dc.identifier.citation Joshi, Sharad; Saxena, Suraj and Khanna, Nitin, First steps toward CNN based source classification of document images shared over messaging app, arXiv, Cornell University Library, DOI: arXiv:1808.05941, Aug. 2018. en_US
dc.identifier.uri https://arxiv.org/abs/1808.05941
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/3888
dc.description.abstract Knowledge of the source smartphone corresponding to a document image can be helpful in a variety of applications, including copyright infringement, ownership attribution, leak identification, and usage restriction. In this letter, we investigate a convolutional neural network-based approach to solve the source smartphone identification problem for printed text documents that have been captured by smartphone cameras and shared over a messaging platform. In the absence of any publicly available dataset addressing this problem, we introduce a new image dataset consisting of 315 images of documents printed in three different fonts, captured using 21 smartphones and shared over WhatsApp. Experiments conducted on this dataset demonstrate that, in all scenarios, the proposed system performs as well as or better than the state-of-the-art system based on handcrafted features and classification of letters extracted from document images. The new dataset and code of the proposed system will be made publicly available along with this letter's publication; at present, they are submitted for review.
dc.description.statementofresponsibility by Sharad Joshi, Suraj Saxena and Nitin Khanna
dc.language.iso en en_US
dc.publisher Cornell University Library en_US
dc.subject Multimedia en_US
dc.subject Computer Vision en_US
dc.subject Pattern Recognition en_US
dc.title First steps toward CNN based source classification of document images shared over messaging app en_US
dc.type Preprint en_US

