Exploring temporal differences in 3D convolutional neural networks

Show simple item record

dc.contributor.author Kanojia, Gagan
dc.contributor.author Kumawat, Sudhakar
dc.contributor.author Raman, Shanmuganathan
dc.date.accessioned 2019-09-18T10:12:54Z
dc.date.available 2019-09-18T10:12:54Z
dc.date.issued 2019-09
dc.identifier.citation Kanojia, Gagan; Kumawat, Sudhakar and Raman, Shanmuganathan, "Exploring temporal differences in 3D convolutional neural networks", arXiv, Cornell University Library, DOI: arXiv:1909.03309, Sep. 2019. en_US
dc.identifier.uri https://arxiv.org/abs/1909.03309
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/4835
dc.description.abstract Traditional 3D convolutions are computationally expensive, memory intensive, and due to large number of parameters, they often tend to overfit. On the other hand, 2D CNNs are less computationally expensive and less memory intensive than 3D CNNs and have shown remarkable results in applications like image classification and object recognition. However, in previous works, it has been observed that they are inferior to 3D CNNs when applied on a spatio-temporal input. In this work, we propose a convolutional block which extracts the spatial information by performing a 2D convolution and extracts the temporal information by exploiting temporal differences, i.e., the change in the spatial information at different time instances, using simple operations of shift, subtract and add without utilizing any trainable parameters. The proposed convolutional block has same number of parameters as of a 2D convolution kernel of size nxn, i.e. n^2, and has n times lesser parameters than an nxnxn 3D convolution kernel. We show that the 3D CNNs perform better when the 3D convolution kernels are replaced by the proposed convolutional blocks. We evaluate the proposed convolutional block on UCF101 and ModelNet datasets.
dc.description.statementofresponsibility by Gagan Kanojia, Sudhakar Kumawat and Shanmuganathan Raman
dc.language.iso en_US en_US
dc.publisher Cornell University Library en_US
dc.subject Deep learning en_US
dc.subject 3D convolution neural networks en_US
dc.title Exploring temporal differences in 3D convolutional neural networks en_US
dc.type Preprint en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account