Abstract
Background: The proposed work builds on two techniques: (i) the Local Binary Pattern (LBP) approach and (ii) the Kirsch compass mask.
Texture classification plays a vital role in discriminating objects in an image, and LBP is widely used for it. Filtering-based methods, the co-occurrence matrix method, and others have also been used, but LBP is preferred for its computational efficiency and its invariance to monotonic grey-level changes. Edges likewise play a vital role in discriminating objects visually, so the Kirsch compass mask is applied to obtain the maximum edge strength over 8 compass directions. Compared with other compass masks, it has the advantage that the mask can be changed according to the user's own requirements.
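The two background techniques can be illustrated on a single 3x3 patch. The following is a minimal sketch, not the authors' implementation: it assumes the basic 8-neighbour LBP operator and the standard Kirsch coefficients (5 and -3), and the function names `lbp_code`, `kirsch_masks`, and `kirsch_edge_strength` are hypothetical.

```python
# Illustrative sketch only; names and the 3x3 patch interface are assumptions.

# Offsets of the 8 neighbours, clockwise from the top-left pixel.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
              (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(patch):
    """Basic 8-neighbour LBP code for the centre pixel of a 3x3 patch."""
    centre = patch[1][1]
    code = 0
    for bit, (dr, dc) in enumerate(NEIGHBOURS):
        # A neighbour contributes a 1-bit when it is at least as bright
        # as the centre; only the ordering of grey levels matters.
        if patch[1 + dr][1 + dc] >= centre:
            code |= 1 << bit
    return code


def kirsch_masks():
    """Build the 8 Kirsch masks by rotating the border in 45-degree steps."""
    border = [5, 5, 5, -3, -3, -3, -3, -3]  # first mask, clockwise from top-left
    masks = []
    for shift in range(8):
        rotated = border[-shift:] + border[:-shift] if shift else border[:]
        mask = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
        for value, (dr, dc) in zip(rotated, NEIGHBOURS):
            mask[1 + dr][1 + dc] = value
        masks.append(mask)
    return masks


def kirsch_edge_strength(patch):
    """Maximum response over the 8 masks at the centre of a 3x3 patch.

    The masks are applied without flipping (correlation). Because the set
    of 8 masks is closed under 180-degree rotation, the maximum is the
    same as for true convolution.
    """
    return max(
        sum(mask[r][c] * patch[r][c] for r in range(3) for c in range(3))
        for mask in kirsch_masks()
    )
```

Because `lbp_code` depends only on whether each neighbour is at least as bright as the centre, any strictly increasing grey-level transform (for example, doubling every pixel and adding a constant) leaves the code unchanged. Changing the `border` coefficients is one way a user could adapt the compass masks to their own requirements.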
Objective: The objective of our work was to extract better features and model a classifier for the Multimedia Event Detection task.
Methods: Feature extraction in the proposed work consists of two steps. First, an LBP-based approach is used for object discrimination; then, convolution with the Kirsch compass mask is used to determine object magnitude. Eigenvalue decomposition is adopted for feature representation. Finally, a classifier is modelled using a chi-square kernel for the event classification task.
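The abstract does not state which form of the chi-square kernel is used. As a hedged illustration, the sketch below assumes the common exponential form, k(x, y) = exp(-gamma * sum_i (x_i - y_i)^2 / (x_i + y_i)), applied to non-negative feature vectors; the function name `chi_square_kernel` and the parameter `gamma` are assumptions, not taken from the paper.

```python
import math


def chi_square_kernel(x, y, gamma=1.0):
    """Exponential chi-square kernel between two non-negative vectors.

    Assumed form, for illustration only:
        k(x, y) = exp(-gamma * sum_i (x_i - y_i)^2 / (x_i + y_i))
    """
    distance = 0.0
    for a, b in zip(x, y):
        # Skip bins that are empty in both vectors to avoid dividing by zero.
        if a + b > 0:
            distance += (a - b) ** 2 / (a + b)
    return math.exp(-gamma * distance)
```

Identical vectors give a kernel value of 1, and the value decreases towards 0 as the vectors diverge, which is why this kernel is often paired with histogram-like features in kernel classifiers.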
Result: The proposed event detection work is evaluated on the Columbia Consumer Video (CCV) dataset, which contains 20 event-based video categories. The proposed work is compared with existing works using mean Average Precision (mAP). Several experiments were carried out to evaluate our work: LBP vs. non-LBP approach, Kirsch vs. Robinson compass mask, an angle-wise analysis of the Kirsch masks, and a comparison of the above approaches using the modelled classifier. Two approaches are used to compare the proposed work with existing works: (i) Non-Clustered Events, in which events are considered individually and a one-vs.-one strategy is followed, and (ii) Clustered Events, in which some events are clustered and handled with a one-vs.-all strategy while the remaining events are left non-clustered.
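For reference, mAP can be computed as the mean over events of each event's average precision (AP). The sketch below assumes the usual non-interpolated AP over a ranked list of binary relevance labels; the exact evaluation protocol used for CCV may differ, and the function names are hypothetical.

```python
def average_precision(ranked_labels):
    """Non-interpolated AP for one event.

    ranked_labels holds 1 (relevant) or 0 (not relevant) for each video,
    ordered from the highest to the lowest classifier score.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(ranked_labels, start=1):
        if is_relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(per_event_rankings):
    """Mean of the per-event AP values."""
    aps = [average_precision(labels) for labels in per_event_rankings]
    return sum(aps) / len(aps)
```

For example, the ranking `[1, 0, 1, 0]` has relevant videos at ranks 1 and 3, so its AP is (1/1 + 2/3) / 2 = 5/6.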
Conclusion: The proposed work describes a method for event detection. Feature extraction is performed using an LBP-based approach and convolution with the Kirsch compass mask. For event detection, a classifier model is generated using the chi-square kernel. The accuracy of event classification is further increased using the clustered events approach. The proposed work is compared with various state-of-the-art methods, and the comparison shows that it achieves outstanding performance.
Keywords: Multimedia event detection, convolution, Kirsch compass mask, local binary pattern, keyframe extraction, feature extraction, Columbia Consumer Video.