Distributed Content-Based Image Retrieval of Satellite Images on Hadoop

Tapan Sharma; Vinod Shokeen; Sunil Mathur

doi:10.2174/2666255813666191126095114

Abstract

Background: Owing to the rapid growth of satellite imagery, an architecture that identifies similar images quickly and efficiently has become crucial. Hadoop has become a de facto platform for storing large amounts of data, and Apache Spark and MapReduce have become key frameworks for distributed processing of big data.

Objective: This paper proposes a novel Distributed Content-Based Image Retrieval (DCBIR) architecture that leverages the strengths of these engines, which previous studies did not utilize.

Methods: Features of 40 satellite images, with sizes greater than 500 MB, were indexed on a 15-node Hadoop cluster using two different databases: Neo4J, a graph database, and HBase, a columnar database.

Results: The performance and scalability of both the indexing and query phases, along with precision and recall, were observed for both databases.

Conclusion: Experimental results show that the proposed system can efficiently perform image retrieval on large remote sensing images.

Keywords: Distributed computing, image retrieval, MapReduce, satellite images, Hadoop, Spark, Neo4J, HBase.
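
The abstract reports precision and recall but does not state how they were computed. As a minimal sketch, assuming the standard definitions (precision is the fraction of retrieved images that are relevant, recall is the fraction of relevant images that are retrieved), the two measures for a single query could be calculated as follows. The function name and the image identifiers are hypothetical and are not taken from the paper.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query, using the standard definitions.

    retrieved: identifiers returned by the retrieval system (hypothetical input)
    relevant:  identifiers judged relevant for the query (hypothetical input)
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)  # relevant images that were retrieved
    # Guard against empty sets to avoid division by zero.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


if __name__ == "__main__":
    # Illustrative identifiers only, not data from the study.
    retrieved = ["img_03", "img_07", "img_12", "img_21"]
    relevant = ["img_03", "img_12", "img_30"]
    p, r = precision_recall(retrieved, relevant)
    print(f"precision={p:.2f} recall={r:.2f}")
```

Here two of the four retrieved images are relevant, and two of the three relevant images are retrieved, so the sketch prints a precision of 0.50 and a recall of 0.67.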


DOI: https://dx.doi.org/10.2174/2666255813666191126095114
Print ISSN: 2666-2558
Online ISSN: 2666-2566
Publisher Name: Bentham Science Publisher

Recent Advances in Computer Science and Communications
