Dissemin is shutting down on January 1st, 2025

Published in

2015 IEEE International Conference on Multimedia Big Data

DOI: 10.1109/bigmm.2015.38

Links

Tools

Export citation

Search in Google Scholar

Geometric Consistent Tree Partitioning min-Hash for Large-scale Partial Duplicate Image Discovery

Journal article published in 2015 by Qian Zhang, Guoping Qiu ORCID
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

This paper introduces a novel method that effectively and efficiently encodes the spatial geometric information of bag of visual words (BoW) to boost the performance of large scale partial duplicate image discovery and clustering. The loose cyclic spatial verification (LCSV) technique projects the locations of BoWs onto the perimeter of a circle centred around their geometric centroid and encodes their geometric relations in a simple ordered sequence of scalar values. We then treat the problem of validating the geometric consistencies of the BoWs from two separate image patches as the longest common cyclic subsequence (LCCS) problem and solve it using dynamic programming. By embedding the LCSV technique in a modified tree partitioning min-Hash framework, we introduce a geometric consistent tree partitioning min-Hash (gcTmH) technique for partial duplicate image discovery and clustering. We show that gcTmH is invariant to rotation and scaling, robust against noisy conditions, and is able to handle multiple duplicate models. We show that gcTmH can boost the accuracy of partial duplicate image discovery by deleting the random and false matching image pairs in a very efficient way. We present experimental results on two datasets and show that our method can boost partial duplicate image discovery performances of state of the art techniques.