Can you find the object? Each example shows a query image on the left with the target instance marked in red, alongside four candidate gallery images on the right. One of these four images contains the exact same instance as retrieved by our MaO method (top-1 result from a gallery of 1,580 images). Can you spot which one? This interactive challenge demonstrates why finding these small instances in cluttered, real-world scenes is such a difficult task—objects appear at different scales, viewpoints, and contexts, truly making it a search for a needle in a haystack.
We address the challenge of Small Object Image Retrieval (SoIR), where the goal is to retrieve images containing a specific small object in a cluttered scene. The key challenge in this setting is constructing a single image descriptor, for scalable and efficient search, that effectively represents all objects in the image.
In this paper, we first analyze the limitations of existing methods on this challenging task and then introduce new benchmarks to support SoIR evaluation. Next, we introduce Multi-object Attention Optimization (MaO), a novel retrieval framework which incorporates a dedicated multi-object pre-training phase. This is followed by a refinement process that leverages attention-based feature extraction with object masks, integrating them into a single unified image descriptor.
Our MaO approach significantly outperforms existing retrieval methods and strong baselines, achieving notable improvements in both zero-shot and lightweight multi-object fine-tuning. We hope this work will lay the groundwork and inspire further research to enhance retrieval performance for this highly practical task.
Examples from our dataset containing multiple small objects. Each annotated object is shown with its relative size (with respect to the image dimensions) displayed above its bounding box.
@inproceedings{green2025findyourneedle,
author = {Green, Michael and Levy, Matan and Tzachor, Issar and Samuel, Dvir and Darshan, Nir and Ben-Ari, Rami},
title = {Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization},
booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
year = {2025}
}