Davide Talon
GANzzle: Reframing jigsaw puzzle solving as a retrieval task using generative mental images
Talon, Davide; Del Bue, Alessio; James, Stuart
Abstract
Jigsaw puzzle solving is a combinatorial challenge because adjacent pieces are difficult to match. Instead of matching pieces against each other, we infer a mental image from all pieces, against which any given piece can then be matched, avoiding the combinatorial explosion. Exploiting advances in Generative Adversarial methods, we learn to reconstruct the image from a set of unordered pieces, allowing the model to learn a joint embedding space that matches an encoding of each piece to the cropped layer of the generator. We therefore frame the problem as an R@1 retrieval task and then solve the linear assignment using differentiable Hungarian attention, making the process end-to-end. As a result, our model is agnostic to puzzle size, in contrast to prior deep learning methods, which handle a single size. We evaluate on two new large-scale datasets, where our model is on par with deep learning methods while generalizing to multiple puzzle sizes.
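The retrieval-then-assignment idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the paper uses a learned embedding and differentiable Hungarian attention, whereas the code below uses the classical (non-differentiable) Hungarian algorithm on a hand-written similarity matrix. The names `similarity`, `greedy_retrieval`, and `hungarian` and all numeric values are assumptions made for illustration. It shows why per-piece top-1 retrieval alone can place two pieces in the same slot, and how a linear assignment resolves the conflict.

```python
# Illustrative sketch only: classical Hungarian algorithm, not the paper's
# differentiable Hungarian attention. All values below are made up.

def greedy_retrieval(similarity):
    """Top-1 retrieval: each piece independently picks its most similar slot."""
    return [max(range(len(row)), key=row.__getitem__) for row in similarity]

def hungarian(cost):
    """Minimum-cost one-to-one assignment for a square cost matrix.

    Returns a list where entry i is the column (slot) assigned to row (piece) i.
    Uses the standard O(n^3) potentials formulation with 1-based internals.
    """
    n = len(cost)
    inf = float("inf")
    u = [0.0] * (n + 1)      # row potentials
    v = [0.0] * (n + 1)      # column potentials
    p = [0] * (n + 1)        # p[j] = row currently matched to column j
    way = [0] * (n + 1)      # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (n + 1)
        used = [False] * (n + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, n + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(n + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        while j0 != 0:       # flip the matching along the augmenting path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    assignment = [0] * n
    for j in range(1, n + 1):
        assignment[p[j] - 1] = j - 1
    return assignment

# Hypothetical similarities between 4 piece encodings (rows) and the
# 4 slots of a 2x2 "mental image" (columns); higher means more similar.
similarity = [
    [0.90, 0.80, 0.10, 0.00],
    [0.85, 0.30, 0.20, 0.10],
    [0.10, 0.20, 0.70, 0.60],
    [0.00, 0.10, 0.65, 0.20],
]

greedy = greedy_retrieval(similarity)
# Maximizing total similarity equals minimizing the negated similarities.
assignment = hungarian([[-s for s in row] for row in similarity])

print("greedy top-1 slots:", greedy)      # pieces may collide on a slot
print("one-to-one assignment:", assignment)
```

In this toy matrix, two pairs of pieces prefer the same slot under independent retrieval, while the assignment step yields a valid permutation. Because the matrix is simply sized by the number of pieces, the same procedure applies to any puzzle size.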
Citation
Talon, D., Del Bue, A., & James, S. (2022, October). GANzzle: Reframing jigsaw puzzle solving as a retrieval task using generative mental images. Presented at IEEE International Conference on Image Processing, Bordeaux, France
Field | Value |
---|---|
Presentation Conference Type | Conference Paper (published) |
Conference Name | IEEE International Conference on Image Processing |
Start Date | Oct 16, 2022 |
Publication Date | 2022 |
Deposit Date | Oct 24, 2024 |
Peer Reviewed | Peer Reviewed |
Book Title | 2022 IEEE International Conference on Image Processing (ICIP) |
DOI | https://doi.org/10.1109/ICIP46576.2022.9897553 |
Keywords | own, conference |
Public URL | https://durham-repository.worktribe.com/output/2024604 |