Davide Talon
GANzzle++: Generative approaches for jigsaw puzzle solving as local to global assignment in latent spatial representations
Talon, Davide; Del Bue, Alessio; James, Stuart
Abstract
Jigsaw puzzles are a popular and enjoyable pastime that humans can easily solve, even with many pieces. However, solving a jigsaw is a combinatorial problem: the space of possible solutions grows exponentially with the number of pieces, which makes it intractable for pairwise approaches. In contrast to the classical pairwise local matching of pieces based on edge heuristics, we estimate an approximate solution image, i.e., a mental image, of the puzzle and exploit it to guide the placement of pieces as a piece-to-global assignment problem. To this end, starting from unordered pieces, we consider conditioned generation approaches, including Generative Adversarial Network (GAN) models, Slot Attention (SA) and Vision Transformers (ViT), to recover the solution image. Given the generated solution representation, we cast jigsaw solving as a 1-to-1 assignment matching problem using Hungarian attention, which places pieces in their corresponding positions in the global solution estimate. Results show that the newly proposed GANzzle-SA and GANzzle-VIT benefit from the early fusion strategy, in which pieces are jointly compressed and gathered for global structure recovery. A single deep learning model generalizes to puzzles of different sizes and improves performance by a large margin. Evaluated on PuzzleCelebA and PuzzleWikiArts, our approaches narrow the gap between deep learning strategies and classic optimization-based puzzle solvers.
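To illustrate the piece-to-global assignment step described in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes each piece and each position of the estimated solution has already been encoded as a small feature vector; the names `hungarian`, `assign_pieces`, `slot_feats` and `piece_feats`, and the squared Euclidean cost, are hypothetical choices made for this example. It uses the plain Hungarian algorithm on a fixed cost matrix rather than the learned Hungarian attention used in the paper.

```python
def hungarian(cost):
    """Minimum-cost 1-to-1 assignment (Hungarian algorithm, O(n^3)).

    cost[i][j] is the cost of giving row i to column j; requires
    len(cost) <= len(cost[0]). Returns a list a where a[i] is the
    column assigned to row i.
    """
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    u = [0.0] * (n + 1)   # row potentials (1-indexed)
    v = [0.0] * (m + 1)   # column potentials (1-indexed)
    p = [0] * (m + 1)     # p[j]: row currently matched to column j (0 = free)
    way = [0] * (m + 1)   # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            # Shift potentials so at least one new column becomes tight.
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:  # reached a free column: augmenting path found
                break
        # Flip the matching along the augmenting path.
        while j0 != 0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    assignment = [0] * n
    for j in range(1, m + 1):
        if p[j] != 0:
            assignment[p[j] - 1] = j - 1
    return assignment


def assign_pieces(piece_feats, slot_feats):
    """Place each piece at one position of the estimated solution.

    Assumption: the cost is the squared Euclidean distance between a
    piece feature and a position feature (hypothetical choice).
    """
    cost = [
        [sum((a - b) ** 2 for a, b in zip(pf, sf)) for sf in slot_feats]
        for pf in piece_feats
    ]
    return hungarian(cost)


# Toy 2x2 puzzle: four position features from a hypothetical solution estimate.
slot_feats = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
# Unordered pieces: noisy copies of the position features, shuffled.
piece_feats = [[0.9, 0.1], [0.1, -0.1], [1.1, 0.9], [-0.1, 1.05]]
placement = assign_pieces(piece_feats, slot_feats)
print(placement)  # placement[i] is the position index chosen for piece i
```

Because the assignment is solved jointly, no two pieces can claim the same position, which a greedy per-piece nearest match does not guarantee.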
Citation
Talon, D., Del Bue, A., & James, S. (2025). GANzzle++: Generative approaches for jigsaw puzzle solving as local to global assignment in latent spatial representations. Pattern Recognition Letters, 187, 35-41. https://doi.org/10.1016/j.patrec.2024.11.010
Field | Value |
---|---|
Journal Article Type | Article |
Acceptance Date | Nov 9, 2024 |
Online Publication Date | Nov 19, 2024 |
Publication Date | 2025-01 |
Deposit Date | Nov 21, 2024 |
Publicly Available Date | Feb 12, 2025 |
Journal | Pattern Recognition Letters |
Print ISSN | 0167-8655 |
Electronic ISSN | 1872-7344 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 187 |
Pages | 35-41 |
DOI | https://doi.org/10.1016/j.patrec.2024.11.010 |
Public URL | https://durham-repository.worktribe.com/output/3103967 |
Files
Published Journal Article
(2.1 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/