Paper
20 June 2023 Research on cross-modal hashing method based on diffusion mode
Wenjiao Li, Zirui Zhong
Author Affiliations +
Proceedings Volume 12715, Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023); 127150Z (2023) https://doi.org/10.1117/12.2682410
Event: Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023), 2023, Dalian, China
Abstract
For achieving fast and flexible retrieval across heterogeneous modalities, unsupervised is more flexible and easy to use than supervised methods, of which the unsupervised method GAN is the most popular. However, GAN has been suffering from the problems of lack of diversity in generated samples, debugging difficulties and training instability. A cross-modal hashing method based on a diffusion model is proposed in the paper. Specifically: (1) For the first time, the diffusion model is applied to the field of cross-modal retrieval, targeting three modalities for mutual retrieval. (2) The combination of adversarial network GAN and diffusion model improves the sample quality and sample diversity, and ameliorates the problems of complex GAN debugging and unstable training. The effectiveness of the proposed method is demonstrated through experiments on three datasets and comparison with state-of-the-art methods.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Wenjiao Li and Zirui Zhong "Research on cross-modal hashing method based on diffusion mode", Proc. SPIE 12715, Eighth International Conference on Electronic Technology and Information Science (ICETIS 2023), 127150Z (20 June 2023); https://doi.org/10.1117/12.2682410
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Diffusion

Data modeling

Gallium nitride

Education and training

Particle filters

Statistical modeling

Databases

Back to Top