Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/114664
DC Field | Value | Language
dc.contributor.authorLi, Qiqi-
dc.contributor.authorMa, Longfei-
dc.contributor.authorJiang, Zheng-
dc.contributor.authorLi, Mingyong-
dc.contributor.authorJin, Bo-
dc.date.accessioned2024-04-04T10:12:55Z-
dc.date.available2024-04-04T10:12:55Z-
dc.date.issued2023-
dc.identifier.issn1546-2226pt
dc.identifier.urihttps://hdl.handle.net/10316/114664-
dc.description.abstractIn recent years, cross-modal hash retrieval has become a popular research field because of its high efficiency and low storage cost. Cross-modal retrieval technology can be applied to search engines, cross-modal medical processing, etc. The existing mainstream method is to use a multi-label matching paradigm to finish the retrieval tasks. However, such methods do not use the fine-grained information in multi-modal data, which may lead to suboptimal results. To avoid cross-modal matching degenerating into label matching, this paper proposes an end-to-end fine-grained cross-modal hash retrieval method that focuses more on the fine-grained semantic information of multi-modal data. First, the method refines the image features and, instead of representing text features with multiple labels, processes the text with BERT. Second, the method uses the inference capabilities of the transformer encoder to generate global fine-grained features. Finally, in order to better judge the effect of the fine-grained model, this paper uses datasets from the image-text matching field instead of the traditional label-matching datasets. The method is evaluated on the Microsoft COCO (MS-COCO) and Flickr30K datasets and compared with previous classical methods. The experimental results show that this method achieves more advanced results in the cross-modal hash retrieval field.pt
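The abstract's core idea (hash retrieval) can be illustrated with a toy sketch. This is not the authors' TECMH implementation: it assumes hypothetical pre-computed feature vectors and only shows the generic hashing step, where continuous features from any encoder (e.g. BERT for text, a transformer encoder for images) are binarized by sign, and retrieval ranks gallery items by Hamming distance to the query code.

```python
# Toy sketch of the hashing step in cross-modal retrieval (NOT the
# paper's TECMH model): binarize encoder features into hash codes,
# then rank gallery items by Hamming distance to the query.

def to_hash_code(features):
    """Binarize a real-valued feature vector into a {0, 1} hash code."""
    return [1 if x >= 0 else 0 for x in features]

def hamming_distance(a, b):
    """Count the bit positions where two hash codes differ."""
    return sum(x != y for x, y in zip(a, b))

def retrieve(query_features, gallery_features):
    """Return gallery indices sorted by Hamming distance to the query."""
    q = to_hash_code(query_features)
    codes = [to_hash_code(g) for g in gallery_features]
    return sorted(range(len(codes)), key=lambda i: hamming_distance(q, codes[i]))

# Hypothetical 4-bit features for one text query and three images.
text_query = [0.9, -0.2, 0.4, -0.7]
images = [
    [0.8, -0.1, 0.3, -0.9],   # same hash code as the query -> distance 0
    [-0.5, 0.6, 0.2, -0.1],   # differs in 2 bits
    [-0.3, 0.7, -0.8, 0.2],   # differs in 4 bits
]
print(retrieve(text_query, images))  # -> [0, 1, 2]
```

Binary codes make large-scale retrieval cheap (bitwise XOR + popcount instead of float similarity), which is the "high efficiency and low storage" advantage the abstract refers to.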
dc.language.isoengpt
dc.publisherTech Science Presspt
dc.relationThis work was partially supported by Chongqing Natural Science Foundation of China (Grant No. CSTB2022NSCQ-MSX1417), the Science and Technology Research Program of Chongqing Municipal Education Commission (Grant No. KJZD-K202200513) and Chongqing Normal University Fund (Grant No. 22XLB003), Chongqing Education Science Planning Project (Grant No. 2021-GX-320) and Humanities and Social Sciences Project of Chongqing Education Commission of China (Grant No. 22SKGH100)pt
dc.rightsopenAccesspt
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/pt
dc.subjectDeep learningpt
dc.subjectcross-modal retrievalpt
dc.subjecthash learningpt
dc.subjecttransformerpt
dc.titleTECMH: Transformer-Based Cross-Modal Hashing for Fine-Grained Image-Text Retrievalpt
dc.typearticle-
degois.publication.firstPage3713pt
degois.publication.lastPage3728pt
degois.publication.issue2pt
degois.publication.titleComputers, Materials and Continuapt
dc.peerreviewedyespt
dc.identifier.doi10.32604/cmc.2023.037463pt
degois.publication.volume75pt
dc.date.embargo2023-01-01*
uc.date.periodoEmbargo0pt
item.openairetypearticle-
item.fulltextCom Texto completo-
item.languageiso639-1en-
item.grantfulltextopen-
item.cerifentitytypePublications-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
crisitem.author.researchunitISR - Institute of Systems and Robotics-
crisitem.author.parentresearchunitUniversity of Coimbra-
crisitem.author.orcid0000-0001-9255-5772-
Appears in Collections:FCTUC Eng.Electrotécnica - Artigos em Revistas Internacionais
I&D ISR - Artigos em Revistas Internacionais
This item is licensed under a Creative Commons License.