Parallel encoder-decoder framework for image captioning

Saeidimesineh, Reyhane; Adibi, Peyman; Karshenas, Hossein; Darvishy, Alireza

doi:10.1016/j.knosys.2023.111056

Full metadata record

DC Field	Value	Language
dc.contributor.author	Saeidimesineh, Reyhane	-
dc.contributor.author	Adibi, Peyman	-
dc.contributor.author	Karshenas, Hossein	-
dc.contributor.author	Darvishy, Alireza	-
dc.date.accessioned	2023-10-27T09:09:48Z	-
dc.date.available	2023-10-27T09:09:48Z	-
dc.date.issued	2023	-
dc.identifier.issn	0950-7051	de_CH
dc.identifier.issn	1872-7409	de_CH
dc.identifier.uri	https://digitalcollection.zhaw.ch/handle/11475/28970	-
dc.description.abstract	Recent progress in deep learning has led to successful utilization of encoder–decoder frameworks inspired by machine translation in image captioning models. The stacking of layers in encoders and decoders has made it possible to use several modules in encoders and decoders. However, just one type of module in encoder or decoder has been used in stacked models. In this research, we propose a parallel encoder–decoder framework that aims to take advantage of multiple of types modules in encoders and decoders, simultaneously. This framework contains augmented parallel blocks, which include stacking modules or non-stacked ones. Then, the results of the blocks are integrated to extract higher-level semantic concepts. This general idea is not limited to image captioning and can be customized for many applications that utilize encoder–decoder frameworks. We evaluated our proposed method on the MS-COCO dataset and achieved state-of-the-art results. We got 149.92 for CIDEr-D metric outperforming state-of-the-art image captioning models.	de_CH
dc.language.iso	en	de_CH
dc.publisher	Elsevier	de_CH
dc.relation.ispartof	Knowledge-Based Systems	de_CH
dc.rights	Licence according to publishing contract	de_CH
dc.subject	Parallelization	de_CH
dc.subject	Encoder–decoder framework	de_CH
dc.subject	Image captioning	de_CH
dc.subject	Natural language processing	de_CH
dc.subject.ddc	006: Spezielle Computerverfahren	de_CH
dc.title	Parallel encoder-decoder framework for image captioning	de_CH
dc.type	Beitrag in wissenschaftlicher Zeitschrift	de_CH
dcterms.type	Text	de_CH
zhaw.departement	School of Engineering	de_CH
zhaw.organisationalunit	Institut für Informatik (InIT)	de_CH
dc.identifier.doi	10.1016/j.knosys.2023.111056	de_CH
zhaw.funding.eu	No	de_CH
zhaw.issue	111056	de_CH
zhaw.originated.zhaw	Yes	de_CH
zhaw.publication.status	publishedVersion	de_CH
zhaw.volume	282	de_CH
zhaw.publication.review	Peer review (Publikation)	de_CH
zhaw.webfeed	Human Information Interaction	de_CH
zhaw.webfeed	Human-Centered Computing	de_CH
zhaw.author.additional	No	de_CH
zhaw.display.portrait	Yes	de_CH
Appears in collections:	Publikationen School of Engineering

Files in This Item:

There are no files associated with this item.

Show simple item record

Saeidimesineh, R., Adibi, P., Karshenas, H., & Darvishy, A. (2023). Parallel encoder-decoder framework for image captioning. Knowledge-Based Systems, 282(111056). https://doi.org/10.1016/j.knosys.2023.111056

Saeidimesineh, R. et al. (2023) ‘Parallel encoder-decoder framework for image captioning’, Knowledge-Based Systems, 282(111056). Available at: https://doi.org/10.1016/j.knosys.2023.111056.

R. Saeidimesineh, P. Adibi, H. Karshenas, and A. Darvishy, “Parallel encoder-decoder framework for image captioning,” Knowledge-Based Systems, vol. 282, no. 111056, 2023, doi: 10.1016/j.knosys.2023.111056.

SAEIDIMESINEH, Reyhane, Peyman ADIBI, Hossein KARSHENAS und Alireza DARVISHY, 2023. Parallel encoder-decoder framework for image captioning. Knowledge-Based Systems. 2023. Bd. 282, Nr. 111056. DOI 10.1016/j.knosys.2023.111056

Saeidimesineh, Reyhane, Peyman Adibi, Hossein Karshenas, and Alireza Darvishy. 2023. “Parallel Encoder-Decoder Framework for Image Captioning.” Knowledge-Based Systems 282 (111056). https://doi.org/10.1016/j.knosys.2023.111056.

Saeidimesineh, Reyhane, et al. “Parallel Encoder-Decoder Framework for Image Captioning.” Knowledge-Based Systems, vol. 282, no. 111056, 2023, https://doi.org/10.1016/j.knosys.2023.111056.