Full metadata record
DC Field | Value | Language
dc.contributor.author | Saeidimesineh, Reyhane | -
dc.contributor.author | Adibi, Peyman | -
dc.contributor.author | Karshenas, Hossein | -
dc.contributor.author | Darvishy, Alireza | -
dc.date.accessioned | 2023-10-27T09:09:48Z | -
dc.date.available | 2023-10-27T09:09:48Z | -
dc.date.issued | 2023 | -
dc.identifier.issn | 0950-7051 | de_CH
dc.identifier.issn | 1872-7409 | de_CH
dc.identifier.uri | https://digitalcollection.zhaw.ch/handle/11475/28970 | -
dc.description.abstract | Recent progress in deep learning has led to successful utilization of encoder–decoder frameworks inspired by machine translation in image captioning models. The stacking of layers in encoders and decoders has made it possible to use several modules in encoders and decoders. However, just one type of module in encoder or decoder has been used in stacked models. In this research, we propose a parallel encoder–decoder framework that aims to take advantage of multiple types of modules in encoders and decoders, simultaneously. This framework contains augmented parallel blocks, which include stacking modules or non-stacked ones. Then, the results of the blocks are integrated to extract higher-level semantic concepts. This general idea is not limited to image captioning and can be customized for many applications that utilize encoder–decoder frameworks. We evaluated our proposed method on the MS-COCO dataset and achieved state-of-the-art results. We got 149.92 for CIDEr-D metric outperforming state-of-the-art image captioning models. | de_CH
dc.language.iso | en | de_CH
dc.publisher | Elsevier | de_CH
dc.relation.ispartof | Knowledge-Based Systems | de_CH
dc.rights | Licence according to publishing contract | de_CH
dc.subject | Parallelization | de_CH
dc.subject | Encoder–decoder framework | de_CH
dc.subject | Image captioning | de_CH
dc.subject | Natural language processing | de_CH
dc.subject.ddc | 006: Special computer methods | de_CH
dc.title | Parallel encoder-decoder framework for image captioning | de_CH
dc.type | Article in scientific journal | de_CH
dcterms.type | Text | de_CH
zhaw.departement | School of Engineering | de_CH
zhaw.organisationalunit | Institut für Informatik (InIT) | de_CH
dc.identifier.doi | 10.1016/j.knosys.2023.111056 | de_CH
zhaw.funding.eu | No | de_CH
zhaw.issue | 111056 | de_CH
zhaw.originated.zhaw | Yes | de_CH
zhaw.publication.status | publishedVersion | de_CH
zhaw.volume | 282 | de_CH
zhaw.publication.review | Peer review (publication) | de_CH
zhaw.webfeed | Human Information Interaction | de_CH
zhaw.webfeed | Human-Centered Computing | de_CH
zhaw.author.additional | No | de_CH
zhaw.display.portrait | Yes | de_CH
Appears in collections: Publikationen School of Engineering

Files in This Item:
There are no files associated with this item.
Saeidimesineh, R., Adibi, P., Karshenas, H., & Darvishy, A. (2023). Parallel encoder-decoder framework for image captioning. Knowledge-Based Systems, 282(111056). https://doi.org/10.1016/j.knosys.2023.111056
Saeidimesineh, R. et al. (2023) ‘Parallel encoder-decoder framework for image captioning’, Knowledge-Based Systems, 282(111056). Available at: https://doi.org/10.1016/j.knosys.2023.111056.
R. Saeidimesineh, P. Adibi, H. Karshenas, and A. Darvishy, “Parallel encoder-decoder framework for image captioning,” Knowledge-Based Systems, vol. 282, no. 111056, 2023, doi: 10.1016/j.knosys.2023.111056.
SAEIDIMESINEH, Reyhane, Peyman ADIBI, Hossein KARSHENAS and Alireza DARVISHY, 2023. Parallel encoder-decoder framework for image captioning. Knowledge-Based Systems. 2023. Vol. 282, no. 111056. DOI 10.1016/j.knosys.2023.111056
Saeidimesineh, Reyhane, Peyman Adibi, Hossein Karshenas, and Alireza Darvishy. 2023. “Parallel Encoder-Decoder Framework for Image Captioning.” Knowledge-Based Systems 282 (111056). https://doi.org/10.1016/j.knosys.2023.111056.
Saeidimesineh, Reyhane, et al. “Parallel Encoder-Decoder Framework for Image Captioning.” Knowledge-Based Systems, vol. 282, no. 111056, 2023, https://doi.org/10.1016/j.knosys.2023.111056.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.