Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Simple Swift class providing all the configurations you need to create a custom camera view in your app
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
TensorFlow Implementation of "Show, Attend and Tell"
Unofficial PyTorch implementation of "Self-critical Sequence Training for Image Captioning" and other papers.
Oscar and VinVL
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
An open-source tool for sequence learning in NLP built on TensorFlow.
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Image Captioning using InceptionV3 and beam search
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Transformer-based image captioning extension for pytorch/fairseq
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
A reverse image search engine powered by Elasticsearch and TensorFlow
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Image Captions Generation with Spatial and Channel-wise Attention
A neural network to generate captions for an image using a CNN and an RNN with beam search.
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Video to Text: Natural language description generator for some given video. [Video Captioning]
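Several of the repositories above (e.g. the InceptionV3 captioner and the CNN + RNN captioner) decode captions with beam search. A minimal, framework-free sketch of the idea, where the hypothetical `next_log_probs` callback stands in for a real decoder's next-token distribution:

```python
import math

def beam_search(next_log_probs, start_token, end_token, beam_size=3, max_len=10):
    """Decode the highest-scoring sequence under a next-token model.

    next_log_probs(seq) returns a dict {token: log-probability} for the
    tokens that may follow seq. Sequences are scored by summed log-probs.
    """
    beams = [([start_token], 0.0)]  # (partial sequence, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, lp in next_log_probs(seq).items():
                if tok == end_token:
                    finished.append((seq + [tok], score + lp))
                else:
                    candidates.append((seq + [tok], score + lp))
        if not candidates:
            break
        # Keep only the beam_size best partial sequences.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_size]
    finished.extend(beams)  # fall back to unfinished beams if nothing ended
    return max(finished, key=lambda c: c[1])[0]

# Toy next-token table (made up for illustration, not from any repo above).
_TABLE = {
    "<s>": {"a": math.log(0.4), "the": math.log(0.6)},
    "the": {"dog": math.log(0.5), "</s>": math.log(0.5)},
    "a":   {"cat": math.log(0.9), "</s>": math.log(0.1)},
    "cat": {"</s>": math.log(1.0)},
    "dog": {"</s>": math.log(1.0)},
}

def toy_next_log_probs(seq):
    return _TABLE.get(seq[-1], {})
```

With this table, greedy decoding would commit to "the" (p=0.6) at the first step, while a beam of size 2 or more also keeps "a" and finds the globally better caption "a cat" (0.4 x 0.9 = 0.36 vs. 0.6 x 0.5 = 0.30) — which is exactly why the captioning repos above prefer beam search over greedy decoding.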