Image Captioner

This is another project that was part of my Computer Vision Nanodegree from 2020. In this notebook, I cover the process of developing an image captioner: a network that receives previously unseen images and returns written descriptions of their content. It combines Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) cells to create a network with both feedforward and feedback connections. These connections allow the network to maintain memory between steps and handle sequential data, like language, where the last output determines the next output as much as the current input does.

Beyond the obvious but incredible difference an image captioner can make in terms of user accessibility, this project demonstrates a network that's capable of infering contextual nuance, which has countless other applications and is simply fascinating. On the other hand, this project sometimes demonstrates a network that's incapable of infering contextual nuance; which, while disappointing, can also be pretty funny.

Probably the most valuable outcome of this project will be the development notebook, which I think could be a useful reference for future machine learning projects. Please take a look if this sounds interesting to you!

Features

Generate a caption of about 10 words for any given image
Explore the key differences between CNNs and RNNs
Learn how to use and build a custom data loader for language tasks
Construct an encoder-decoder pipeline for sequential data inference

Credits

This project was part of Udacity's Computer Vision Nanodegree.
Image data is provided by Microsoft's Common Objects in Context and COCO API.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
images		images
models		models
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
data_loader.py		data_loader.py
index.html		index.html
model.py		model.py
notebook.ipynb		notebook.ipynb
vocab.pkl		vocab.pkl
vocabulary.py		vocabulary.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioner

Features

Credits

License

About

Uh oh!

Contributors

Uh oh!

Languages

License

SeanvonB/image-captioner

Folders and files

Latest commit

History

Repository files navigation

Image Captioner

Features

Credits

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages