Captioning

Sam Witteveen and I started the TensorFlow and Deep Learning Singapore group on MeetUp in February 2017, and the fifth MeetUp, aka 'Advanced Text', was again hosted by Google.

This 30 minute talk (which was intended to last between 45 minutes and 1 hour, but was cut short due to others running long) included a Keras version of a number of Captioning models :

Standard LSTM
Dilated CNN (á la DeepMind)
CNN of Gated Linear Units (á la Facebook)
Attention-is-all-you-need (a very recent paper from Google)

The presentation also includes links to the relevant code at suitable points.

If there are any questions about the presentation please ask below, or contact me using the details given on the slides themselves.

Video Link

The presentation was kindly recorded by Engineers.sg.

Data set

The notebook(s) in the Captioning section of the CNN folder require you to download the Flickr30k dataset yourself, because there are license restrictions that prevent it from being shared freely. Which is a pity.