For this blogpost, I'll be explaining on how to use a pretrained model and creating an image captioning model using it. The input is an image and it is imbedded with a condition LSTM. On that embedding, the next token is predicted using a START inp...
cha1tanya.hashnode.dev4 min read
No responses yet.