Extending the Question Answering Models to Visual Question Answering #1382
Labels: enhancement, help wanted, won't fix
🚀 Feature
Extending the idea of Question Answering to Visual Question Answering
Motivation
I was going through the examples and was interested in using transformers for Visual Question Answering, but I could not find many code resources on the topic, so I thought of contributing my own implementation (written in PyTorch). I believe the implementation is simple enough to be fine-tuned quickly on any dataset.
Pitch
I am not sure how best to pitch this, but I have implemented the model and obtained fair results with it. I want to extend its applicability to any dataset, and since it is a multi-modal model, it would be helpful for the research community as well.
Alternatives
Not sure, since this is a model contribution.
Additional context
Here is the implementation.
What does this implementation contain?
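Since the linked code lives externally, here is a minimal, illustrative sketch of what a two-stream VQA model along these lines might look like in PyTorch: a small CNN encodes the image, a Transformer encoder encodes the tokenized question, the two feature vectors are fused, and a classifier predicts over a fixed answer vocabulary. All names and sizes here (`VQAModel`, `vocab_size`, `num_answers`, `d_model`) are assumptions for illustration, not taken from the linked implementation.

```python
# Illustrative two-stream VQA sketch, NOT the linked implementation.
import torch
import torch.nn as nn

class VQAModel(nn.Module):
    def __init__(self, vocab_size=10000, num_answers=3000, d_model=256):
        super().__init__()
        # Image stream: tiny CNN producing a d_model-dim image feature.
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, d_model),
        )
        # Question stream: token embedding + Transformer encoder.
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        # Fusion by elementwise product, then answer classification.
        self.classifier = nn.Linear(d_model, num_answers)

    def forward(self, image, question_ids):
        img_feat = self.cnn(image)                    # (B, d_model)
        txt = self.encoder(self.embed(question_ids))  # (B, T, d_model)
        txt_feat = txt.mean(dim=1)                    # (B, d_model)
        return self.classifier(img_feat * txt_feat)   # answer logits

# Usage: a batch of 2 RGB images and tokenized questions of length 12.
model = VQAModel()
logits = model(torch.randn(2, 3, 224, 224), torch.randint(0, 10000, (2, 12)))
print(logits.shape)  # torch.Size([2, 3000])
```

Treating VQA as classification over a fixed answer vocabulary is the common baseline formulation; the fusion step (an elementwise product here) could equally be concatenation or attention-based.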