[FLAVA] Make projections part of the core model #106

Closed

ankitade wants to merge 10 commits into gh/ankitade/5/base from gh/ankitade/5/head

Contributor

ankitade commented Jun 21, 2022

Move projections from the contrastive loss to the core model.
This allows users to use the core model (instead of the pretraining model) for zero-shot tasks.
Also switched to the translated checkpoint.
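
To illustrate why this helps, here is a minimal sketch (not the actual FLAVA code) of zero-shot classification using only a model that owns its projections. Everything in it is an assumption made for illustration: `ToyModel`, `encode_image`, `encode_text`, and `zero_shot_classify` are hypothetical names, plain Python lists stand in for tensors, and the encoder outputs are passed in directly rather than computed.

```python
import math


def matvec(weight, vec):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weight]


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


class ToyModel:
    """Hypothetical stand-in for the core model: it owns both projections."""

    def __init__(self, image_projection, text_projection):
        # The projection weights live on the model, not on the loss.
        self.image_projection = image_projection
        self.text_projection = text_projection

    def encode_image(self, image_features):
        # A real encoder would compute image_features; here they are given.
        return l2_normalize(matvec(self.image_projection, image_features))

    def encode_text(self, text_features):
        return l2_normalize(matvec(self.text_projection, text_features))


def zero_shot_classify(model, image_features, class_text_features):
    """Return the index of the class prompt most similar to the image, plus all scores."""
    image_embedding = model.encode_image(image_features)
    scores = []
    for text_features in class_text_features:
        text_embedding = model.encode_text(text_features)
        scores.append(sum(a * b for a, b in zip(image_embedding, text_embedding)))
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Identity projections keep the arithmetic easy to follow.
identity = [[1.0, 0.0], [0.0, 1.0]]
model = ToyModel(image_projection=identity, text_projection=identity)

image = [0.9, 0.1]
class_prompts = [[1.0, 0.0], [0.0, 1.0]]
best, scores = zero_shot_classify(model, image, class_prompts)
print(best)
```

Because the projected embeddings come from the model itself, no loss module needs to be constructed at inference time; a contrastive loss would simply consume the same embeddings during pretraining.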

Test plan

pytest
python -m flava.train config=flava/configs/pretraining/debug.yaml
python -m flava.finetune config=flava/configs/finetuning/qnli.yaml

Stack from ghstack (oldest at bottom):

Differential Revision: D37481127


          Temp CL

738111a

[ghstack-poisoned]

This was referenced Jun 21, 2022

Moving flava model to its own folder #96

Closed

[FLAVA] Separate out text and image encoders #102

Closed

[FLAVA]Change some initialization orders and corresponding tests #105

Closed

facebook-github-bot added the CLA Signed label

ankitade added a commit that referenced this pull request


          Temp CL

182c75e

ghstack-source-id: 1b7477c
Pull Request resolved: #106


          Update on "Temp CL"

2934b63

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          Temp CL

e4709b3

ghstack-source-id: 0bca6e6
Pull Request resolved: #106

codecov-commenter commented Jun 23, 2022

Codecov Report

❗ No coverage uploaded for pull request base (gh/ankitade/5/base@3f7009e).
The diff coverage is n/a.

@@                  Coverage Diff                  @@
##             gh/ankitade/5/base     #106   +/-   ##
=====================================================
  Coverage                      ?   93.04%           
=====================================================
  Files                         ?       47           
  Lines                         ?     2776           
  Branches                      ?        0           
=====================================================
  Hits                          ?     2583           
  Misses                        ?      193           
  Partials                      ?        0

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 3f7009e...4bcee67.


          Update on "Temp CL"

c59460e

[ghstack-poisoned]

ankitade mentioned this pull request

[Flava] Add ckpt loading and accuracy metric to finetuning #119

Closed

ankitade added a commit that referenced this pull request


          Temp CL

ffdbc6a

ghstack-source-id: 4c0738f
Pull Request resolved: #106


          Update on "Temp CL"

43f7cff

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

901662f

ghstack-source-id: a97330d
Pull Request resolved: #106

ankitade changed the title from "Temp CL" to "[FLAVA] Make projections part of the core model"


          Update on "[FLAVA] Make projections part of the core model"

89b126e

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

c801910

ghstack-source-id: f8b9173
Pull Request resolved: #106


          Update on "[FLAVA] Make projections part of the core model"

f805ce2

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

72fe0af

ghstack-source-id: 4844b17
Pull Request resolved: #106

Contributor Author

ankitade commented Jun 28, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


          Update on "[FLAVA] Make projections part of the core model"

0d43ce3

[ghstack-poisoned]

ankitade added a commit that referenced this pull request


          [FLAVA] Move projections from contrastive loss to model

6fac11d

ghstack-source-id: e6b230c
Pull Request resolved: #106

This was referenced Jul 4, 2022

change order of itm loss init #131

Draft

[FLAVA]Move itm head to flava model for pretraining #132

Draft

ankitade requested review from RdoubleA, apsdehal, ebsmothers and langong347

July 13, 2022 06:25

ankitade marked this pull request as ready for review

July 13, 2022 06:26

ebsmothers approved these changes


          Update on "[FLAVA] Make projections part of the core model"

44c8379

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


          Update on "[FLAVA] Make projections part of the core model"

2ba6ff6

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 23, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 24, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 24, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor Author

ankitade commented Jul 25, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ankitade mentioned this pull request

[FLAVA] Move masked prediction head to flava_for_pretraining #195

Draft


          Update on "[FLAVA] Make projections part of the core model"

4bcee67

[ghstack-poisoned]

Contributor Author

ankitade commented Jul 26, 2022

@ankitade has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot closed this in

679f359

facebook-github-bot deleted the gh/ankitade/5/head branch

July 29, 2022 14:17
