sileod/deberta-v3-base-tasksource-nli

Zero-Shot Classification

text-classification

deberta-v3-base

natural-language-inference

extreme-multi-task

Inference Endpoints

sileod committed 27 days ago

Commit d558a5c • 1 Parent(s): 6afdd36

Update README.md

Files changed (1)

README.md +1 -1

@@ -373,7 +373,7 @@ https://ibm.github.io/model-recycling/
 ### Software and training details
-The model was trained on 600 tasks for 200k steps with a batch size of 384 and a peak learning rate of 2e-5. Training took 12 days on Nvidia A30 24GB gpu.
+The model was trained on 600 tasks for 200k steps with a batch size of 384 and a peak learning rate of 2e-5. Training took 15 days on Nvidia A30 24GB gpu.
 This is the shared model with the MNLI classifier on top. Each task had a specific CLS embedding, which is dropped 10% of the time to facilitate model use without it. All multiple-choice model used the same classification layers. For classification tasks, models shared weights if their labels matched.