modularStarEncoder
/

ModularStarEncoder

Feature Extraction

ModularStarEncoder

Model card Files Files and versions

andreagurioli1995 commited on Feb 21, 2025

Commit

5cee63e

·

verified ·

1 Parent(s): 7b1dd54

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -78,6 +78,7 @@ The pre-training and fine-tuning were conducted on 512 NVIDIA Ampere (64GB) GPUs
 | Num. of parameters       | ≈1B       |
 | Training tokens          | ≈1T       |
 |Loss function             |MLM + In-Context loss|
 ## Licence
 The model is licensed under the BigCode OpenRAIL-M v1 license agreement. You can find the full agreement [here](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement).

 | Num. of parameters       | ≈1B       |
 | Training tokens          | ≈1T       |
 |Loss function             |MLM + In-Context loss|
+|Multi-layer loss          | yes       |
 ## Licence
 The model is licensed under the BigCode OpenRAIL-M v1 license agreement. You can find the full agreement [here](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement).