GShard (dense)
GoogleTranslation
Developed by Google in 2020, GShard (dense) is a translation model with 2300000000.0 parameters.
About GShard (dense)
Neural network scaling has been critical for improving the model quality in many real-world machine learning applications with vast amounts of training data and compute. Although this trend of scaling is affirmed to be a sure-fire approach for better
Details
- Provider
- Task
- Translation
- Parameters
- 2300000000.0
- Released
- 2020-06-30
- Open weights
- No