KoboldAI/fairseq-dense-355M

Add basic model information

d071c5d about 2 years ago

408 Bytes

---
language: en
---
This is a Hugging Face transformers-compatible conversion of the original dense 355M-parameter model from the paper "[Efficient Large Scale Language Modeling with Mixtures of Experts](https://arxiv.org/abs/2112.10684)" by Artetxe et al. For details on training data, intended use, and limitations, please refer to the [original model card](https://github.com/facebookresearch/fairseq/blob/main/examples/moe_lm/model_card.md).
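
This card does not include usage code, so the following is only a minimal sketch of how a transformers-compatible causal language model is typically loaded and sampled. It assumes that `transformers` and PyTorch are installed and that the checkpoint is published under the repository ID `KoboldAI/fairseq-dense-355M` shown on this page. The helper name `generate_text` and its parameters are hypothetical and chosen for illustration.

```python
# Assumption: repository ID taken from this model page.
MODEL_ID = "KoboldAI/fairseq-dense-355M"


def generate_text(prompt: str, max_new_tokens: int = 40) -> str:
    """Hypothetical helper: greedily continue `prompt` with the model."""
    # Imported here so the module can be imported without transformers
    # installed; the first call downloads the weights from the Hub.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")

    # Greedy decoding (no sampling) keeps the output deterministic.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,
    )

    # Decode the full sequence (prompt plus continuation) back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate_text("The history of language modeling began"))
```

Greedy decoding is used here only for reproducibility; sampling settings such as `do_sample=True` with a `temperature` may give more varied text.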