Malaysian LLM2Vec
Collection
Extending Malaysian causal language models with non-causal (bidirectional) masking training, following LLM2Vec, https://arxiv.org/abs/2404.05961
A replication of https://github.com/McGill-NLP/llm2vec using https://huggingface.co/mesolitica/malaysian-mistral-349M-4096 as the base model, done by https://github.com/aisyahrzk (https://twitter.com/aisyahhhrzk)
Source code: https://github.com/mesolitica/malaya/tree/master/session/llm2vec
WandB project: https://wandb.ai/aisyahrazak/mistral-349M-mlm?nw=nwuseraisyahrazak
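
The difference between causal and non-causal masking mentioned in the description can be shown with a small sketch. It is an illustration only, not code from the linked repositories: the function names (`causal_mask`, `bidirectional_mask`, `visible_positions`) are hypothetical, and the masks are plain Python lists rather than the tensors a real model would use.

```python
def causal_mask(n):
    """Decoder-style mask: position i may attend only to positions 0..i."""
    return [[j <= i for j in range(n)] for i in range(n)]


def bidirectional_mask(n):
    """Non-causal mask: every position may attend to every position."""
    return [[True] * n for _ in range(n)]


def visible_positions(mask, i):
    """List the positions that token i is allowed to attend to."""
    return [j for j, allowed in enumerate(mask[i]) if allowed]


if __name__ == "__main__":
    n = 4  # toy sequence length (assumption for the example)
    print(visible_positions(causal_mask(n), 1))         # [0, 1]
    print(visible_positions(bidirectional_mask(n), 1))  # [0, 1, 2, 3]
```

With the causal mask, token 1 sees only itself and the token before it. With the bidirectional mask it sees the whole sequence, which is the property a text encoder typically relies on when producing embeddings.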