BLOOM can also be instructed to perform text tasks it hasn't been explicitly trained for, by casting them as text generation tasks. As such, it is able to output coherent text in 46 languages and 13 programming languages that is hardly distinguishable from text written by humans. BigScience Large Open-science Open-access Multilingual Language ModelĬurrent Checkpoint: Training Iteration 95000īLOOM is an autoregressive Large Language Model (LLM), trained to continue text from a prompt on vast amounts of text data using industrial-scale computational resources.