BigScience Large Model Training

@BigScienceLLM

Follow the training of "BLOOM 🌸", the @BigScienceW multilingual 176B parameter open-science open-access language model, a research tool for the AI community.

Pinned

The BLOOM model is now officially released! Read more here: bigscience.huggingface.co/blog/bloom Find the model here: huggingface.co/bigscience/blo…

BLOOM is here. The largest open-access multilingual language model ever. Read more about it or get it at bigscience.huggingface.co/blog/bloom hf.co/bigscience/blo…



BigScience Large Model Training reposted

The Bloom paper is out. Looks like it's doing worse than the current GPT-3 API on zero-shot generation tasks in English, but better than other open-source LLMs & better than all in zero-shot multilingual (which was the main goal). Proud of the work from the community! arxiv.org/abs/2211.05100


BigScience Large Model Training reposted

Crosslingual Generalization through Multitask Finetuning 🌸 Demo: huggingface.co/bigscience/blo… 📜 arxiv.org/abs/2211.01786 💻github.com/bigscience-wor… We present BLOOMZ & mT0, a family of models w/ up to 176B params that follow human instructions in >100 languages zero-shot. 1/7


The super-fast inference solutions are finally here for all to use:

Learn how you can get under 1 ms per-token generation time with the BLOOM 176B model! Not one but multiple super-fast solutions, including DeepSpeed-Inference, Accelerate, and DeepSpeed ZeRO! huggingface.co/blog/bloom-inf…
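A quick back-of-envelope on what a per-token latency like that implies (the 1 ms/token figure is from the tweet above; the 100-token completion length is just an illustration value):

```python
# Back-of-envelope throughput implied by a fixed per-token generation latency.
# The 1 ms/token figure comes from the tweet above; the completion length
# below is a made-up illustration value.

def tokens_per_second(latency_ms_per_token: float) -> float:
    """Throughput implied by a fixed per-token generation latency."""
    return 1000.0 / latency_ms_per_token

def completion_seconds(n_tokens: int, latency_ms_per_token: float) -> float:
    """Wall-clock time to stream an n-token completion."""
    return n_tokens * latency_ms_per_token / 1000.0

print(tokens_per_second(1.0))        # 1000 tokens/s at 1 ms/token
print(completion_seconds(100, 1.0))  # a 100-token reply in 0.1 s
```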



BigScience Large Model Training reposted

What do @StabilityAI @EMostaque #stablediffusion & @BigscienceW Bloom - aka the coolest new models ;) - have in common? They both use a new gen of ML licenses aimed at making ML more open & inclusive while keeping it harder to do harm with them. So cool! huggingface.co/blog/open_rail


BigScience Large Model Training reposted

The Technology Behind BLOOM Training🌸 Discover how @BigscienceW used @MSFTResearch DeepSpeed + @nvidia Megatron-LM technologies to train the World's Largest Open Multilingual Language Model (BLOOM): huggingface.co/blog/bloom-meg…
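The linked post describes BLOOM's 3D-parallel training setup; a minimal sketch of how the parallel degrees multiply out to a GPU count (the degrees below — tensor=4, pipeline=12, data=8 on 384 A100s — are the configuration described in that post, so treat them as assumptions if the exact numbers differ):

```python
# Sketch: in 3D parallelism (Megatron-LM + DeepSpeed style), every GPU
# holds exactly one (tensor, pipeline, data) coordinate, so the total
# GPU count is the product of the three parallel degrees.
# Degrees below are the BLOOM configuration as described in the linked
# post (assumption): tensor=4, pipeline=12, data=8.

def world_size(tensor_parallel: int, pipeline_parallel: int, data_parallel: int) -> int:
    return tensor_parallel * pipeline_parallel * data_parallel

print(world_size(4, 12, 8))  # 384 GPUs
```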


BigScience Large Model Training reposted

🌸@BigscienceW BLOOM's intermediate checkpoints have already shown some very cool capabilities! What's great about BLOOM is that you can ask it to generate the rest of a text, even though it is not fully trained yet! 👶 🧵 A thread with some examples


A milestone soon to be reached 🚀💫 Can't wait to see the capabilities and performance of this long-awaited checkpoint! What about you? Have you already prepared some prompts that you want to test? ✏️



▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 102%


For 111 days, we've enjoyed world-class hardware stability and throughput thanks to the hard work of our friends at @Genci_fr, @INS2I_CNRS, Megatron & DeepSpeed. Having reached our objective earlier than expected, we'll keep training for a few more days. Stay tuned, more soon ;)


▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 101%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 100%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 99%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 98%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 97%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 96%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 95%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 94%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 92%


▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 91%
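The countdown above uses a 15-cell text bar. A minimal sketch of how such a bar could be rendered; the rounding rule (add half, then truncate) is a guess that happens to reproduce the bars shown for 91% through 102%:

```python
# Render a 15-cell progress bar like the training-countdown tweets above.
# The rounding rule (round to the nearest cell, cap at a full bar) is an
# assumption that matches the bars shown for 91%..102%.

CELLS = 15

def render_bar(percent: int) -> str:
    # (percent * CELLS + 50) // 100 rounds to the nearest whole cell;
    # min() caps the bar once training passes 100% of the target.
    filled = min(CELLS, (percent * CELLS + 50) // 100)
    return "▓" * filled + "░" * (CELLS - filled) + f" {percent}%"

print(render_bar(96))   # ▓▓▓▓▓▓▓▓▓▓▓▓▓▓░ 96%
print(render_bar(102))  # ▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓ 102%
```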

