Qiang Gao

@gaoqiang_nlp

A third-year master's student at Wuhan University, focusing on natural language processing. I'm actively looking for a PhD position for Fall 2025.

China

cooper12121.github.io

Joined December 2022

30 Posts, 25 Followers, 175 Following

Qiang Gao reposted

Nathan Lambert

@natolambert

Feb 1

Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months. Policy gradient chapter is coming together. Plugging away at the book every day now. rlhfbook dot com

Qiang Gao reposted

Jason Kneen

@jasonkneen

Oct 3, 2024

The new chatGPT 4o with Canvas system prompt:

Qiang Gao reposted

internet hall of fame

@InternetH0F

Sep 1, 2024

This is the best use of a drone I've ever seen 😭

Qiang Gao reposted

harry law (hopfield network truther)

@lawhsw

Jul 29, 2024

don’t worry about it babe it’s just a brand

Qiang Gao reposted

MIT Media Lab

@medialab

Jun 3, 2024

Congratulations to the new faculty members in @MITEngineering, including Media Lab alum Anna Huang (@huangcza)! Huang will hold a joint appointment in @MITEECS and MIT Music and Theater Arts. news.mit.edu/2024/school-en…

Fifteen new faculty members join six of the MIT School of Engineering's academic departments in 2024.

School of Engineering welcomes new faculty

Source: news.mit.edu

Qiang Gao

@gaoqiang_nlp

Jun 17, 2024

generated by #DreamMachine

Qiang Gao reposted

DAIR.AI

@dair_ai

Jun 2, 2024

The Top ML Papers of the Week (May 27 - June 2):
- SimPO
- GNN-RAG
- Attention as an RNN
- Abacus Embeddings
- Symbolic Chain-of-Thought
- Contextual Position Encoding
...

Qiang Gao reposted

Nathan Lambert

@natolambert

May 30, 2024

I've been thinking about the many, MANY DPO spinoff methods we've been seeing recently for RLHF. IPO, D2PO, CPO, ORPO, SPO, sDPO, KTO, DNO... Most claim they're "the best" but don't properly compare to related work. What do we do in alignment research? Thread 📚

Qiang Gao

@gaoqiang_nlp

May 9, 2024

🎉 Exciting News! 🎉 Just open-sourced my latest project: Llama3-based 8x8b-MoE model! 🚀 Extends llama3-8B-Instruct model with MoE architecture. Check it out & give it a star! github.com/cooper12121/ll…

Qiang Gao reposted

BURKOV

@burkov

Jan 17, 2024

If you really want to do something useful in AI, instead of training another tiny llama, pick up this project hazyresearch.stanford.edu/blog/2024-01-1… and train a 1B-parameter multilingual BERT with 32k input size. The code is here github.com/HazyResearch/m2. The data is all over @huggingface. The…

GitHub - HazyResearch/m2: Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Source: github.com

Qiang Gao reposted

AI at Meta

@AIatMeta

Dec 30, 2023

To close out 2023, here are 10 of the most interesting AI research advancements we shared on our feed this year — and where you can find more details on the work. 1️⃣ Segment Anything (SAM) A step toward the first foundation model for image segmentation. Details:…

Qiang Gao reposted

AK

@_akhaliq

Dec 28, 2023

Alibaba releases DreaMoving demo on Hugging Face A Human Video Generation Framework based on Diffusion Models demo: huggingface.co/spaces/jiayong…

Qiang Gao reposted

Carlos E. Perez

@IntuitMachine

Dec 26, 2023

Key vulnerabilities of GPT-4:
1. Fine-tuning API can remove or diminish safety guardrails, causing the model to produce harmful outputs or assist with dangerous requests
2. Fine-tuning can make the model generate targeted misinformation against public figures
3. Fine-tuning…

Qiang Gao reposted

The Rundown AI

@TheRundownAI

Aug 2, 2023

If you're not using AI, you're falling behind.

Qiang Gao reposted

ひさだん

@hisadan

Sep 2, 2023

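//#つぶやきProcessing
int n=999999,p[]=new int[n],i,j;
float t=1;
void setup(){
  size(800,800);
  // Sieve: p[j] becomes nonzero for every composite j, so p[i]==0 marks primes
  for(i=2;i<n;i++)if(p[i]==0)for(j=i+i;j<n;j+=i)p[j]=i;
}
void draw(){
  clear();
  stroke(-1);
  // Plot each prime i at radius i/99 and angle i*t around the center (400,400)
  for(i=2;i<n;i++)if(p[i]==0)circle(i*sin(i*t)/99+400,i*cos(i*t)/99+400,2);
  t+=1e-7;
}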

Qiang Gao reposted

rishi@NeurIPS

@RishiBommasani

Mar 29, 2023

Foundation models are transforming society: in the past month alone, we've seen a flurry of releases! GPT-4, Claude, PaLM API, Alpaca, Dolly, Jurassic-2, PaLM-E, GPT4All, Cerebras-GPT, OpenFlamingo, ... We built Ecosystem Graphs to track their footprint: crfm.stanford.edu/ecosystem-grap…

Qiang Gao reposted

Vrdoljak J

@Vrda82073569

Jan 18, 2023

huggingface.co/spaces/JavaFXp… you should check this out

Qiang Gao reposted

Abacus.AI

@abacusai

Jan 13, 2023

4 essential books anyone should read:
• Machine Learning with PyTorch and Scikit-Learn
• Transformers for NLP
• Deep Learning with Python
• Designing Machine Learning Systems

Runhua ZHANG (Riva)

@Zhang_Runhua_

ReginaBecky

@fU1KUXw2ZOW0paa

Joan

@cordovajoan81

Plum Lab

@LabPlum

zixiang meng

@TkicMeng

Yu-Min Tseng

@ym_tseng

feliciytttty

@kkkkdomkkkk

mingwei

@mingwei78946797

Sushil Pokhrel

@sushilpokhrel

anpaure

@anpaure

Zonglin Yang

@Yang_zy223

betterest

@betterestli

Yangqiu Song

@yqsong

Suren T.

@therealthapa

Reysore

@ReysoreS5Thsf

Miranda Zhu 筱萌

@XiaomengMZhu

Michael Hanna@NeurIPS2025

@michaelwhanna

Xinyan Velocity Yu

@XinyanVYu

Dra. Saiph Savage

@saiphcita

Shangbin Feng

@shangbinfeng

Ardent

@arinobeshi21609

Endurance

@Fhm27x88v3zO9

gao

@gao_nlp

Sheata

@Sheata222258

Deloris

@deloris_milton

Binyuan Hui

@huybery

马东锡 NLP

@dongxi_nlp

a16z

@a16z

Linus

@thesephist

Latent.Space

@latentspacepod

Stratechery

@stratechery

Paul Graham

@paulg

jianlin.su

@Jianlin_S

Christopher Manning

@chrmanning

Jim Cramer

@jimcramer

Flood Sung

@RotekSong

Armen Aghajanyan

@ArmenAgha

Daya Guo

@Guodaya

Mark Chen

@markchen90

Hyung Won Chung

@hwchung27

Junyang Lin

@JustinLin610

gabriel

@GabrielPeterss4

Peter West

@PeterWestTM

DeepSeek

@deepseek_ai

CopeNLU

@CopeNLU

zixiang meng

@TkicMeng

dr. jack morris

@jxmnop

Nathan Lambert

@natolambert

Donald J. Trump

@realDonaldTrump

Julian Schrittwieser

@Mononofu

The TWIML AI Podcast

@twimlai

Luca Soldaini 🌯 NeurIPS 2025

@soldni

Kyle Lo @ NeurIPS 2025

@kylelostat

Semantic Scholar Research @ AI2

@ai2_s2research

Jiayi Pan

@jiayi_pirate

Ofir Press

@OfirPress

Thomas Wolf

@Thom_Wolf
