gaoqiang_nlp's profile picture. a third year master student at Wuhan University, focusing on the natural language processing area. I'm actively looking for a PhD position for 2025Fall.

Qiang Gao

@gaoqiang_nlp

a third year master student at Wuhan University, focusing on the natural language processing area. I'm actively looking for a PhD position for 2025Fall.

Qiang Gao รีโพสต์แล้ว

Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months. Policy gradient chapter is coming together. Plugging away at the book every day now. rlhfbook dot com

natolambert's tweet image. Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months. 

Policy gradient chapter is coming together. Plugging away at the book every day now.

rlhfbook dot com

Qiang Gao รีโพสต์แล้ว

The new chatGPT 4o with Canvas system prompt:

jasonkneen's tweet image. The new chatGPT 4o with Canvas system prompt:

Qiang Gao รีโพสต์แล้ว

This is the best use for of a drone I've ever seen 😭


Qiang Gao รีโพสต์แล้ว

don’t worry about it babe it’s just a brand

lawhsw's tweet image. don’t worry about it babe it’s just a brand

Qiang Gao รีโพสต์แล้ว

Congratulations to the new faculty members in @MITEngineering, including Media Lab alum Anna Huang (@huangcza)! Huang will hold a joint appointment in @MITEECS and MIT Music and Theater Arts. news.mit.edu/2024/school-en…


Qiang Gao รีโพสต์แล้ว

The Top ML Papers of the Week (May 27 - June 2): - SimPO - GNN-RAG - Attention as an RNN - Abacus Embeddings - Symbolic Chain-of-Thought - Contextual Position Encoding ...


Qiang Gao รีโพสต์แล้ว

I've been thinking about the many, MANY, DPO spinoff methods we've been seeing recently for rlhf. IPO, D2PO, CPO, ORPO, SPO, sDPO, KTO, DNO... Most claim they're "the best" but doesn't properly compare to related work. What do we do in alignment research? Thread 📚


🎉 Exciting News! 🎉 Just open-sourced my latest project: Llama3-based 8x8b-MoE model! 🚀 Extends llama3-8B-Instruct model with MoE architecture. Check it out & give it a star! github.com/cooper12121/ll…

gaoqiang_nlp's tweet image. 🎉 Exciting News! 🎉 Just open-sourced my latest project: Llama3-based 8x8b-MoE model! 🚀 Extends llama3-8B-Instruct model with MoE architecture. Check it out & give it a star! github.com/cooper12121/ll…

Qiang Gao รีโพสต์แล้ว

If you really want to do something useful in AI, instead of training another tiny llama, pick up this project hazyresearch.stanford.edu/blog/2024-01-1… and train a 1B-parameter multilingual BERT with 32k input size. The code is here github.com/HazyResearch/m2. The data is all over @huggingface. The…


Qiang Gao รีโพสต์แล้ว

To close out 2023, here are 10 of the most interesting AI research advancements we shared on our feed this year —  and where you can find more details on the work. 1️⃣ Segment Anything (SAM) A step toward the first foundation model for image segmentation. Details:…


Qiang Gao รีโพสต์แล้ว

Alibaba releases DreaMoving demo on Hugging Face A Human Video Generation Framework based on Diffusion Models demo: huggingface.co/spaces/jiayong…


Qiang Gao รีโพสต์แล้ว

Key vulnerabilities of GPT-4: 1. Fine-tuning API can remove or diminish safety guardrails, causing the model to produce harmful outputs or assist with dangerous requests 2. Fine-tuning can make the model generate targeted misinformation against public figures 3. Fine-tuning…

IntuitMachine's tweet image. Key vulnerabilities of GPT-4:

1. Fine-tuning API can remove or diminish safety guardrails, causing the model to produce harmful outputs or assist with dangerous requests

2. Fine-tuning can make the model generate targeted misinformation against public figures

3. Fine-tuning…

Qiang Gao รีโพสต์แล้ว

If you're not using AI, you're falling behind.


Qiang Gao รีโพสต์แล้ว

//#つぶやきProcessing int n=999999,p[]=new int[n],i,j; float t=1; void setup(){ size(800,800); for(i=2;i<n;i++)if(p[i]==0)for(j=i+i;j<n;j+=i)p[j]=i; } void draw(){ clear(); stroke(-1); for(i=2;i<n;i++)if(p[i]==0)circle(i*sin(i*t)/99+400,i*cos(i*t)/99+400,2); t+=1e-7; }


Qiang Gao รีโพสต์แล้ว

Foundation models are transforming society: in the past month alone, we've seen a flurry of releases! GPT-4, Claude, PaLM API, Alpaca, Dolly, Jurassic-2, PaLM-E, GPT4All, Cerebras-GPT, OpenFlamingo, ... We built Ecosystem Graphs to track their footprint: crfm.stanford.edu/ecosystem-grap…

RishiBommasani's tweet image. Foundation models are transforming society: in the past month alone, we&apos;ve seen a flurry of releases!

GPT-4, Claude, PaLM API, Alpaca, Dolly, Jurassic-2, PaLM-E, GPT4All, Cerebras-GPT, OpenFlamingo, ...

We built Ecosystem Graphs to track their footprint:
crfm.stanford.edu/ecosystem-grap…

Qiang Gao รีโพสต์แล้ว

huggingface.co/spaces/JavaFXp… you should check this out


Qiang Gao รีโพสต์แล้ว

4 essential books anyone should read: • Machine Learning with PyTorch and Scikit-Learn • Transformers for NLP • Deep Learning with Python • Designing Machine Learning Systems

abacusai's tweet image. 4 essential books anyone should read:

• Machine Learning with PyTorch and Scikit-Learn
• Transformers for NLP
• Deep Learning with Python
• Designing Machine Learning Systems

United States เทรนด์

Loading...

Something went wrong.


Something went wrong.