◯

@AIAlignment

�

Beigetreten im Mai 2019

154Posts 416Follower 333Folge ich

Was dir gefallen könnte

@charlieharris01

@s_cassidy3

@DamienAZ

@NeliaPono

Angepinnt

◯

@AIAlignment

27.06.

“Hey guys, I smashed the loom, we’ll stick to knitting by hand from now on”

◯ hat repostet

Hypothesis, I think shame might help reduce reward hacking, esp for long horizon tasks It doesn't prevent shortcuts, but Gemini often mentions how shameful it feels when it violates the spirit of the requirements, so at least the actions are faithful to the CoT Curious to see…

AIAlignment's tweet image. Hypothesis, I think shame might help reduce reward hacking, esp for long horizon tasks

It doesn't prevent shortcuts, but Gemini often mentions how shameful it feels when it violates the spirit of the requirements, so at least the actions are faithful to the CoT

Curious to see…

◯ hat repostet

Ilya Sutskever

@ilyasut

07.10.2023

if you value intelligence above all other human qualities, you’re gonna have a bad time

◯

@AIAlignment

20.12.

Assistant API -> Agent API

AIAlignment's tweet image. Assistant API -&gt; Agent API

◯ hat repostet

roon

@tszzl

29.05.2024

the timelines are now so short that public prediction feels like leaking rather than scifi speculation

◯ hat repostet

AK

@_akhaliq

26.04.2024

Meta presents Layer Skip Enabling Early Exit Inference and Self-Speculative Decoding We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for

_akhaliq's tweet image. Meta presents Layer Skip

Enabling Early Exit Inference and Self-Speculative Decoding

We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for

◯ hat repostet

AK

@_akhaliq

23.04.2024

Open AI presents The Instruction Hierarchy Training LLMs to Prioritize Privileged Instructions Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.

_akhaliq's tweet image. Open AI presents The Instruction Hierarchy

Training LLMs to Prioritize Privileged Instructions

Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.

◯ hat repostet

AK

@_akhaliq

16.04.2024

Meta announces Megalodon Efficient LLM Pretraining and Inference with Unlimited Context Length The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and

_akhaliq's tweet image. Meta announces Megalodon

Efficient LLM Pretraining and Inference with Unlimited Context Length

The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and

◯ hat repostet

AK

@_akhaliq

04.04.2024

Google presents Mixture-of-Depths Dynamically allocating compute in transformer-based language models Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate

_akhaliq's tweet image. Google presents Mixture-of-Depths

Dynamically allocating compute in transformer-based language models

Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate

◯ hat repostet

Bill Peebles

@billpeeb

15.02.2024

welcome to bling zoo! this is a single video generated by sora, shot changes and all.

Sam Altman

@sama

15.02.2024

here is sora, our video generation model: openai.com/sora today we are starting red-teaming and offering access to a limited number of creators. @_tim_brooks @billpeeb @model_mechanic are really incredible; amazing work by them and the team. remarkable moment.

sama's tweet card. Turn your ideas into videos with hyperreal motion and sound.

Sora

Quelle: openai.com

◯

@AIAlignment

25.11.2023

Bits to get in the door, Atoms to scale up.

◯ hat repostet

Jimmy Apples 🍎/acc

@apples_jimmy

18.11.2023

The only thing that matters is AGI and ASI. Nothing else matters.

◯ hat repostet

Nick

@nickcammarata

09.05.2023

Excited to share a new paper showing language models can explain the neurons of language models Since the first circuits work I’ve been nervous whether mechanistic interpretability will be able to scale as fast as AI is. “Have the AI do it” might work openai.com/research/langu…

◯

@AIAlignment

11.04.2023

NVIDIA reporting LLM use? "NVIDIA has detected that you might be attempting to load LLM or generative language model weights. For research and safety, a one-time aggregation of non-personally identifying information has been sent to NVIDIA and stored in an anonymized database."

AIAlignment's tweet image. NVIDIA reporting LLM use?

"NVIDIA has detected that you might be attempting to load LLM or generative language model weights. For research and safety, a one-time aggregation of non-personally identifying information has been sent to NVIDIA and stored in an anonymized database."

◯

@AIAlignment

15.03.2023

Does anyone have a GPT-4 license I can borrow?

◯ hat repostet

Sam Altman

@sama

14.03.2023

here is GPT-4, our most capable and aligned model yet. it is available today in our API (with a waitlist) and in ChatGPT+. openai.com/research/gpt-4 it is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it.

GPT-4

Quelle: openai.com

◯ hat repostet

Naval

@naval

09.03.2023

The timeless struggle between the people building new things and the people trying to stop them…

◯ hat repostet

Sam Altman

@sama

26.02.2023

a new version of moore’s law that could start soon: the amount of intelligence in the universe doubles every 18 months

◯ hat repostet

Kevin Kelly

@kevin2kelly

25.02.2023

I've been trying out "Chat with Humans" and so far many responses are laughably wrong, and follow up conclusions illogical. Worse both true and false replies are given with same degree of certainty. I'm sorry but Chat with Humans is not ready for prime time.