Blurry Logic

@blurrylogic

I repost stuff I find interesting.

Event Venue

Joined June 2022

Pinned

1 out of 500 questions in HLE (Humanity's Last Exam) by Scale AI is a question with a fully wrong answer (no logical chain leads to that answer; I made up fake math). Any LLM that gets 500/500 is cheating :)

Jeffrey Wang

@wangzjeff

Sep 24

PSA to CS college students - if you labeled data for Scale AI, please don't put it on your resume

Blurry Logic

@blurrylogic

May 21

BABEL FISH!

Justin Hart

@justin_hart

May 20

WILD. Real-time Google Meet translation capabilities.

Blurry Logic

@blurrylogic

May 21

They finally made diffusion llms work at scale

Brendan O'Donoghue

@bodonoghue85

May 20

Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,…

Blurry Logic reposted

Zhihong Shao

@zhs05232838

Apr 30

We just released DeepSeek-Prover V2.
- Solves nearly 90% of miniF2F problems
- Significantly improves the SoTA performance on the PutnamBench
- Achieves a non-trivial pass rate on AIME 24 & 25 problems in their formal version

Github: github.com/deepseek-ai/De…

Blurry Logic reposted

.txt

@dottxtai

Apr 28

We recently came across an interesting paper that helps LLMs be better at handling domain-specific languages like database queries or probabilistic programming languages, using an approach called "grammar prompting". Link + brief thread below.

Blurry Logic reposted

payloadartist

@payloadartist

Apr 23

$64k in #bugbounty for finding basic secrets in predictable places because teams skipped Git 101 and proper .gitignore hygiene. Good on the reporter for cashing in on the perpetual lack of fundamental version control understanding. medium.com/@sharon.brizin…

Blurry Logic reposted

cocktail peanut

@cocktailpeanut

Apr 17

FramePack: Generate Video Forever [NVIDIA ONLY] The creator of ControlNet just dropped a new approach to generating videos in a PROGRESSIVE way--you can generate loooong videos with LOW VRAM (just 6GB) @SUP3RMASS1VE wrote a 1-click launcher to run the Gradio app on your PC!

Hunyuan

@TencentHunyuan

Apr 17

㊗️Congrats on Lvmin Zhang’s (github@lllyasviel) latest project FramePack and thank you for using and recommending HunyuanVideo. 😀So happy to see innovations based on Hunyuan and we would like to see more. ▶️FramePack's Brief Intro and Showcases Attached: FramePack is a…

Blurry Logic reposted

Xiaolong Wang

@xiaolonw

Apr 7

Test-Time Training (TTT) is now on Video! And not just a 5-second video. We can generate a full 1-min video! TTT module is an RNN module that provides an explicit and efficient memory mechanism. It models the hidden state of an RNN with a machine learning model, which is updated…

Blurry Logic reposted

Patrick Wardle

@patrickwardle

Apr 1

objectivebythesea.org/v8/cfp.html 🫣🤗

Link card: #OBTS v8.0: CFP. Submit a talk for #OBTS today! (objectivebythesea.org)

Blurry Logic reposted

Baifeng

@baifeng_shi

Mar 27

Next-gen vision pre-trained models shouldn’t be short-sighted. Humans can easily perceive 10K x 10K resolution. But today’s top vision models—like SigLIP and DINOv2—are still pre-trained at merely hundreds by hundreds of pixels, bottlenecking their real-world usage. Today, we…

Blurry Logic reposted

spike

@spikedoanz

Mar 21

i wrote a new blog. i hope you all enjoy.

Blurry Logic reposted

Ndea

@ndea

Feb 27

Deep Learning architectures usually aren't trained to perform search at test time, leading to sample inefficiency + poor generalization. Latent Program Network (LPN) builds in test-time adaption by learning a latent space that can be searched. @ClementBonnet16 @MattVMacfarlane

Blurry Logic reposted

Anne Ouyang

@anneouyang

Feb 12

New blog post from Nvidia: LLM-generated GPU kernels showing speedups over FlexAttention and achieving 100% numerical correctness on 🌽KernelBench Level 1

Blurry Logic reposted

James Kettle

@albinowax

Feb 12

This is a great infoleak exploit chain targeting YouTube by @brutecat. Love the use of a DoS flaw to make the attack stealthier! brutecat.com/articles/leaki…

Blurry Logic reposted

Lucas Nestler

@_clashluke

Feb 7

* BF16 + Stochastic Rounding doesn't always converge as well as FP32, introducing risk
* Both scaled and unscaled caution can underperform the baseline
* MARS needs more memory and compute and does not affect large-batch training
* Untuned PSGD and SOAP can lead to early…

Blurry Logic reposted

apolinario 🌐

@multimodalart

Feb 7

Boring Reality LoRA just dropped for HunyuanVideo 🏙️🏞️ A fine-tune that leads not to cinematic shots, but to something that could've come out of your phone 📱

Blurry Logic

@blurrylogic

Jan 7

I don't bullshit on the internet service. I provide noise so that your LLM generalizes better

spor

@sporadica

Jan 7

This paper from DeepMind is blowing my mind: “Our findings reveal that models fine-tuned on weaker & cheaper generated data consistently outperform those trained on stronger & more-expensive generated data across multiple benchmarks…”

Blurry Logic reposted

אגי-e/acc

@murage_kibicho

Jan 2

Hackathon update. I built a programming language alongside @deepseek_ai. It's called Recursive Assembly. We emulate GPUs on CPU using finite field arithmetic. I'm working on the docs, then I'll launch tomorrow on my Substack (link in bio) #redbullfund @redbullfuturist