
Ray Strode

@halfartificial

@halfline alt account for ai related discourse

Ray Strode reposted

chinese bros back at it again
- train a Decoder-Only transformer < 3 hrs on a 3090
- fully studded with LoRA, DPO, SFT.
- well documented training
- vision, moe, and other goodies also avail.

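For the curious, the LoRA part of that recipe boils down to freezing the pretrained weights and learning a small low-rank update beside them. A minimal PyTorch sketch, not the repo's actual code (class name, rank, and scaling are illustrative):

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen base linear layer plus a trainable low-rank update (illustrative sketch)."""
        def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
            super().__init__()
            self.base = base
            for p in self.base.parameters():
                p.requires_grad_(False)  # pretrained weights stay frozen
            self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
            self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init: no-op at start
            self.scale = alpha / rank

        def forward(self, x):
            # y = W x + scale * B A x; only A and B receive gradients
            return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scale

Fine-tuning then only touches the tiny A/B matrices, which is a big part of why it fits on a single 3090.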

Ray Strode reposted

Knowledge Graphs give LLMs the context they need to understand your code better.

This paper presents a novel approach to improve software repository question-answering by combining LLMs with knowledge graphs. The research demonstrates how knowledge graphs can enhance LLMs'…

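The gist, as I read it (my own toy illustration with networkx, not the paper's pipeline): model the repo as a graph of entities and relations, then serialize the relevant neighborhood into the prompt so the LLM answers from structure rather than raw text:

    import networkx as nx

    # Toy repo graph: functions as nodes, "calls" edges as relations
    g = nx.DiGraph()
    g.add_node("parse_config", kind="function", file="config.py")
    g.add_node("load_yaml", kind="function", file="config.py")
    g.add_edge("parse_config", "load_yaml", relation="calls")

    def context_for(entity: str, graph: nx.DiGraph) -> str:
        """Serialize an entity's outgoing edges as plain-text facts for the prompt."""
        return "\n".join(f"{entity} {d['relation']} {dst}"
                         for _, dst, d in graph.out_edges(entity, data=True))

    prompt = (f"Repo facts:\n{context_for('parse_config', g)}\n\n"
              "Q: what does parse_config depend on?")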

bye bye byte pair encoding. you've served us well, but your time has come.

META JUST KILLED TOKENIZATION !!!

A few hours ago they released "Byte Latent Transformer". A tokenizer-free architecture that dynamically encodes Bytes into Patches and achieves better inference efficiency and robustness!

(I was just talking about how we need dynamic…

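The core trick, as I understand the paper (toy sketch below, definitely not Meta's code): instead of a fixed tokenizer, group bytes into variable-length patches, cutting a new patch wherever next-byte entropy spikes. Here a bigram count model stands in for BLT's small byte-level LM:

    import math
    from collections import Counter, defaultdict

    def bigram_entropies(data: bytes):
        """Entropy of the next-byte distribution after each byte, from bigram counts."""
        follow = defaultdict(Counter)
        for a, b in zip(data, data[1:]):
            follow[a][b] += 1
        ents = {}
        for a, ctr in follow.items():
            total = sum(ctr.values())
            ents[a] = -sum(c / total * math.log2(c / total) for c in ctr.values())
        return ents

    def patch(data: bytes, threshold: float = 2.0):
        """Start a new patch wherever the model is uncertain about what comes next."""
        ents = bigram_entropies(data)
        patches, start = [], 0
        for i in range(1, len(data)):
            if ents.get(data[i - 1], 0.0) > threshold:
                patches.append(data[start:i])
                start = i
        patches.append(data[start:])
        return patches

    print(patch(b"the cat sat on the mat. the cat sat on the mat."))

Predictable stretches end up in long, cheap patches; surprising ones get more compute.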


Ray Strode reposted

canvas is now available to all chatgpt users, and can execute code! more importantly it can also still emojify your writing.


Ray Strode reposted

Open-sourced local LLM based RAG, chatting with your documents with open-source LLMs. ✨

It trended at number 1 on GitHub for quite some time.

And a clean & customizable RAG UI for chatting with your documents.

→ Open-source RAG UI for document QA
→ Supports local LLMs and…

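The whole local-RAG loop is small enough to sketch in a few lines (my own minimal version, not the project's code; assumes an Ollama server on localhost:11434 with a model like llama3 pulled):

    import requests
    from sentence_transformers import SentenceTransformer, util

    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    chunks = ["Invoices live in /data/invoices.", "Backups run nightly at 2am."]
    chunk_vecs = embedder.encode(chunks, convert_to_tensor=True)

    question = "When do backups run?"
    scores = util.cos_sim(embedder.encode(question, convert_to_tensor=True), chunk_vecs)[0]
    context = chunks[int(scores.argmax())]  # retrieve the best-matching chunk

    resp = requests.post("http://localhost:11434/api/generate", json={
        "model": "llama3",
        "prompt": f"Context: {context}\n\nQuestion: {question}\nAnswer:",
        "stream": False,
    })
    print(resp.json()["response"])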

Ray Strode reposted

As R&D staff @answerdotai, I work a lot on boosting productivity with AI. A common theme that always comes up is the combination of human+AI. This combination proved to be powerful in our new project ShellSage, which is an AI terminal buddy that learns and teaches with you. A 🧵


Ray Strode reposted

Welcome PaliGemma 2! 🤗

Google released PaliGemma 2, the best vision language model family yet, which comes in various sizes: 3B, 10B, 28B, based on Gemma 2 and SigLIP, with transformers support on day 0 🎁

Saying this model is amazing would be an understatement, keep reading ✨

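Rough usage sketch via transformers (the checkpoint id, prompt format, and image URL below are my assumptions; check the model cards, and note the image-token conventions have shifted between transformers versions):

    import requests
    from PIL import Image
    from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

    model_id = "google/paligemma2-3b-pt-224"  # assumed id, verify on the Hub
    model = PaliGemmaForConditionalGeneration.from_pretrained(model_id, device_map="auto")
    processor = AutoProcessor.from_pretrained(model_id)

    image = Image.open(requests.get("https://example.com/cat.jpg", stream=True).raw)  # placeholder URL
    inputs = processor(text="<image>caption en", images=image, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=40)
    print(processor.decode(out[0], skip_special_tokens=True))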

Ray Strode reposted

The First Globally Trained 10B Parameter Model is released. 👏👏

INTELLECT-1 is a groundbreaking 10B parameter LLM trained collaboratively across multiple continents using distributed computing, representing a 10x scale-up from previous research.

→ The model achieved…


Releasing INTELLECT-1: We're open-sourcing the first decentralized-trained 10B model:
- INTELLECT-1 base model & intermediate checkpoints
- Pre-training dataset
- Post-trained instruct models by @arcee_ai
- PRIME training framework
- Technical paper with all details
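PRIME builds on the DiLoCo idea: workers take many local optimizer steps and only exchange accumulated weight deltas over the slow links. A heavily simplified outer loop (my sketch, not the PRIME framework; train_locally is a hypothetical helper):

    import copy
    import torch

    def outer_round(global_model, workers, local_steps=500, outer_lr=0.7):
        """One communication round: local training, then averaged delta applied globally."""
        deltas = []
        for worker in workers:
            local = copy.deepcopy(global_model)
            worker.train_locally(local, steps=local_steps)  # hypothetical: many inner steps
            deltas.append([lp.data - gp.data
                           for lp, gp in zip(local.parameters(), global_model.parameters())])
        with torch.no_grad():
            for i, gp in enumerate(global_model.parameters()):
                avg = torch.stack([d[i] for d in deltas]).mean(dim=0)
                gp.add_(outer_lr * avg)  # outer step on the averaged pseudo-gradient

Communication drops from every step to every few hundred steps, which is what makes cross-continent training tolerable.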



Ray Strode reposted

ollama run qwq

🤯 an experimental 32B model by the Qwen team that is competitive with o1-mini and o1-preview in some cases.

ollama.com/library/qwq

Note: This is the pronunciation of QwQ: /kwju:/, similar to the word "quill"

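Once pulled, it is also reachable through Ollama's local REST API (default server on localhost:11434):

    import requests

    resp = requests.post("http://localhost:11434/api/chat", json={
        "model": "qwq",
        "messages": [{"role": "user", "content": "How many r's are in strawberry?"}],
        "stream": False,  # get one JSON object instead of a stream
    })
    print(resp.json()["message"]["content"])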

Ray Strode reposted

Adding rule-based guidance doubles RAG's performance in document retrieval and answer generation.

Basically, RAG gets a proper manual on how to use its knowledge.

It's like giving RAG a GPS instead of letting it wander around blindly.

🎯 Original Problem: Current…

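In practice the "manual" amounts to explicit rules injected ahead of the retrieved passages. A sketch of the shape of it (my reading, not the paper's exact prompts):

    RULES = [
        "Only answer from the retrieved passages; say 'unknown' otherwise.",
        "Prefer passages whose section title matches the question's topic.",
        "Cite the passage id for every claim.",
    ]

    def build_prompt(question: str, passages: list[str]) -> str:
        """Prepend the rule manual, then the numbered passages, then the question."""
        rules = "\n".join(f"- {r}" for r in RULES)
        docs = "\n".join(f"[{i}] {p}" for i, p in enumerate(passages))
        return f"Rules:\n{rules}\n\nPassages:\n{docs}\n\nQuestion: {question}\nAnswer:"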

Ray Strode reposted

2:4 Sparsity + @AIatMeta Llama-3.1: At @neuralmagic, we've developed a recipe to produce very competitive sparse LLMs, and we are starting by open-sourcing the first one: Sparse-Llama-3.1-8B-2of4. We also show how to leverage it for blazingly fast inference in @vllm_project.

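What "2:4" means mechanically: in every group of 4 consecutive weights, at most 2 are nonzero, a pattern NVIDIA's sparse tensor cores can accelerate. A naive magnitude-pruning sketch (not Neural Magic's actual recipe, which recovers accuracy with retraining):

    import torch

    def prune_2of4(w: torch.Tensor) -> torch.Tensor:
        """Zero the 2 smallest-magnitude weights in every group of 4."""
        flat = w.reshape(-1, 4)
        _, idx = flat.abs().topk(2, dim=1, largest=False)
        mask = torch.ones_like(flat)
        mask.scatter_(1, idx, 0.0)
        return (flat * mask).reshape(w.shape)

    w = torch.randn(8, 8)
    print(prune_2of4(w))  # every group of 4 now has exactly 2 zeros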

Ray Strode reposted

Introducing the Model Context Protocol (MCP)

An open standard we've been working on at Anthropic that solves a core challenge with LLM apps - connecting them to your data.

No more building custom integrations for every data source. MCP provides one protocol to connect them all:

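At its simplest, an MCP server is a process that advertises tools over stdio. A rough sketch with the MCP Python SDK (the tool and data here are made up, and the SDK surface has changed across versions, so treat this as a shape, not gospel):

    from mcp.server.fastmcp import FastMCP

    mcp = FastMCP("notes")

    @mcp.tool()
    def search_notes(query: str) -> str:
        """Search my local notes for a query string (hypothetical data source)."""
        notes = {"standup": "Standup is at 9:30 on Tuesdays."}
        return notes.get(query, "no match")

    if __name__ == "__main__":
        mcp.run()  # serve over stdio so an MCP client can connect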

Ray Strode reposted

Hunyuan-Large by Tencent is a 389B param MOE (52B active). It's the largest open-weights MOE. In some benchmarks it exceeds Llama 3.1 405B. With MLX's new 3-bit quant it just barely fits on a single 192GB M2 Ultra! And runs at a very decent >15 toks/sec:
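Rough mlx-lm usage for something like this (the 3-bit checkpoint name below is my guess; check the mlx-community org on Hugging Face for the real id):

    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Hunyuan-Large-3bit")  # assumed repo id
    print(generate(model, tokenizer,
                   prompt="Explain mixture-of-experts in one paragraph.",
                   max_tokens=200))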


Ray Strode reposted

Lightricks just dropped the fastest text-to-video generation model ever. It can generate videos faster than the time it takes to watch them! Code: github.com/Lightricks/LTX…

From AK
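If the diffusers integration works like its other video pipelines, usage looks roughly like this (pipeline class, checkpoint id, and parameters are my best guess; check the model card):

    import torch
    from diffusers import LTXPipeline
    from diffusers.utils import export_to_video

    pipe = LTXPipeline.from_pretrained("Lightricks/LTX-Video",
                                       torch_dtype=torch.bfloat16).to("cuda")
    video = pipe(prompt="a kayak drifting down a misty river at dawn",
                 num_frames=65).frames[0]
    export_to_video(video, "river.mp4", fps=24)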

llama.cpp/vllm in a container ready to go. cross platform support. models stored in OCI compatible container registries like quay.io

Been working on a new RamaLama project the last couple of months. Goal is to make running AI Models inside of containers super easy. Get your AI Models anywhere. This blog post announces the project today. Try it out. Love to hear your feedback @redhat @openshift @ibm developers.redhat.com/articles/2024/…
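Assuming `ramalama serve` exposes the usual llama.cpp-style OpenAI-compatible endpoint from inside the container (my assumption; port and path may differ, check the docs), talking to it is just:

    import requests

    resp = requests.post("http://localhost:8080/v1/chat/completions", json={
        "model": "granite",  # whichever model ramalama is serving
        "messages": [{"role": "user", "content": "What is an OCI registry?"}],
    })
    print(resp.json()["choices"][0]["message"]["content"])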



"real" and open source reinforcement learning recipe that uses verifiable rewards called tulu is released. Will be used on molmo soon.

I've spent the last two years scouring all available resources on RLHF specifically and post training broadly. Today, with the help of a totally cracked team, we bring you the fruits of that labor — Tülu 3, an entirely open frontier model post training recipe. We beat Llama 3.1…

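The "verifiable rewards" part, boiled down (my sketch, not the Tülu 3 code): the RL reward is a hard programmatic check against ground truth rather than a learned reward model, so it can't be gamed the way a preference model can:

    import re

    def math_reward(completion: str, gold_answer: str) -> float:
        """1.0 if the completion's final answer matches the gold answer, else 0.0."""
        match = re.search(r"answer is\s*(-?\d+(?:\.\d+)?)", completion.lower())
        return 1.0 if match and match.group(1) == gold_answer else 0.0

    print(math_reward("... so the answer is 42", "42"))  # 1.0
    print(math_reward("... probably 41?", "42"))         # 0.0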


as with many things, the devil is in the details, and that includes sparse retrievers like BM25, apparently. Of course, the wider point is to do your own benchmarks and don't rely on feature matrices when choosing an implementation.
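One of those details: BM25 scores depend entirely on how you tokenize. Even lowercasing changes the ranking, which a quick check with the rank_bm25 package shows:

    from rank_bm25 import BM25Okapi

    corpus = ["GNOME Shell startup", "gnome shell crash logs", "systemd boot analysis"]
    query = "gnome shell"

    # raw whitespace split vs. lowercased split give different scores
    for norm in (str.split, lambda s: s.lower().split()):
        bm25 = BM25Okapi([norm(doc) for doc in corpus])
        print(bm25.get_scores(norm(query)))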


chain of thought reasoning exhibited in normally discarded top-k results

Can LLMs reason effectively without prompting? Great paper by @GoogleDeepMind

By considering multiple paths during decoding, LLMs show improved reasoning without special prompts. It reveals LLMs' natural reasoning capabilities.

LLMs can reason better by exploring multiple…

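The decoding trick, roughly as I read the paper (sketch, not the authors' code): branch on the top-k candidates for the first token instead of only the argmax, then greedy-decode each branch; chains of thought often hide in the k > 1 paths:

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")  # stand-in model for the sketch
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    prompt = "Q: I have 3 apples and eat one. How many are left? A:"
    ids = tok(prompt, return_tensors="pt").input_ids

    with torch.no_grad():
        first_logits = model(ids).logits[0, -1]

    for t in torch.topk(first_logits, k=5).indices:
        branch = torch.cat([ids, t.view(1, 1)], dim=1)  # force a different first token
        out = model.generate(branch, max_new_tokens=30, do_sample=False,
                             pad_token_id=tok.eos_token_id)
        print(repr(tok.decode(out[0, ids.shape[1]:])))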


fun little toy script i wrote that uses the granite model to complete a chunk of code github.com/halfline/os-de…

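The same trick in a few lines with transformers instead of the script (my sketch, not the script itself; the checkpoint is one of IBM's public Granite code models):

    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "ibm-granite/granite-3b-code-base"
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

    chunk = "def is_prime(n):\n    if n < 2:\n        return False\n    for i in range("
    ids = tok(chunk, return_tensors="pt").to(model.device)
    out = model.generate(**ids, max_new_tokens=48, do_sample=False)
    print(tok.decode(out[0], skip_special_tokens=True))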
