Search results for #distributedprocessingn

Distributed training on an M4 Mac Mini cluster: we implemented @GoogleDeepMind's DiLoCo on Apple Silicon to train large models with 100-1000x less bandwidth than a DDP baseline. AI is entering a new era where a distributed network of consumer devices can train large models.
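For reference, the bandwidth saving comes from each worker taking many purely local optimizer steps and only synchronizing an averaged "pseudo-gradient" once per round. The sketch below is a minimal illustration of that communication pattern, assuming torch.distributed is already initialized over commodity networking; the function name diloco_round and the inner_steps/outer_lr choices are ours, not code from DeepMind or from the cluster in the tweet.

```python
# Minimal sketch of one DiLoCo-style round (illustrative only).
# Assumes torch.distributed is initialized and every worker holds a full model replica.
import torch
import torch.distributed as dist

def diloco_round(model, inner_opt, data_iter, loss_fn, inner_steps=500, outer_lr=1.0):
    # Snapshot the last globally agreed-upon parameters.
    global_params = [p.detach().clone() for p in model.parameters()]

    # Inner phase: many local optimizer steps, zero communication.
    for _ in range(inner_steps):
        x, y = next(data_iter)
        inner_opt.zero_grad()
        loss_fn(model(x), y).backward()
        inner_opt.step()

    # Outer phase: one all-reduce per round. The "pseudo-gradient" is the parameter
    # drift accumulated over the inner steps, averaged across workers.
    with torch.no_grad():
        for p, g in zip(model.parameters(), global_params):
            delta = g - p                          # local pseudo-gradient
            dist.all_reduce(delta, op=dist.ReduceOp.SUM)
            delta /= dist.get_world_size()
            # The paper applies this with Nesterov momentum; plain SGD keeps the sketch short.
            p.copy_(g - outer_lr * delta)
```

Since DDP all-reduces gradients on every step while this pattern communicates once per inner_steps steps, communication volume drops by roughly that factor, which is where reductions on the order of the quoted 100-1000x come from.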


PluralisHQ: We've reached a major milestone in fully decentralized training: for the first time, we've demonstrated that a large language model can be split and trained across consumer devices connected over the internet, with no loss in speed or performance.
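Splitting a model across devices (rather than replicating it, as in data parallelism) comes down to forwarding activations downstream and boundary gradients upstream. The sketch below shows that exchange for a toy two-stage split using torch.distributed send/recv; it only illustrates the idea, not Pluralis's protocol, and the stage shapes, gloo backend, and names are assumptions.

```python
# Toy two-stage model split trained with activation/gradient exchange (illustrative only).
import torch
import torch.distributed as dist
import torch.nn as nn

def run(rank: int):
    torch.manual_seed(0)
    if rank == 0:
        stage = nn.Linear(32, 64)            # first half of the model
        opt = torch.optim.SGD(stage.parameters(), lr=0.1)
        x = torch.randn(8, 32)
        h = stage(x)
        dist.send(h.detach(), dst=1)         # forward activations to the next stage
        grad_h = torch.empty_like(h)
        dist.recv(grad_h, src=1)             # boundary gradients come back
        opt.zero_grad()
        h.backward(grad_h)
        opt.step()
    else:
        stage = nn.Linear(64, 1)             # second half of the model
        opt = torch.optim.SGD(stage.parameters(), lr=0.1)
        h = torch.empty(8, 64)
        dist.recv(h, src=0)
        h.requires_grad_(True)
        loss = stage(h).square().mean()
        opt.zero_grad()
        loss.backward()
        dist.send(h.grad, dst=0)             # send the boundary gradient upstream
        opt.step()

if __name__ == "__main__":
    dist.init_process_group("gloo")          # e.g. launched via torchrun --nproc_per_node=2
    run(dist.get_rank())
    dist.destroy_process_group()
```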

Continuous diffusion had a good run—now it’s time for Discrete diffusion! Introducing Anchored Posterior Sampling (APS). APS outperforms discrete and continuous baselines in terms of performance & scaling on inverse problems, stylization, and text-guided editing.


distributeai: Requests in. Results out. Rewards distributed. Each inference = an on-chain proof + a GPU payout in $DIS.

Multi-Agent Systems & Swarm Intelligence: DDS (Data Distribution Service) enables multiple robots to communicate and coordinate using a mesh network topology. Each agent shares its state with others to achieve complex group behaviors that no single robot could accomplish alone.…
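DDS itself is a vendor-neutral publish/subscribe standard with several implementations; one accessible way to see the state-sharing pattern described above is through ROS 2, which uses DDS as its underlying middleware. The rclpy sketch below is illustrative only: the topic name swarm/state and the message payload are made-up choices, not part of any standard.

```python
# Each agent publishes its state and subscribes to everyone else's on a shared topic.
import rclpy
from rclpy.node import Node
from std_msgs.msg import String

class SwarmAgent(Node):
    def __init__(self, agent_id: int):
        super().__init__(f'agent_{agent_id}')
        self.agent_id = agent_id
        # Publish own state and listen to the whole swarm on the same DDS-backed topic.
        self.pub = self.create_publisher(String, 'swarm/state', 10)
        self.sub = self.create_subscription(String, 'swarm/state', self.on_peer_state, 10)
        self.timer = self.create_timer(1.0, self.broadcast_state)

    def broadcast_state(self):
        msg = String()
        msg.data = f'agent {self.agent_id} position=(1.0, 2.0) battery=0.87'
        self.pub.publish(msg)

    def on_peer_state(self, msg: String):
        # Group behaviors (formation, coverage, consensus) would be computed here
        # from the aggregated peer states.
        self.get_logger().info(f'received peer state: {msg.data}')

def main():
    rclpy.init()
    rclpy.spin(SwarmAgent(agent_id=0))

if __name__ == '__main__':
    main()
```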


The growth of AI depends on distributed power. Millions of personal devices are starting to act like miniature data centers. Every connected GPU moves us closer to a networked intelligence that belongs to everyone. You ready for this?


Introducing Paris - the world's first decentralized-trained open-weight diffusion model. We named it Paris after the city that has always been a refuge for those creating without permission. Paris is open for research and commercial use.


MatharyCharles: We just put out a key step for making distributed training work at larger and larger models: Scaling Laws for DiLoCo. TL;DR: We can do LLM training across datacenters in a way that scales incredibly well to larger and larger models!

Introducing Parallax, the first fully distributed inference and serving engine for large language models. Try it now: chat.gradient.network 🧵


petereliaskraft: Scalability! But at what cost? This paper is an absolute classic because it explores the underappreciated tradeoffs of distributing systems. It asks about the COST of distributed systems: the Configuration that Outperforms a Single Thread. The question is, how many cores does a…
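The paper's measurement hinges on having a competent single-threaded baseline to compare against. The sketch below is a toy version of such a baseline; the paper's own baselines are tight Rust implementations of graph algorithms such as PageRank, so this only illustrates the shape of the comparison, not the paper's code.

```python
# Toy single-threaded PageRank over an edge list (illustrative baseline only).
def pagerank(edges, num_nodes, iters=20, alpha=0.85):
    """edges: list of (src, dst) pairs; returns one rank per node."""
    out_degree = [0] * num_nodes
    for s, _ in edges:
        out_degree[s] += 1

    ranks = [1.0 / num_nodes] * num_nodes
    for _ in range(iters):
        contrib = [0.0] * num_nodes
        for s, d in edges:
            contrib[d] += ranks[s] / out_degree[s]
        ranks = [(1 - alpha) / num_nodes + alpha * c for c in contrib]
    return ranks
```

The COST question is then empirical: how many cores (and how much coordination overhead) does a distributed framework need before it outperforms a loop like this running on a single thread over the same graph?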

NousResearch: What if you could use all the computing power in the world to train a shared, open source AI model? Preliminary report: github.com/NousResearch/D… Nous Research is proud to release a preliminary report on DisTrO (Distributed Training Over-the-Internet), a family of…

mrsiipa: understanding DDP is easy: these two functions are technically all you need to implement a Distributed Data Parallel (DDP) model similar to PyTorch's, from scratch.
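The two functions aren't shown here, but the standard pair is: broadcast parameters from rank 0 once at startup, then all-reduce (average) gradients after every backward pass. The sketch below is our guess at that pair, assuming torch.distributed is already initialized; it is not the tweet author's code.

```python
# A from-scratch data-parallel pair in the spirit of the tweet (illustrative only).
import torch.distributed as dist

def broadcast_parameters(model, src=0):
    """Start every rank from rank-0's weights, as torch DDP does at construction time."""
    for p in model.parameters():
        dist.broadcast(p.detach(), src=src)

def allreduce_gradients(model):
    """Call after loss.backward() and before optimizer.step() to average gradients."""
    world_size = dist.get_world_size()
    for p in model.parameters():
        if p.grad is not None:
            dist.all_reduce(p.grad, op=dist.ReduceOp.SUM)
            p.grad /= world_size
```

PyTorch's real DistributedDataParallel adds gradient bucketing and overlaps communication with the backward pass, but the data-parallel semantics are exactly these two steps: broadcast once, then backward → allreduce_gradients → optimizer.step() every iteration.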

bibryam: 5 Distributed Concepts Every Developer Should Know swequiz.com/blog/5-core-di…

psdnai: We’re deep in R&D on Poseidon Subnets – specialized data pipelines that coordinate how AI domains collect, curate, and license real-world data. Think of them as high-throughput lanes in the world’s first decentralized data highway. More to come 🔱

derekelewis: You probably wouldn't know it from this top output, but I have an FSDP training run going on the DGX Spark cluster. No wasted CPU time spent processing interrupts or copying between buffers. RDMA networking is a wonderful thing.
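For context, a minimal FSDP setup looks like the sketch below. This is not the run from the tweet; it assumes a torchrun launch with one GPU per process and an NCCL build that can use an RDMA transport (InfiniBand/RoCE) when one is available, and the model, sizes, and hyperparameters are placeholders.

```python
# Minimal FSDP training loop sketch (illustrative only).
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def main():
    dist.init_process_group("nccl")                # NCCL selects an RDMA transport if available
    local_rank = dist.get_rank() % torch.cuda.device_count()
    torch.cuda.set_device(local_rank)

    # Stand-in model; FSDP shards its parameters, gradients, and optimizer state across ranks.
    model = nn.Sequential(nn.Linear(4096, 4096), nn.GELU(), nn.Linear(4096, 4096)).cuda()
    model = FSDP(model)
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)  # create the optimizer after wrapping

    for _ in range(10):
        x = torch.randn(8, 4096, device="cuda")
        loss = model(x).square().mean()
        loss.backward()                            # all-gathers/reduce-scatters run inside FSDP hooks
        opt.step()
        opt.zero_grad()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```

The low host-CPU footprint described above is consistent with RDMA semantics: the NIC moves shard data without per-packet kernel interrupts or intermediate host buffer copies, so top shows little CPU work even while the interconnect is busy.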

No results found for "#distributedprocessingn".