CrazyTensor

@linbo_pythoner

Shanghai, China

Joined September 2014

1K Posts, 48 Followers, 117 Following

CrazyTensor reposted

Boris Cherny

@bcherny

Jan 2

13/ A final tip: probably the most important thing to get great results out of Claude Code -- give Claude a way to verify its work. If Claude has that feedback loop, it will 2-3x the quality of the final result. Claude tests every single change I land to claude.ai/code…

Claude

Source: claude.ai

CrazyTensor

@linbo_pythoner

Dec 14

I learned about agentic AI from the 'Agentic AI' MOOC (agenticai-learning.org/f25). It completely reframed my understanding of where AI is heading. We are witnessing a massive paradigm shift: moving from human-aligned models to environment-feedback-aligned models.

Agentic AI MOOC, Fall 2025

Source: agenticai-learning.org

CrazyTensor reposted

Tech with Mak

@techNmak

Dec 9

Last month, Google dropped something interesting: five AI Agent papers released across five consecutive days, one per day, each digging into a different part of how agents should be built, evaluated, secured, and deployed. No big splash, just a steady rollout of more than 250…

CrazyTensor reposted

Tech with Mak

@techNmak

Nov 22

Make the most of your weekend. Don't sleep on this. Stanford's Autumn 2025 Transformers & LLMs course. 7 lectures. Free. While others scroll, you could understand how Flash Attention achieves 3x speedup, how LoRA cuts fine-tuning costs by 90%, and how MoE makes models…

CrazyTensor reposted

汉松

@Yonah_x

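Nov 8

Recently our team, together with the SGLang community, contributed KIMI K2 RL code to slime. Our earlier multi-agent reinforcement learning solution, MrlX, was also built on slime. We especially like slime's design: the rollout code is fully decoupled from the training engine, so when our infra colleagues upgrade the framework, the training code for our DeepResearch Agent is completely unaffected. Everyone is welcome to try slime on KIMI…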

slime

@slime_framework

Nov 7

Ant AQ-Team @AQ_MedAI @TheInclusionAI and SGLang RL Team @sgl_project just helped land Kimi-K2-Instruct RL on slime — fully wired up and running on 256× H20 141GB 🚀 Huge shout-out to @yngao016, @menlzy, @Yonah_x from AQ Team and @Ji_Li_233, @Yefei_RL from the SGLang RL Team for…

As the title says, I ran it for 40 steps, and the raw_rewards are shown in the following figure. This work was done in collaboration with @GeLee-Q @yefei12 and @yzlnew

Add kimi-k2-instruct running script by Gao016 · Pull Request #694 · THUDM/slime

Source: github.com

CrazyTensor reposted

Kimi.ai

@Kimi_Moonshot

Nov 6

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built…

CrazyTensor reposted

elie

@eliebakouch

Oct 30

Training LLMs end to end is hard. Very excited to share our new blog (book?) that covers the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

CrazyTensor reposted

Geek

@geekbb

Oct 30

Chinese translation of "Agentic Design Patterns": github.com/ginobefun/agen…

CrazyTensor reposted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

Oct 31

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Breaks down SFT dataset demonstrations into a sequence of actions, generates internal reasoning before each action, and rewards the model based on the similarity between its actions and the expert's actions. Experiments…
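
The reward idea in the post above (scoring each model action by how similar it is to the expert's action at the same step) can be sketched roughly as follows. This is only an illustration, not the paper's implementation: the post does not say which similarity measure is used, so Python's `difflib.SequenceMatcher` is an assumption here, and `step_reward` and `stepwise_rewards` are hypothetical names.

```python
from difflib import SequenceMatcher


def step_reward(model_action: str, expert_action: str) -> float:
    """Similarity in [0, 1] between one model action and the expert action.

    Assumption: character-level sequence similarity stands in for whatever
    metric the paper actually uses.
    """
    return SequenceMatcher(None, model_action, expert_action).ratio()


def stepwise_rewards(model_actions: list[str], expert_actions: list[str]) -> list[float]:
    """Return one reward per expert step.

    A step the model never produced is compared against the empty string,
    which scores 0.0 for any non-empty expert action.
    """
    rewards = []
    for i, expert in enumerate(expert_actions):
        model = model_actions[i] if i < len(model_actions) else ""
        rewards.append(step_reward(model, expert))
    return rewards
```

For example, `stepwise_rewards(["x = 1"], ["x = 1", "y = 2"])` gives a full reward for the first step and none for the missing second step, so feedback arrives per action rather than only at the end of the trajectory.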

CrazyTensor reposted

Robert Youssef

@rryssf_

Oct 30

🚨 This might be the biggest leap in AI agents since ReAct. Researchers just dropped DeepAgent, a reasoning model that can think, discover tools, and act completely on its own. No pre-scripted workflows. No fixed tool lists. Just pure autonomous reasoning. It introduces…

CrazyTensor reposted

Tongyi Lab

@Ali_TongyiLab

Oct 29

1/4 Following up on our launch of Tongyi DeepResearch: We're now releasing the full technical report! Dive deep into the technology and insights behind our 30B (A3B) open-source web agent that achieves SOTA performance: 32.9 on Humanity's Last Exam, 43.4 on BrowseComp, and 46.7…

CrazyTensor reposted

Jiaxuan You

@youjiaxuan

Oct 28

Introducing Multi-Agent Evolve 🧠

A new paradigm beyond RLHF and RLVR:
More compute → closer to AGI
No need for expensive data or handcrafted rewards

We show that an LLM can self-evolve, improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL, all…

CrazyTensor reposted

Graham Neubig

@gneubig

Oct 29

And if anyone wants to contribute an additional dataset the instructions are available here: github.com/neulab/agent-d…

agent-data-protocol/CONTRIBUTING.md at main · neulab/agent-data-protocol

Source: github.com

CrazyTensor reposted

GitHubDaily

@GitHub_Daily

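Oct 29

When building agent applications, you may want the agent to keep learning and improving from real runtime data, but that capability is quite complex to implement. A Microsoft engineering team recently open-sourced a project called "Agent Lightning" that greatly lowers this technical barrier, making it easy to add self-optimization to an agent.…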

CrazyTensor reposted

DailyPapers

@HuggingPapers

Sep 10

This survey systematizes RL methods for agentic AI, covering data, training, evaluation, and practical guidance. Read the full paper on the Hugging Face Hub: huggingface.co/papers/2509.06… Explore the code & resources: github.com/wenjunli-0/dee…

GitHub - wenjunli-0/deepresearch-survey: a survey on deep research

Source: github.com

CrazyTensor reposted

elvis

@omarsar0

Oct 10

Agentic Context Engineering

Great paper on agentic context engineering.

The recipe: Treat your system prompts and agent memory as a living playbook. Log trajectories, reflect to extract actionable bullets (strategies, tool schemas, failure modes), then merge as append-only…

CrazyTensor reposted

Google DeepMind

@GoogleDeepMind

Oct 7

Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐 It builds on Gemini’s visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster…

CrazyTensor reposted

GitHubDaily

@GitHub_Daily

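Oct 3

When doing investment analysis or market research, we have to gather stock prices, financial reports, news, and other information from different websites, and going back and forth is time-consuming. Now you only need to install a Financial Datasets MCP server for your AI assistant to fetch the relevant real-time data directly within a single conversation. For example, you can query a company's income statement, balance sheet, and cash flow statement, and also get stock prices, market news, and more.…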

CrazyTensor reposted

AI进化论-花生

@AlchainHust

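Sep 27

My one-person company has added another digital employee: an operations specialist for Bilibili and YouTube (Claude Code + Chrome Devtools MCP). Here is the story: to boost video engagement, a content operations strategy I often use is to follow each finished video with an illustrated tutorial or other video-related material, and have viewers comment a specific keyword to get it.…

CrazyTensor reposted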

sitin

@sitinme

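Sep 26

I recently found a new project on GitHub: SQLBot. Only a few days after being open-sourced it has already earned 1.5K stars, and it is an excellent tool for database analysis. Why is it worth attention? No SQL required: ask in natural language, for example "look up last month's new users", and SQLBot automatically generates the SQL and returns the results. Multi-database support: compatible with MySQL, SQL Server, ClickHouse, RedShift, and even…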