linbo_pythoner's profile picture.

CrazyTensor

@linbo_pythoner

CrazyTensor gönderiyi yeniden yayınladı

13/ A final tip: probably the most important thing to get great results out of Claude Code -- give Claude a way to verify its work. If Claude has that feedback loop, it will 2-3x the quality of the final result. Claude tests every single change I land to claude.ai/code


I learned about agentic ai from the 'Agentic AI' mooc course agenticai-learning.org/f25 it completely reframed my understanding of where AI is heading. We are witnessing a massive paradigm shift: moving from Human Aligned Models to Environment Feedback Aligned Models.


CrazyTensor gönderiyi yeniden yayınladı

Last month, Google dropped something interesting: five AI Agent papers released across five consecutive days, one per day, each digging into a different part of how agents should be built, evaluated, secured, and deployed. No big splash, just a steady rollout of more than 250…

techNmak's tweet image. Last month, Google dropped something interesting:
five AI Agent papers released across five consecutive days, one per day, each digging into a different part of how agents should be built, evaluated, secured, and deployed.

No big splash, just a steady rollout of more than 250…

CrazyTensor gönderiyi yeniden yayınladı

Make the most of your weekend. Don't sleep on this. Stanford's Autumn 2025 Transformers & LLMs course. 7 lectures. Free. While others scroll, you could understand how Flash Attention achieves 3x speedup, how LoRA cuts fine-tuning costs by 90%, and how MoE makes models…

techNmak's tweet image. Make the most of your weekend.

Don't sleep on this.

Stanford's Autumn 2025 Transformers & LLMs course. 7 lectures. Free.

While others scroll, you could understand how Flash Attention achieves 3x speedup, how LoRA cuts fine-tuning costs by 90%, and how MoE makes models…

CrazyTensor gönderiyi yeniden yayınladı

最近我们团队跟 SGLang 社区给 slime 贡献了 KIMI K2 RL 的代码。之前我们 Multi-Agent 强化学习方案 MrlX 就是基于 slime 做的。我们特别喜欢 slime 的设计:rollout 代码跟训练引擎完全解耦,Infra同学在升级框架的时候,我们 DeepResearch Agent 的训练代码完全无感。欢迎大家尝试用 slime 对 KIMI…

Ant AQ-Team @AQ_MedAI @TheInclusionAI and SGLang RL Team @sgl_project just helped land Kimi-K2-Instruct RL on slime — fully wired up and running on 256× H20 141GB 🚀 Huge shout-out to @yngao016, @menlzy, @Yonah_x from AQ Team and @Ji_Li_233, @Yefei_RL from the SGLang RL Team for…



CrazyTensor gönderiyi yeniden yayınladı

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built…

Kimi_Moonshot's tweet image. 🚀 Hello, Kimi K2 Thinking!
The Open-Source Thinking Agent Model is here.

🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%)
🔹 Executes up to 200 – 300 sequential tool calls without human interference
🔹 Excels in reasoning, agentic search, and coding
🔹 256K context window

Built…

CrazyTensor gönderiyi yeniden yayınladı

Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

eliebakouch's tweet image. Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably

huggingface.co/spaces/Hugging…

CrazyTensor gönderiyi yeniden yayınladı

《Agentic Design Patterns》中文翻译版 github.com/ginobefun/agen…

geekbb's tweet image. 《Agentic Design Patterns》中文翻译版
github.com/ginobefun/agen…

CrazyTensor gönderiyi yeniden yayınladı

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Breaks down SFT dataset demonstrations into a sequence of actions, generate internal reasoning before each action, reward based on similarity of model's actions and expert actions. Experiments…

iScienceLuvr's tweet image. Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Breaks down SFT dataset demonstrations into a sequence of actions, generate internal reasoning before each action, reward based on similarity of model's actions and expert actions. Experiments…

CrazyTensor gönderiyi yeniden yayınladı

🚨 This might be the biggest leap in AI agents since ReAct. Researchers just dropped DeepAgent a reasoning model that can think, discover tools, and act completely on its own. No pre-scripted workflows. No fixed tool lists. Just pure autonomous reasoning. It introduces…

rryssf_'s tweet image. 🚨 This might be the biggest leap in AI agents since ReAct.

Researchers just dropped DeepAgent a reasoning model that can think, discover tools, and act completely on its own.

No pre-scripted workflows. No fixed tool lists. Just pure autonomous reasoning.

It introduces…

CrazyTensor gönderiyi yeniden yayınladı

1/4 Following up on our launch of Tongyi DeepResearch: We're now releasing the full technical report! Dive deep into the technology and insights behind our 30B (A3B) open-source web agent that achieves SOTA performance: 32.9 on Humanity's Last Exam, 43.4 on BrowseComp, and 46.7…

Ali_TongyiLab's tweet image. 1/4 Following up on our launch of Tongyi DeepResearch: We're now releasing the full technical report! Dive deep into the technology and insights behind our 30B (A3B) open-source web agent that achieves SOTA performance: 32.9 on Humanity's Last Exam, 43.4 on BrowseComp, and 46.7…

CrazyTensor gönderiyi yeniden yayınladı

Introducing Multi-Agent Evolve 🧠 A new paradigm beyond RLHF and RLVR: More compute → closer to AGI No need for expensive data or handcrafted rewards We show that an LLM can self-evolve — improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL — all…

youjiaxuan's tweet image. Introducing Multi-Agent Evolve 🧠

A new paradigm beyond RLHF and RLVR:
More compute → closer to AGI
No need for expensive data or handcrafted rewards

We show that an LLM can self-evolve — improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL — all…
youjiaxuan's tweet image. Introducing Multi-Agent Evolve 🧠

A new paradigm beyond RLHF and RLVR:
More compute → closer to AGI
No need for expensive data or handcrafted rewards

We show that an LLM can self-evolve — improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL — all…
youjiaxuan's tweet image. Introducing Multi-Agent Evolve 🧠

A new paradigm beyond RLHF and RLVR:
More compute → closer to AGI
No need for expensive data or handcrafted rewards

We show that an LLM can self-evolve — improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL — all…

CrazyTensor gönderiyi yeniden yayınladı

在开发 Agent 应用,当想让它能通过实际运行数据不断学习优化,但是该功能实现起来颇为复杂。 微软技术团队,最近开源了一个叫 “Agent Lightning” 项目,将这个技术门槛大幅降低,轻松为 Agent 加上自我优化能力。…

GitHub_Daily's tweet image. 在开发 Agent 应用,当想让它能通过实际运行数据不断学习优化,但是该功能实现起来颇为复杂。

微软技术团队,最近开源了一个叫 “Agent Lightning” 项目,将这个技术门槛大幅降低,轻松为 Agent 加上自我优化能力。…

CrazyTensor gönderiyi yeniden yayınladı

This survey systematizes RL methods for agentic AI, covering data, training, evaluation, and practical guidance. Read the full paper on the Hugging Face Hub: huggingface.co/papers/2509.06… Explore the code & resources: github.com/wenjunli-0/dee…


CrazyTensor gönderiyi yeniden yayınladı

Agentic Context Engineering Great paper on agentic context engineering. The recipe: Treat your system prompts and agent memory as a living playbook. Log trajectories, reflect to extract actionable bullets (strategies, tool schemas, failure modes), then merge as append-only…

omarsar0's tweet image. Agentic Context Engineering

Great paper on agentic context engineering.

The recipe:

Treat your system prompts and agent memory as a living playbook.

Log trajectories, reflect to extract actionable bullets (strategies, tool schemas, failure modes), then merge as append-only…

CrazyTensor gönderiyi yeniden yayınladı

Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐 It builds on Gemini’s visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster…

GoogleDeepMind's tweet image. Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐

It builds on Gemini’s visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster…

CrazyTensor gönderiyi yeniden yayınladı

在做投资分析或市场研究,我们需要从不同网站收集股价、财报、新闻等信息,来回查找颇为耗时。 现在只需要给 AI 助手装一个 Financial Datasets MCP 服务器,即可在同一对话中直接获取相关实时数据。 比如查询公司的损益表、资产负债表、现金流量表,还能获取股票价格、市场新闻等等信息。…

GitHub_Daily's tweet image. 在做投资分析或市场研究,我们需要从不同网站收集股价、财报、新闻等信息,来回查找颇为耗时。

现在只需要给 AI 助手装一个 Financial Datasets MCP 服务器,即可在同一对话中直接获取相关实时数据。

比如查询公司的损益表、资产负债表、现金流量表,还能获取股票价格、市场新闻等等信息。…

CrazyTensor gönderiyi yeniden yayınladı

我的一人公司,再添一名数字员工——B站+YouTube的运营专员(Claude Code+Chrome Devtools MCP) 事情这样,为你提升视频的互动率,我比较常用的一个内容运营策略是在做完视频后,再做一份图文教程或视频相关的素材,让观众留言特点关键词获取。…


CrazyTensor gönderiyi yeniden yayınladı

最近在 GitHub 上发现了一个新项目 —— SQLBot,刚开源几天就已经收获 1.5K Star,堪称数据库分析神器。 为什么值得关注? 无需写 SQL:直接用自然语言提问,比如“查一下上个月新增用户”,SQLBot 就会自动生成 SQL 并返回结果。 多数据库支持:兼容 MySQL、SQL Server、ClickHouse、RedShift,甚至…

sitinme's tweet image. 最近在 GitHub 上发现了一个新项目 —— SQLBot,刚开源几天就已经收获 1.5K Star,堪称数据库分析神器。

为什么值得关注?

无需写 SQL:直接用自然语言提问,比如“查一下上个月新增用户”,SQLBot 就会自动生成 SQL 并返回结果。

多数据库支持:兼容 MySQL、SQL Server、ClickHouse、RedShift,甚至…

Loading...

Something went wrong.


Something went wrong.