
Thomas Liang

@_thliang01

Pinned

"Much of the difficulty and helplessness we face is part of random fluctuation; given enough time and patience, a stochastic process will always converge to a steady state that matches the effort invested."


Thomas Liang reposted

The new steam age. This is actually becoming true in many cases. It's possible to do so much more on your own now.

rohanpaul_ai's tweet image.

Thomas Liang reposted

My formulation is slightly different. Agent = 𝐂𝐨𝐧𝐭𝐞𝐱𝐭 𝐄𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐢𝐧𝐠 + 𝐋𝐋𝐌 + 𝐓𝐨𝐨𝐥𝐬. The real differentiation between most agent companies lies in how they build their CE and tools. Those using minimal model-specific hacks can transition seamlessly to new…

Can't agree more. Agent = LLM + tools. That’s Occam’s Razor. To build better agents, you either improve the LLM, or you create better tools. We shouldn't need complex agent frameworks.
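The "Agent = LLM + tools" formulation above can be sketched as a minimal loop. This is a toy illustration with a stubbed model so it runs offline, not any particular framework's API; `stub_llm`, `run_agent`, and the tool registry are all made up for the example.

```python
# Minimal sketch of Agent = Context Engineering + LLM + Tools.
# The "LLM" is a stub that picks a tool by keyword so the example
# runs offline; a real agent would call a model API instead.

def stub_llm(context: str) -> str:
    """Pretend model: emits either a tool call or a final answer."""
    if "2 + 2" in context and "observation" not in context:
        return "CALL calc 2 + 2"
    return "ANSWER 4"

TOOLS = {
    "calc": lambda expr: str(eval(expr)),  # toy calculator tool
}

def run_agent(task: str, llm=stub_llm, max_steps: int = 5) -> str:
    context = f"task: {task}"  # context engineering = what goes here
    for _ in range(max_steps):
        out = llm(context)
        if out.startswith("ANSWER"):
            return out.removeprefix("ANSWER ").strip()
        _, name, args = out.split(" ", 2)
        obs = TOOLS[name](args)             # execute the chosen tool
        context += f"\nobservation: {obs}"  # feed the result back in
    return "gave up"

print(run_agent("what is 2 + 2?"))  # → 4
```

Everything outside the loop — what goes into `context` and what the tools can do — is exactly the "context engineering + tools" part the tweets argue is the real differentiator.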



Thomas Liang reposted

Now I understand @DSPyOSS from this video, and I will attempt to break it down. Basically if you build software that uses LLMs, it helps you manage, test and optimize backend prompts. When a new LLM comes out, you can swap it in and get the new best prompts for your existing…
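The workflow described above — score candidate prompts against an eval set, keep the best, and simply re-run the search when a new model arrives — can be sketched in a few lines. This is a stubbed illustration of the core idea only, not DSPy's actual API; `model_a`, `optimize`, and the dataset are invented for the example.

```python
# Toy illustration of prompt optimization: treat the prompt as a
# tunable parameter, score candidates on a small eval set, keep the
# best. Swapping in a new model just means re-running the search.

def model_a(prompt: str, x: str) -> str:
    # Pretend model that only behaves when instructed tersely.
    return x.upper() if "terse" in prompt else x

def optimize(model, candidates, dataset):
    """Return the candidate prompt with the highest eval accuracy."""
    def score(prompt):
        return sum(model(prompt, x) == y for x, y in dataset)
    return max(candidates, key=score)

dataset = [("hi", "HI"), ("ok", "OK")]
candidates = ["be terse and shout", "please answer nicely"]

best = optimize(model_a, candidates, dataset)
print(best)  # → be terse and shout
```

Replace `model_a` with a different stub and `optimize` finds whatever prompt that model prefers — the "swap in a new LLM, get new best prompts" property from the tweet.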


Thomas Liang reposted

mood

FrostedCaribou's tweet image.

Thomas Liang reposted

🎞️𝐂𝐡𝐚𝐢𝐧-𝐨𝐟-𝐕𝐢𝐬𝐮𝐚𝐥-𝐓𝐡𝐨𝐮𝐠𝐡𝐭 for Video Generation🎞️ #VChain is an inference-time chain-of-visual-thought framework that injects visual reasoning signals from multimodal models into video generation - Page: eyeline-labs.github.io/VChain - Code: github.com/Eyeline-Labs/V…

Eyeline Labs presents VChain for smarter video generation. This new framework introduces a "chain-of-visual-thought" from large multimodal models to guide video generators, leading to more coherent and dynamic scenes.



Thomas Liang reposted

From sitting in CMU lectures to giving one - grateful for the chance to guest lecture in the Program Analysis class this week! Shared insights on how static analysis makes its way from research papers to production code.

shrey_twr's tweet image.

Thomas Liang reposted

This is very solid and promising research that scales consistency models to 10B+ video diffusion models. The combination of sCM and Variational Score Distillation is a very promising direction for few-step generation!

🚀Try out rCM—the most advanced diffusion distillation! ✅First to scale up sCM/MeanFlow to 10B+ video models ✅Open-sourced FlashAttention-2 JVP kernel & FSDP/CP support ✅High quality & diversity videos in 2~4 steps Paper: arxiv.org/abs/2510.08431 Code: github.com/NVlabs/rcm

zkwthu's tweet image.


Thomas Liang reposted

Excited to share our latest work — Self-Improving Demonstrations (SID) 🎯 A new paradigm for Goal-Oriented VLN where agents teach themselves through exploration — no human demos needed, yet surpassing shortest-path supervision! Thrilled by what this means for scalable embodied…

🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves. ➡️ Agents iteratively generate and learn from their own successful trajectories ➡️…

ZunWang919's tweet image.


Thomas Liang reposted

Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models "We introduce UML: Unpaired Multimodal Learner, a modality-agnostic training paradigm in which a single model alternately processes inputs from different modalities while sharing parameters across…

iScienceLuvr's tweet image.

Thomas Liang reposted

Introducing Qwen3-VL Cookbooks! 🧑‍🍳 A curated collection of notebooks showcasing the power of Qwen3-VL—via both local deployment and API—across diverse multimodal use cases: ✅ Thinking with Images ✅ Computer-Use Agent ✅ Multimodal Coding ✅ Omni Recognition ✅ Advanced…

Alibaba_Qwen's tweet image.

Thomas Liang reposted

Tina proved that LoRA can match or surpass full-parameter RL. Tora builds directly on that result, turning it into a full framework. Built on torchtune, it extends RL post-training to LoRA, QLoRA, DoRA, and QDoRA under one interface with GRPO, FSDP, and compile support. QLoRA…

gm8xx8's tweet image.

Tina: Tiny Reasoning Models via LoRA LoRA-RL tuned 1.5B models on curated reasoning data, achieving +20% gains and 43% Pass@1 (AIME24) at $9 total cost. Outperforms full-parameter RL on DeepSeek-R1-Distill-Qwen-1.5B. - LoRA-based RL yields better performance with less compute.…

gm8xx8's tweet image.
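The low-rank idea behind Tina and Tora can be shown in a few lines of NumPy: instead of updating a full weight matrix, train a small low-rank correction on top of it. This is a generic sketch of the LoRA parameterization, not Tina's or torchtune's actual code; the dimensions below are arbitrary.

```python
import numpy as np

# LoRA sketch: keep the pretrained weight W frozen and train only a
# rank-r update B @ A, scaled by alpha / r. With B initialized to
# zero the update starts as a no-op, so training begins at the
# pretrained model's behavior.

d_out, d_in, r, alpha = 64, 64, 4, 8
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in))  # frozen pretrained weight
A = rng.standard_normal((r, d_in))      # trainable, r x d_in
B = np.zeros((d_out, r))                # trainable, d_out x r, zero init

def lora_forward(x):
    # y = W x + (alpha / r) * B A x
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
assert np.allclose(lora_forward(x), W @ x)  # B = 0 => matches base model

full, lora = W.size, A.size + B.size
print(full, lora)  # 4096 trainable params full-rank vs 512 with LoRA
```

The parameter count is why LoRA-style RL is so cheap: here the trainable footprint drops 8x, and the gap grows with matrix size since LoRA scales as r·(d_in + d_out) rather than d_in·d_out.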


Thomas Liang reposted

career update: ml researcher done: > built a proprietary ML pipeline for a full GNN workflow exploring GCN, SAGE, GAT, GNNIE, plus some dev future work: > studying GNNs as gradient flows, geometric & Bayesian GNNs; working on interpretability, inference & full-stack dev

curlysaarthak's tweet image.

Thomas Liang reposted

He’s not wrong 😂

lingodotdev's tweet image.

Thomas Liang reposted

programmers life.

_devJNS's tweet image.

Thomas Liang reposted

We are not same again.

krishdotdev's tweet image.

Thomas Liang reposted

🧵1/ Latent diffusion shines in image generation for its abstraction, iterative-refinement, and parallel exploration. Yet, applying it to text reasoning is hard — language is discrete. 💡 Our work LaDiR (Latent Diffusion Reasoner) makes it possible — using VAE + block-wise…


Thomas Liang reposted

🚨New Content: The Trillion Dollar AI Software Development Stack It will generate massive value, spawn hundreds of start-ups and has created the fastest growing companies in history. @stuffyokodraws and I did a deep-dive on market, start-ups and the evolving stack. ⬇️

appenz's tweet image.

Thomas Liang reposted

🪩The one and only @stateofaireport 2025 is live! 🪩 It’s been a monumental 12 months for AI. Our 8th annual report is the most comprehensive it's ever been, covering what you *need* to know about research, industry, politics, safety and our new usage data. My highlight reel:


Thomas Liang reposted

If you’re getting into PyTorch, give this a read. It discusses the usability, design patterns and implementation ideas behind the framework. A few bits and pieces that can help you build a good foundation.

TheGlobalMinima's tweet image.

Thomas Liang reposted

deepmind just dropped a handy little colab on fine-tuning gemma3-270m for emoji generation. this is a super low-resource task: a 270m-parameter model, qlora, short sequences. so it's a great one to try out locally or on colab. it's also a nice one to deploy in a js app…

ben_burtenshaw's tweet image.
