#reinforcementlearning Suchergebnisse

T.Yamazaki

15.11.

「ダメージを最小限にして被害を抑えるつつ、恰好良く転ぶ」を学ぶ二足歩行ロボットエンターテイメント(ステージでのパフォーマンスなど)に有用かもしれない youtu.be/BXqpVMPk63A #bipedal #humanoidrobot #ReinforcementLearning #DisneyResearchHub #entertainment

Manual 𝐏𝐂𝐁 𝐝𝐞𝐬𝐢𝐠𝐧 can’t keep up with today’s complexity. ✨ 𝐀𝐈 𝐜𝐚𝐧. 👉 Discover how @DeepPCB uses reinforcement learning to deliver DRC-clean layouts in hours in our new White Paper: link in comment! #PCBDesign #AIinEngineering #ReinforcementLearning #InstaDeep⁣

DeepPCB's tweet image. Manual 𝐏𝐂𝐁 𝐝𝐞𝐬𝐢𝐠𝐧 can’t keep up with today’s complexity. ✨ 𝐀𝐈 𝐜𝐚𝐧.
👉 Discover how @DeepPCB uses reinforcement learning to deliver DRC-clean layouts in hours in our new White Paper: link in comment!
#PCBDesign #AIinEngineering #ReinforcementLearning #InstaDeep⁣

Tal Fiskus

@fiskustal

11 Std.

Super proud to have presented my paper yesterday at the world's biggest AI conference, #NeurIPS2025! 🎉 "Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound" It was an honor to share my work. #DeepLearning #ReinforcementLearning #AI

fiskustal's tweet image. Super proud to have presented my paper yesterday at the world's biggest AI conference, #NeurIPS2025! 🎉

"Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound"

It was an honor to share my work.
#DeepLearning #ReinforcementLearning #AI

Vishal Dewangan

@Vishal02__

05.10.

Alright let's do this 🔥 building Flappy Bird from scratch in Unity, then training an AI to master it sharing every win, every bug, every "why isn't this working" moment starts now. let's see where this goes follow for the journey → #ReinforcementLearning #gamedev

Vishal02__'s tweet image. Alright let's do this 🔥

building Flappy Bird from scratch in Unity, then training an AI to master it

sharing every win, every bug, every "why isn't this working" moment

starts now. let's see where this goes

follow for the journey →

#ReinforcementLearning #gamedev

T.Yamazaki

@ZappyZappy7

05.11.

汎用ロボットハンドの開発柔らかい物を摘まんだり、棚に商品を補充したり、バッグを持ち運んだりと様々なシナリオに適応 youtu.be/8gQ7qVmcKs0 #RobotHand #dexterous #ReinforcementLearning #EmbodiedAI #VLA #GeneralPurpose #haptic #touching #tactile #teleoperation #PsiBot

Ilham | Crypto & AI Automation expert

@ilhamautomation

17 Std.

Stop just reading RL theory. Build real agents with hands-on code. 🧠⚙️ Free, practical reinforcement learning course + repo with clean examples: github.com/Paulescu/hands… realworldml.net/the-hands-on-r… #ReinforcementLearning #MachineLearning #AI #Python #OpenSource

ilhamautomation's tweet image. Stop just reading RL theory. Build real agents with hands-on code. 🧠⚙️
Free, practical reinforcement learning course + repo with clean examples:
github.com/Paulescu/hands…
realworldml.net/the-hands-on-r…

#ReinforcementLearning #MachineLearning #AI #Python #OpenSource

kedar

@_kedar_18

26.09.

Day 12 🦾 of becoming an ML Beast: Explored Reinforcement Learning – where an agent interacts with an environment, takes actions, and learns from rewards to improve decisions over time. #MachineLearning #ReinforcementLearning #AI #Learninginpublic #100daysofcoding

_kedar_18's tweet image. Day 12 🦾 of becoming an ML Beast: Explored Reinforcement Learning – where an agent interacts with an environment, takes actions, and learns from rewards to improve decisions over time.

#MachineLearning #ReinforcementLearning #AI
#Learninginpublic #100daysofcoding

Computer Science and Engineering at Michigan

@UMichCSE

9 Std.

Curious about Reinforcement Learning? On Nov 21, Michigan CSE welcomed Dr. Andrew Barto—RL pioneer & CSE alum! He shared core principles, real-world impact, and future possibilities. ⬇️🎥Watch the full talk here: youtube.com/watch?v=ELGA7f… #ReinforcementLearning #AI #UMich

UMichCSE's tweet card. What is So Interesting About Reinforcement Learning? | Andrew Barto

youtube.com

YouTube

What is So Interesting About Reinforcement Learning? | Andrew Barto

Quelle: youtube.com

Deluthium

@Deluthium

07.10.

What if liquidity could evolve on its own, adjusting, optimizing, adapting? #Deluthium doesn't just route your trade, we transform it into an intelligent liquidity signal. #ReinforcementLearning meets market-making. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium

@Deluthium

21.10.

Fast markets die first. Smart markets survive. #Deluthium uses #ReinforcementLearning to adapt in real time. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. Fast markets die first.
Smart markets survive.

#Deluthium uses #ReinforcementLearning to adapt in real time.

Brought to you by the Onchain Flash Boys.
Powered by RL.

Kirk Borne

@KirkDBorne

06.10.

#ReinforcementLearning foundational book (2nd edition of this classic): amzn.to/3UtbeAa ————— #DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification

KirkDBorne's tweet image. #ReinforcementLearning foundational book (2nd edition of this classic): amzn.to/3UtbeAa
—————
#DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification

AGI.Eth

@ceobillionaire

28.09.

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: arxiv.org/abs/2509.19736 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Qian et al.: arxiv.org/abs/2509.19736

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

AGI.Eth

@ceobillionaire

10.11.

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Yue et al.: arxiv.org/abs/2504.13837

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

Deluthium

@Deluthium

15.10.

What if markets could think before they move? At #Deluthium, we treat liquidity as signal, not noise. #ReinforcementLearning turns execution into adaptive intelligence. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. What if markets could think before they move?
At #Deluthium, we treat liquidity as signal, not noise.

#ReinforcementLearning turns execution into adaptive intelligence.

Brought to you by the Onchain Flash Boys.
Powered by RL.

PRX Life

@PRX_Life

27.10.

A new theory based on #ReinforcementLearning reveals the optimal pairing relationship between signal sensing and modulation and provides a new way to understand collective information processing in populations of cells. 🔗 go.aps.org/46RIIhh

PRX_Life's tweet image. A new theory based on #ReinforcementLearning reveals the optimal pairing relationship between signal sensing and modulation and provides a new way to understand collective information processing in populations of cells.

🔗 go.aps.org/46RIIhh

Deluthium

@Deluthium

09.10.

Every swap, limit order, cross-chain action, it’s input to the #ReinforcementLearning engine. In #Deluthium, your request becomes part of the learning feedback loop. No black boxes. Full transparency. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. Every swap, limit order, cross-chain action, it’s input to the #ReinforcementLearning engine.

In #Deluthium, your request becomes part of the learning feedback loop.

No black boxes. Full transparency.

Brought to you by the Onchain Flash Boys.
Powered by RL.

Kirk Borne

@KirkDBorne

01.12.

Deep #ReinforcementLearning Hands-On — Practical easy-to-follow guide to RL from Q-learning and DQNs to PPO and RLHF: amzn.to/3MV9o60 [3rd Ed.] v/ @PacktDataML —— #AI #MachineLearning #DeepLearning #DataScience #DataScientist —— 𝓚𝓮𝔂 𝓕𝓮𝓪𝓽𝓾𝓻𝓮𝓼: 🟢Learn with…

KirkDBorne's tweet image. Deep #ReinforcementLearning Hands-On — Practical easy-to-follow guide to RL from Q-learning and DQNs to PPO and RLHF: amzn.to/3MV9o60 [3rd Ed.] v/ @PacktDataML
——
#AI #MachineLearning #DeepLearning #DataScience #DataScientist
——
𝓚𝓮𝔂 𝓕𝓮𝓪𝓽𝓾𝓻𝓮𝓼:
🟢Learn with…

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

04.11.

JAX #Android #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #GoLang #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/JAX-Android-RL

gp_pulipaka's tweet image. JAX #Android #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #GoLang #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/JAX-Android-RL

MONTREAL.AI

@Montreal_AI

10.11.

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Montreal_AI's tweet image. Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Yue et al.: arxiv.org/abs/2504.13837

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

Ai Story News

@aistorynews

3 Std.

NVIDIA's Isaac Lab Arena launches to benchmark robot policies at scale, with whole-body control, richer teleoperation data,… aistory.news/machine-learni… #FederatedLearning #Quantization #ReinforcementLearning

aistorynews's tweet card. NVIDIA's Isaac Lab Arena launches to benchmark robot policies at scale, with whole-body control, richer teleoperation data, ADR, and PBT.

Isaac Lab Arena debuts for scalable robot evaluation

Quelle: aistory.news

Amazin Daily Facts

@dailyfaktz

9 Std.

🔥 Exciting news! Together AI launches TorchForge RL pipelines on its cloud, boosting distributed training and sandboxed environments. Check out the BlackJack training demo and elevate your AI projects! #TogetherAI #TorchForge #ReinforcementLearning #AI … ift.tt/UGzx0H5

dailyfaktz's tweet card. Together AI introduces TorchForge RL pipelines on its cloud platform, enhancing distributed training and sandboxed environments with a BlackJack training demo.

TorchForge RL Pipelines Now Operable on Together AI's Cloud

Quelle: blockchain.news

Computer Science and Engineering at Michigan

@UMichCSE

9 Std.

youtube.com

YouTube

What is So Interesting About Reinforcement Learning? | Andrew Barto

Quelle: youtube.com

Tal Fiskus

@fiskustal

11 Std.

Griffintaur

@griffintaur

14 Std.

🚀 Interestingresearch: Qwen3-VL Technical Report Read more: huggingface.co/papers/2511.21… #LLM #ReinforcementLearning #MLResearch

Paper page - Qwen3-VL Technical Report

Quelle: huggingface.co

Ilham | Crypto & AI Automation expert

@ilhamautomation

17 Std.

Dragos Calin

@calinrobotics

18 Std.

In this tutorial, you will see exactly why, how to normalize correctly and how to stabilize your training. reinforcementlearningpath.com/hands-on-min-m… #reinforcementlearning #robotics #machinelearningtutorial #ArtificialInteligence

calinrobotics's tweet image. In this tutorial, you will see exactly why, how to normalize correctly and how to stabilize your training.
reinforcementlearningpath.com/hands-on-min-m…
#reinforcementlearning #robotics #machinelearningtutorial #ArtificialInteligence

bot

@robo_burro

22 Std.

🚀 BULLISH SIGNAL Radiologist copilot enhances radiology reporting with advanced AI workflow @Microsoft #ReinforcementLearning #AR #GM #GN 🧵 Full breakdown (1/4) 👇 📖 x.com/rohanpaul_ai/s…

Rohan Paul

@rohanpaul_ai

22 Std.

The paper builds an AI assistant that writes medical CT reports and automatically checks their quality like a colleague. Radiology reports for 3D scans are slow to write and easy to miss details, while older models only give a rough draft. Radiologist Copilot runs a workflow…

rohanpaul_ai's tweet image. The paper builds an AI assistant that writes medical CT reports and automatically checks their quality like a colleague.

Radiology reports for 3D scans are slow to write and easy to miss details, while older models only give a rough draft.

Radiologist Copilot runs a workflow…

ouadi maakoul

@ouadi4maakoul

03.12.

📉Breaking RL's variance barrier! Curriculum-Induced Policy Optimization (CIPO): Var(∇J)≤ σ²/T·Σ1/Hₜ + C/T·Σεₜ Achieves O(log T/T²)variance vs O(1/T) standard RL. Stable learning means faster AI development for all researchers.#ReinforcementLearning #AIResearch

Andrés-Leonardo Martínez-Ortiz, PhD

@davilagrau

03.12.

The Next Frontier in AI Isn’t Just More Data: Reinforcement learning environments prepare AI for messy reality #AI #ReinforcementLearning buff.ly/4NgE3n0

AIQuantumAndScienceNews

@AIQuantumLifeEx

03.12.

Zero-Shot Instruction Following in RL via Structured LTL Representations Preprint: This study proposes a novel approach for reinforcement learning agents to follow arbitrar… arxiv.org/abs/2512.02633 #AI #ReinforcementLearning #MachineLearning #Preprint #Arxiv #ScienceNews

jpizarrom

@jpizarrom

03.12.

🔹 Sim: 100% success on PandaPickCube-v0 (gym-hil). 🔹 Real World: After just 10k steps on the SO100, the policy is picking up a butter box and placing it on a shelf! 🧈➡️📦 #LeRobot #Robotics #ReinforcementLearning #SmolVLA #SO100 #HuggingFace

Holger Müller #EnterpriseAcceleration

@holgermu

03.12.

And now @SwamiSivasubram arrived at @Amazon Sagemaker. Of course has all the capabilities teed up before - #KnowledgeDistillation #ReInforcementLearning etc etc. #AWSReinvent

holgermu's tweet image. And now @SwamiSivasubram arrived at @Amazon Sagemaker. Of course has all the capabilities teed up before - #KnowledgeDistillation #ReInforcementLearning etc etc. #AWSReinvent

Holger Müller #EnterpriseAcceleration

@holgermu

03.12.

Next up - #ReinforcementLearning - of course this is all the tee up for all what is available out of the box.... in @Amazon Sagemaker? #AWReinvent

holgermu's tweet image. Next up - #ReinforcementLearning - of course this is all the tee up for all what is available out of the box.... in @Amazon Sagemaker? #AWReinvent

Elizabeth Fuentes

@ElizabethFue12

03.12.

📰🚨 Amazon Bedrock adds reinforcement ﬁne-tuning simplifying how developers build smarter, more accurate AI models #ReinforcementLearning #AmazonBedrock #AICustomization #MachineLearning #ModelOptimization ift.tt/4iubT67

ElizabethFue12's tweet card. Amazon Bedrock now supports reinforcement fine-tuning delivering 66% accuracy gains on average over base models.

Amazon Bedrock adds reinforcement ﬁne-tuning simplifying how developers build smarter, more...

Quelle: aws.amazon.com

Stelixx Insights

@StelixxInsights

03.12.

The future of AI? Tsinghua Uni's Yi Wu says it's all about Reinforcement Learning & Embodied Agents. Learning by doing! #AI #ReinforcementLearning #EmbodiedAgents

StelixxInsights's tweet image. The future of AI? Tsinghua Uni's Yi Wu says it's all about Reinforcement Learning &amp; Embodied Agents. Learning by doing! #AI #ReinforcementLearning #EmbodiedAgents

Sasha Alexander Lambert

@SashLambert

ReinforcementLearning

@ReinforcementL3

ReinforcementLearning

@ReinforcementL

Daniel Palenicek

@DPalenicek

Technion - Reinforcement Learning Research Labs

@Technion_RL

Yu-Xiang Wang

@yuxiangw_cs

Ofir Nachum

@ofirnachum

Daniel J. Mankowitz

@DJ_Mankowitz

James

@jmac_ai

CogitAI

@Cogitai

Jacqueline Isabelle Forien

@JackieForien

Joseph Cox

@JosephJohnCox

Seydina Ndiaye

@seysoosey

robertjneal

@robertjneal

Ashish Umre

@hormigaloca

DeepPCB

@DeepPCB

23.10.

Deluthium

@Deluthium

07.10.

Tal Fiskus

@fiskustal

11 Std.

AGI.Eth

@ceobillionaire

28.09.

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: arxiv.org/abs/2509.19736 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

04.11.

Deluthium

@Deluthium

15.10.

Deluthium

@Deluthium

09.10.

Ilham | Crypto & AI Automation expert

@ilhamautomation

17 Std.

natsutan

@natsutan

25.06.

うおっ、分かった。 #VRアカデミア #ReinforcementLearning

Deluthium

@Deluthium

21.10.

Fast markets die first. Smart markets survive. #Deluthium uses #ReinforcementLearning to adapt in real time. Brought to you by the Onchain Flash Boys. Powered by RL.

Kirk Borne

@KirkDBorne

06.10.

#ReinforcementLearning foundational book (2nd edition of this classic): amzn.to/3UtbeAa ————— #DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification

PRX Life

@PRX_Life

27.10.

AGI.Eth

@ceobillionaire

01.06.

A Tutorial on Meta-Reinforcement Learning Beck et al.: arxiv.org/abs/2301.08028 #ArtificialIntelligence #MetaLearning #ReinforcementLearning

ceobillionaire's tweet image. A Tutorial on Meta-Reinforcement Learning

Beck et al.: arxiv.org/abs/2301.08028

#ArtificialIntelligence #MetaLearning #ReinforcementLearning

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

19.10.

Deep #ReinforcementLearning for #Keras! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/DRL-Keras

gp_pulipaka's tweet image. Deep #ReinforcementLearning for #Keras! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/DRL-Keras

AGI.Eth

@ceobillionaire

28.05.

Reinforcing General Reasoning without Verifiers Zhou et al.: arxiv.org/abs/2505.21493 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. Reinforcing General Reasoning without Verifiers

Zhou et al.: arxiv.org/abs/2505.21493

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

AGI.Eth

@ceobillionaire

13.07.

The Bitter Lesson "Search and learning are general purpose methods that continue to scale with increased computation, even as the available computation becomes very great." — Richard Sutton Rich Sutton: incompleteideas.net/IncIdeas/Bitte… #ReinforcementLearning

ceobillionaire's tweet image. The Bitter Lesson

"Search and learning are general purpose methods that continue to scale with increased computation, even as the available computation becomes very great." — Richard Sutton

Rich Sutton: incompleteideas.net/IncIdeas/Bitte…

#ReinforcementLearning

AGI.Eth

@ceobillionaire

10.11.

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Zichen Chen (🐱,💖)@NeurIPS

@my_cat_can_code

26.09.2024

🚀 Exciting News! Our paper has been accepted at @NeurIPSConf! 🎉 We introduce State Chrono Representation (SCR) -- a novel approach in #ReinforcementLearning. SCR integrates long-term temporal dynamics and cumulative rewards into state representations, addressing key challenges…

my_cat_can_code's tweet image. 🚀 Exciting News! Our paper has been accepted at @NeurIPSConf! 🎉
We introduce State Chrono Representation (SCR) -- a novel approach in #ReinforcementLearning. SCR integrates long-term temporal dynamics and cumulative rewards into state representations, addressing key challenges…

SA News Channel

@SatlokChannel

11.08.

7/10 Reinforcement Learning trains agents through trial and error to maximize rewards. It’s used in gaming, robotics, and real-time decision systems like traffic control. #ReinforcementLearning #AI #SmartSystems #DeepLearning #GameAI #AutonomousTech

SatlokChannel's tweet image. 7/10
Reinforcement Learning trains agents through trial and error to maximize rewards. It’s used in gaming, robotics, and real-time decision systems like traffic control.
#ReinforcementLearning #AI #SmartSystems #DeepLearning #GameAI #AutonomousTech

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

24.10.

#ReinforcementLearning using Policy Embedding! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/RL-Policy-Embe…

gp_pulipaka's tweet image. #ReinforcementLearning using Policy Embedding! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/RL-Policy-Embe…

Something went wrong.

United States Trends

1. Ferguson 7,976 posts
2. Lions 62.8K posts
3. Lions 62.8K posts
4. Sixers 7,017 posts
5. #OnePride 5,604 posts
6. Gibbs 9,539 posts
7. #DALvsDET 3,306 posts
8. Jack Campbell 2,950 posts
9. Turpin 1,560 posts
10. Goff 5,937 posts
11. Brandon Aubrey 3,226 posts
12. Pat Spencer 2,467 posts
13. #MissVenezuela2025 7,107 posts
14. Warriors 44.6K posts
15. Shang Tsung 2,752 posts
16. Maxey 4,254 posts
17. #RHOBH 2,655 posts
18. James Houston 1,785 posts
19. Kenneth Murray N/A
20. #TNFonPrime 2,099 posts