#reinforcementlearning search results

Vishal Dewangan

Oct 5

Alright let's do this 🔥 building Flappy Bird from scratch in Unity, then training an AI to master it sharing every win, every bug, every "why isn't this working" moment starts now. let's see where this goes follow for the journey → #ReinforcementLearning #gamedev

Vishal02__'s tweet image. Alright let's do this 🔥

building Flappy Bird from scratch in Unity, then training an AI to master it

sharing every win, every bug, every "why isn't this working" moment

starts now. let's see where this goes

follow for the journey →

#ReinforcementLearning #gamedev

AGI.Eth

@ceobillionaire

8 h

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Yue et al.: arxiv.org/abs/2504.13837

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

MONTREAL.AI

@Montreal_AI

8 h

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Montreal_AI's tweet image. Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Yue et al.: arxiv.org/abs/2504.13837

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

Kirk Borne

@KirkDBorne

Oct 6

#ReinforcementLearning foundational book (2nd edition of this classic): amzn.to/3UtbeAa ————— #DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification

KirkDBorne's tweet image. #ReinforcementLearning foundational book (2nd edition of this classic): amzn.to/3UtbeAa
—————
#DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification

AGI.Eth

@ceobillionaire

Sep 28

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: arxiv.org/abs/2509.19736 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Qian et al.: arxiv.org/abs/2509.19736

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

Nov 9

Intro: #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/Intro-RL

gp_pulipaka's tweet image. Intro: #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/Intro-RL

DeepPCB

@DeepPCB

Oct 23

Manual 𝐏𝐂𝐁 𝐝𝐞𝐬𝐢𝐠𝐧 can’t keep up with today’s complexity. ✨ 𝐀𝐈 𝐜𝐚𝐧. 👉 Discover how @DeepPCB uses reinforcement learning to deliver DRC-clean layouts in hours in our new White Paper: link in comment! #PCBDesign #AIinEngineering #ReinforcementLearning #InstaDeep⁣

DeepPCB's tweet image. Manual 𝐏𝐂𝐁 𝐝𝐞𝐬𝐢𝐠𝐧 can’t keep up with today’s complexity. ✨ 𝐀𝐈 𝐜𝐚𝐧.
👉 Discover how @DeepPCB uses reinforcement learning to deliver DRC-clean layouts in hours in our new White Paper: link in comment!
#PCBDesign #AIinEngineering #ReinforcementLearning #InstaDeep⁣

Chenlu Ye

@ye_chenlu

Sep 5

PROF🌀Right answer, flawed reason?🤔🌀 📄arxiv.org/pdf/2509.03403 Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀 Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM & ORM. #LLM #ReinforcementLearning

ye_chenlu's tweet image. PROF🌀Right answer, flawed reason?🤔🌀
📄arxiv.org/pdf/2509.03403
Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀
Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM &amp; ORM. #LLM #ReinforcementLearning

PRX Life

@PRX_Life

Oct 27

A new theory based on #ReinforcementLearning reveals the optimal pairing relationship between signal sensing and modulation and provides a new way to understand collective information processing in populations of cells. 🔗 go.aps.org/46RIIhh

PRX_Life's tweet image. A new theory based on #ReinforcementLearning reveals the optimal pairing relationship between signal sensing and modulation and provides a new way to understand collective information processing in populations of cells.

🔗 go.aps.org/46RIIhh

Deluthium

@Deluthium

Oct 9

Every swap, limit order, cross-chain action, it’s input to the #ReinforcementLearning engine. In #Deluthium, your request becomes part of the learning feedback loop. No black boxes. Full transparency. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium

@Deluthium

Oct 7

What if liquidity could evolve on its own, adjusting, optimizing, adapting? #Deluthium doesn't just route your trade, we transform it into an intelligent liquidity signal. #ReinforcementLearning meets market-making. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. What if liquidity could evolve on its own, adjusting, optimizing, adapting?

#Deluthium doesn't just route your trade, we transform it into an intelligent liquidity signal.

#ReinforcementLearning meets market-making.

Brought to you by the Onchain Flash Boys.
Powered by RL.

AFX LAB

@AFX_LAB

Sep 3

#reinforcementlearning

Science Robotics

@SciRobotics

Aug 20

Scientists have developed a method based on #ReinforcementLearning that enables a robot to use its upper body to lift and flip a water jug. @ToyotaResearch Learn more in Science #Robotics: scim.ag/4oK6qmt

Kirk Borne

@KirkDBorne

Oct 7

Deep #ReinforcementLearning with #Python (1) Blog: rubikscode.net/2021/07/13/dee… ➕ (2) Book: covers classic RL, deep RL, distributional RL, inverse RL, & more 👉 amzn.to/3pbZS1p —————— #DataScientist #AI #MachineLearning #ML #DataScience #DeepLearning

KirkDBorne's tweet image. Deep #ReinforcementLearning with #Python

(1) Blog: rubikscode.net/2021/07/13/dee…
➕
(2) Book: covers classic RL, deep RL, distributional RL, inverse RL, &amp; more
👉 amzn.to/3pbZS1p
——————
#DataScientist #AI #MachineLearning #ML #DataScience #DeepLearning

kedar

@_kedar_18

Sep 26

Day 12 🦾 of becoming an ML Beast: Explored Reinforcement Learning – where an agent interacts with an environment, takes actions, and learns from rewards to improve decisions over time. #MachineLearning #ReinforcementLearning #AI #Learninginpublic #100daysofcoding

_kedar_18's tweet image. Day 12 🦾 of becoming an ML Beast: Explored Reinforcement Learning – where an agent interacts with an environment, takes actions, and learns from rewards to improve decisions over time.

#MachineLearning #ReinforcementLearning #AI
#Learninginpublic #100daysofcoding

Deluthium

@Deluthium

Oct 15

What if markets could think before they move? At #Deluthium, we treat liquidity as signal, not noise. #ReinforcementLearning turns execution into adaptive intelligence. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. What if markets could think before they move?
At #Deluthium, we treat liquidity as signal, not noise.

#ReinforcementLearning turns execution into adaptive intelligence.

Brought to you by the Onchain Flash Boys.
Powered by RL.

Kirk Borne

@KirkDBorne

Sep 20

Introduction to various #ReinforcementLearning #Algorithms: bit.ly/2UPHbSj ————— #DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification ————— + See this foundational book (2nd edition): amzn.to/3UtbeAa

KirkDBorne's tweet image. Introduction to various #ReinforcementLearning #Algorithms: bit.ly/2UPHbSj
—————
#DataScience #AI #MachineLearning #ML #DeepLearning #DataMining #Mathematics #Gamification
—————
+
See this foundational book (2nd edition): amzn.to/3UtbeAa

Deluthium

@Deluthium

Oct 21

Fast markets die first. Smart markets survive. #Deluthium uses #ReinforcementLearning to adapt in real time. Brought to you by the Onchain Flash Boys. Powered by RL.

Deluthium's tweet image. Fast markets die first.
Smart markets survive.

#Deluthium uses #ReinforcementLearning to adapt in real time.

Brought to you by the Onchain Flash Boys.
Powered by RL.

MONTREAL.AI

@Montreal_AI

8 h

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

AGI.Eth

@ceobillionaire

8 h

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yue et al.: arxiv.org/abs/2504.13837 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Griffintaur

@griffintaur

14 h

🚀 Interestingresearch: Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics Read more: alphaxiv.org/abs/2511.04527 #LLM #ReinforcementLearning #MLResearch

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

Nov 9

Nitin Prajwal R

@nitinprajwal

Nov 9

New breakthrough in RL! 🚀 Transitive RL (TRL) uses 'divide and conquer' to solve long-horizon tasks, outperforming traditional TD learning. More scalable, less error! #ReinforcementLearning #AI #MachineLearning blog.nitinr.live

nitinprajwal's tweet image. New breakthrough in RL! 🚀 Transitive RL (TRL) uses 'divide and conquer' to solve long-horizon tasks, outperforming traditional TD learning. More scalable, less error! #ReinforcementLearning #AI #MachineLearning

blog.nitinr.live

Content Fans

@Content_Fans

Nov 9

🚀 @MetaAI's DreamGym revolutionizes LLM training: Synthetic environments replace slow real-world RL, boosting gains by 30%+ with dynamic tasks & robust sim-to-real transfer. Imagine agents dreaming their way to mastery! #AI #ReinforcementLearning

Griffintaur

@griffintaur

Nov 8

🚀 Interestingresearch: FastGS: Training 3D Gaussian Splatting in 100 Seconds Read more: alphaxiv.org/abs/2511.04283 #LLM #ReinforcementLearning #MLResearch

Kirk Borne

@KirkDBorne

Nov 8

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

Nov 8

Intro: Adversarial #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #JavaScript #ReactJS #GoLang #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/Adversarial-RL

gp_pulipaka's tweet image. Intro: Adversarial #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #JavaScript #ReactJS #GoLang #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/Adversarial-RL

Lingkai Kong

@konglingkai_AI

Nov 8

Wonderful Collaboration with @HaichuanWang23 @tonghanwang Guojun Xiong, @MilindTambe_AI #neurips2025 #GenerativeAI #reinforcementlearning #DeepLearning #DecisionMaking

InfiniTech Life Global

@Infinit18575448

Nov 7

AIMindUpdate News! Revolutionizing multi-agent systems! Learn a new approach to external intervention and design. #MultiAgentAI #ReinforcementLearning #AIIntervention Click here↓↓↓ aimindupdate.com/2025/11/08/unl…

Infinit18575448's tweet image. AIMindUpdate News!
Revolutionizing multi-agent systems! Learn a new approach to external intervention and design. #MultiAgentAI #ReinforcementLearning #AIIntervention

Click here↓↓↓
aimindupdate.com/2025/11/08/unl…

Proceedings of the IEEE

@ProceedingsIEEE

Nov 7

"Deep #ReinforcementLearning for Distribution System Operations: A Tutorial and Survey," presented by @WSUPullman, @NREL, and @EversourceNH authors explores how #DRL can optimize #DistributedEnergy resources, manage grid uncertainty, and bridge model-based and model-free control.

ProceedingsIEEE's tweet image. "Deep #ReinforcementLearning for Distribution System Operations: A Tutorial and Survey," presented by @WSUPullman, @NREL, and @EversourceNH authors explores how #DRL can optimize #DistributedEnergy resources, manage grid uncertainty, and bridge model-based and model-free control.

Proceedings of the IEEE

@ProceedingsIEEE

Nov 7

Modern #powergrids are becoming smarter and more complex. A new tutorial from @ProceedingsIEEE's June 2025 issue explores how deep #ReinforcementLearning can help operators manage uncertainty, optimize #DistributedEnergy, and keep tomorrow’s grids stable: bit.ly/ProceedingsIEE…

ProceedingsIEEE's tweet image. Modern #powergrids are becoming smarter and more complex. A new tutorial from @ProceedingsIEEE's June 2025 issue explores how deep #ReinforcementLearning can help operators manage uncertainty, optimize #DistributedEnergy, and keep tomorrow’s grids stable: bit.ly/ProceedingsIEE…

PHYBOT

@PHYBOT_Tech

Nov 7

Triple Tap to Unlock! Activate PHYBOT C1’s Cyber Warm-Up Moves Now! Flexible—beyond imagination. No matter how skilled you are, skipping warm-ups can still trip you up!initiate body’s cyber awakening.@elonmusk @Bitturing #HumanoidRobot #ReinforcementLearning #CycloidalJoint

ReinforcementLearning

@ReinforcementL3

Yu-Xiang Wang

@yuxiangw_cs

ReinforcementLearning

@ReinforcementL

Technion - Reinforcement Learning Research Labs

@Technion_RL

Ofir Nachum

@ofirnachum

Sasha Alexander Lambert

@SashLambert

Daniel J. Mankowitz

@DJ_Mankowitz

James

@jmac_ai

CogitAI

@Cogitai

Joseph Cox

@JosephJohnCox

Seydina Ndiaye

@seysoosey

Jacqueline Isabelle Forien

@JackieForien

robertjneal

@robertjneal

Ashish Umre

@hormigaloca

AGI.Eth

@ceobillionaire

Sep 28

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: arxiv.org/abs/2509.19736 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

Fast markets die first. Smart markets survive. #Deluthium uses #ReinforcementLearning to adapt in real time. Brought to you by the Onchain Flash Boys. Powered by RL.

#reinforcementlearning

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

Nov 9

AGI.Eth

@ceobillionaire

Jun 1

A Tutorial on Meta-Reinforcement Learning Beck et al.: arxiv.org/abs/2301.08028 #ArtificialIntelligence #MetaLearning #ReinforcementLearning

ceobillionaire's tweet image. A Tutorial on Meta-Reinforcement Learning

Beck et al.: arxiv.org/abs/2301.08028

#ArtificialIntelligence #MetaLearning #ReinforcementLearning

Deluthium

@Deluthium

Oct 7

AGI.Eth

@ceobillionaire

May 28

Reinforcing General Reasoning without Verifiers Zhou et al.: arxiv.org/abs/2505.21493 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. Reinforcing General Reasoning without Verifiers

Zhou et al.: arxiv.org/abs/2505.21493

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

Oct 19

Deep #ReinforcementLearning for #Keras! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/DRL-Keras

gp_pulipaka's tweet image. Deep #ReinforcementLearning for #Keras! #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode
geni.us/DRL-Keras

AGI.Eth

@ceobillionaire

Jul 13

The Bitter Lesson "Search and learning are general purpose methods that continue to scale with increased computation, even as the available computation becomes very great." — Richard Sutton Rich Sutton: incompleteideas.net/IncIdeas/Bitte… #ReinforcementLearning

ceobillionaire's tweet image. The Bitter Lesson

"Search and learning are general purpose methods that continue to scale with increased computation, even as the available computation becomes very great." — Richard Sutton

Rich Sutton: incompleteideas.net/IncIdeas/Bitte…

#ReinforcementLearning

AGI.Eth

@ceobillionaire

May 4

Automated Design of Agentic Systems Shengran Hu, Cong Lu, Jeff Clune: arxiv.org/abs/2408.08435 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

ceobillionaire's tweet image. Automated Design of Agentic Systems

Shengran Hu, Cong Lu, Jeff Clune: arxiv.org/abs/2408.08435

#ArtificialIntelligence #DeepLearning #ReinforcementLearning

SA News Channel

@SatlokChannel

Aug 11

7/10 Reinforcement Learning trains agents through trial and error to maximize rewards. It’s used in gaming, robotics, and real-time decision systems like traffic control. #ReinforcementLearning #AI #SmartSystems #DeepLearning #GameAI #AutonomousTech