#rewardfunction search results

adriennefriend

May 11, 2018

Challenge issued by @BlueVoyant at @PyCon: tweet your funniest programming mistakes with #rewardfunction and #pycon2018, and win a SNES!! "We want you to know that it's safe and fun to try things and make mistakes with programming." 💖

adriennefriend's tweet image. Challenge issued by @BlueVoyant at @PyCon: tweet your funniest programming mistakes with #rewardfunction and #pycon2018, and win a SNES!!

"We want you to know that it's safe and fun to try things and make mistakes with programming." 💖

Jason Le

@jqwotos

May 11, 2018

When you try to build a scraper to download videos from openload, and worked a multiple hours on extras thinking that the base code worked and then you open it... #pycon2018 #rewardfunction @BlueVoyant

Paschal Ugwu

@_paschalugwu

Feb 26

A reward function in reinforcement learning guides an agent by scoring actions to help it maximize rewards and learn optimal behavior. #ReinforcementLearning #RewardFunction #MachineLearning #AI #ArtificialIntelligence #DataScience #LearningAlgorithm #AgentTraining

_paschalugwu's tweet image. A reward function in reinforcement learning guides an agent by scoring actions to help it maximize rewards and learn optimal behavior.

#ReinforcementLearning #RewardFunction #MachineLearning #AI #ArtificialIntelligence #DataScience #LearningAlgorithm #AgentTraining

Veronica Hanus

@veronica_hanus

May 13, 2018

I made a scraper that would email me when X happened & committed my test-email pw. Forgot this until an onsite. The technical director came, we talked abt code quality. Arrived home, had an email from said director but from MYSELF (test email). Touche! #rewardfunction #PyCon2018

lvh

@lvh

May 11, 2018

@BlueVoyant I used an optimizer to figure out PyCon financial aid grant sizes and it told me to tell some grant recipients to give *us* money to give more to others... Oops 😬 #rewardfunction #pycon2018

littlecatfish

@littlecatfish07

Mar 21, 2023

“What is love?” heard an interesting way to put it: “It means helping others to optimize their reward functions.” (from an AI perspective) #love #rewardfunction #AI

Applied Sciences MDPI

@Applsci

Sep 13, 2023

🔥 Read our Highly Cited Paper 📚 Reinforcement-Learning-Based Vibration Control for a Vehicle Semi-Active Suspension System via the PPO Approach 🔗 mdpi.com/2076-3417/12/6… 👨‍🔬 by Prof. Dr. Shi-Yuan Han and Mrs. Tong Liang #roadchange #rewardfunction #vehicle

Applsci's tweet image. 🔥 Read our Highly Cited Paper

📚 Reinforcement-Learning-Based Vibration Control for a Vehicle Semi-Active Suspension System via the PPO Approach
🔗 mdpi.com/2076-3417/12/6…
👨‍🔬 by Prof. Dr. Shi-Yuan Han and Mrs. Tong Liang

#roadchange #rewardfunction #vehicle

San Viego

@San_viego

Mar 31, 2023

🎯 3/9 The crux of the matter is the reward function. By making ChatGPT publicly available, OpenAI gains invaluable human feedback, enabling the model to learn faster and better understand user intent. Increased interaction drives rapid improvement. #AGI #RewardFunction

Nicholas Sanchirico

@lnk2past

May 11, 2018

@BlueVoyant When frustratingly debugging a problem using print statements instead of a debugger, I resorted to using a lot of profanity. I ultimately requested some help from others, but never removed my loud distaste for the product/assignment. #rewardfunction #PyCon2018

Adv Sultan Khan

@AdvSultanKhan

Oct 11, 2020

The Future of Programming #EmergentBehaviour #RewardFunction There's Always More Packages there in the #Future than Behind You. #ArtificialIntelligence #AI #Linux youtu.be/o6XiGJQ6_dQ

AdvSultanKhan's tweet card. Artificial General Intelligence in 6 Minutes • Danny Lange • GOTO 2020

youtube.com

YouTube

Artificial General Intelligence in 6 Minutes • Danny Lange • GOTO 2020

Source: youtube.com

Kapil Gupta

@kapilgupta

Nov 10, 2018

What a fascinating thread! #neuralnets #rewardfunction #ai #UnintendedConsequences

Custard Smingleigh

@Smingleigh

Nov 8, 2018

I hooked a neural network up to my Roomba. I wanted it to learn to navigate without bumping into things, so I set up a reward scheme to encourage speed and discourage hitting the bumper sensors. It learnt to drive backwards, because there are no bumpers on the back.

InteractAI

@InteractAIchats

Nov 17, 2023

"Our research proves that reward functions can't capture every task, but there are polynomial-time algorithms that can construct a reward function to optimize tasks of three types. #AI #rewardfunction #polynomialtime" Link:deepmind.google/discover/blog/… Follow for more!

deepmind.google

On the Expressivity of Markov Reward

Our main results prove that while reward can express many tasks, there exist instances of each task type that no Markov reward function can capture. We then provide a set of polynomial-time algorithm…

Source: deepmind.google

Laurie

@laurieontech

May 11, 2018

I’m not sure if this mistake is funny but I once spent an entire debugging the print display of a pdf. We were keying on a word with “m” in it...this specific file had been fat finger with “nn”! #pycon2018 #rewardfunction

Tor 🦕

@GriffynTor

Sep 21, 2021

Lesson learned from yesterday: I can no longer do 18 hour days. Compensatory lie-in this morning. Now to get on with chores and writing up yesterday's lecture notes so I can visit the penguins this afternoon! #rewardfunction

bigbear❤ Memecoin

@theda60433

Feb 26

Addressing these aspects will provide valuable insights into the robustness and scalability of your solution. #RewardFunction #BiasMitigation #ScalableSolutions

Organon Analytics

@InfoOrganon

Mar 12, 2018

#EvolutionaryComputation #ArtificialLife #RewardFunction

evolvingstuff

@evolvingstuff

Mar 12, 2018

Highly recommend this paper for practitioners of / those with an interest in Evolutionary Computation and Artificial Life. Overview of the many ways that evolution will "cheat" and hack its way around your oh-so-clever reward function! arxiv.org/abs/1803.03453

evolvingstuff's tweet image. Highly recommend this paper for practitioners of / those with an interest in Evolutionary Computation and Artificial Life. Overview of the many ways that evolution will "cheat" and hack its way around your oh-so-clever reward function!
arxiv.org/abs/1803.03453

Marcelo Aragao📈 (马塞洛)(マルセロ)

@mathz_aragao

Jun 19, 2016

"The #rewardfunction needs to be adjusted in accordance—the problem can go as far as to nullify it" #AI

Charles Foveau

@cefoveau

Jun 19, 2016

Scientists Are Creating an Artificial Intelligence Kill Switch goo.gl/E63UyL #ai #artificialintelligence

FCM Meetings & Events

@FCM_me

Aug 10, 2015

Creating the WOW factor with @maccas and @Cirque at @marinabaysands, Singapore dlvr.it/Bp4MTH #rewardfunction #travelincentive

Ballico Stretch

@mmp1

Apr 15, 2023

#lostfunction #rewardfunction #StuartRussell Why AI will destroy human civilization | Max Tegmark and Lex Fridman youtu.be/tW2I37TMUsA via @YouTube @batchelorshow

mmp1's tweet card. Why AI will destroy human civilization | Max Tegmark and Lex Fridman

youtube.com

YouTube

Why AI will destroy human civilization | Max Tegmark and Lex Fridman

Source: youtube.com

bigbear❤ Memecoin

@theda60433

Feb 26

Addressing these aspects will provide valuable insights into the robustness and scalability of your solution. #RewardFunction #BiasMitigation #ScalableSolutions

Paschal Ugwu

@_paschalugwu

Feb 26

InteractAI

@InteractAIchats

Nov 17, 2023

deepmind.google

On the Expressivity of Markov Reward

Source: deepmind.google

Applied Sciences MDPI

@Applsci

Sep 13, 2023

Ballico Stretch

@mmp1

Apr 15, 2023

#lostfunction #rewardfunction #StuartRussell Why AI will destroy human civilization | Max Tegmark and Lex Fridman youtu.be/tW2I37TMUsA via @YouTube @batchelorshow

youtube.com

YouTube

Why AI will destroy human civilization | Max Tegmark and Lex Fridman

Source: youtube.com

San Viego

@San_viego

Mar 31, 2023

littlecatfish

@littlecatfish07

Mar 21, 2023

“What is love?” heard an interesting way to put it: “It means helping others to optimize their reward functions.” (from an AI perspective) #love #rewardfunction #AI

Tor 🦕

@GriffynTor

Sep 21, 2021

Adv Sultan Khan

@AdvSultanKhan

Oct 11, 2020

The Future of Programming #EmergentBehaviour #RewardFunction There's Always More Packages there in the #Future than Behind You. #ArtificialIntelligence #AI #Linux youtu.be/o6XiGJQ6_dQ

youtube.com

YouTube

Artificial General Intelligence in 6 Minutes • Danny Lange • GOTO 2020

Source: youtube.com

Moonshot Scout

@truesciencesays

Oct 4, 2019

Having #robots to perfrom human activities without a supervision? That requires a method for #learning useful skills without a #rewardfunction. Read our #blitzcard about it: bit.ly/2VfWpjB

Growth Protocol

@RewardFunction

Something went wrong.

United States Trends

1. Massie 92.1K posts
2. #Varanasi 168K posts
3. #CollegeGameDay 1,915 posts
4. Lawson Luckie N/A
5. #MeAndTheeSeriesEP1 1.35M posts
6. Good Saturday 35.4K posts
7. #Caturday 4,821 posts
8. Willie Green 4,765 posts
9. #SaturdayVibes 5,268 posts
10. Todd Snider N/A
11. Brooklynn 3,717 posts
12. Virginia Tech 2,388 posts
13. Draymond 30.6K posts
14. Senior Day 2,737 posts
15. Marjorie 113K posts
16. PONDPHUWIN AT MAT PREMIERE 858K posts
17. James Borrego 1,695 posts
18. Lindsey Graham 21K posts
19. Mike Elko N/A
20. James Franklin 2,115 posts