Deep RL

@deep_rl

Papers about distributed deep and reinforcement learning.

Sequence-to-Sequence Forecasting-aided State Estimation for Power Systems - Kamal Basulaiman ift.tt/Hc179Qz


Learning to detect an animal sound from five examples - Inês Nolasco ift.tt/bdojBsW


Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman - Jannis Weil ift.tt/uxPUR4N


U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech - Xin Jing ift.tt/OqMnX0J


Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice - Toshinori Kitamura ift.tt/pMNhyQo


Editing Large Language Models: Problems, Methods, and Opportunities - Yunzhi Yao ift.tt/EMXdoLb


Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model - Peter Súkeník ift.tt/aTSCfwB


INVICTUS: Optimizing Boolean Logic Circuit Synthesis via Synergistic Learning and Search - Animesh Basak Chowdhury ift.tt/vXDqfyJ


Scaling Serverless Functions in Edge Networks: A Reinforcement Learning Approach - Mounir Bensalem ift.tt/9y2WzmR


Hang-Time HAR: A Benchmark Dataset for Basketball Activity Recognition using Wrist-worn Inertial Sensors - Alexander Hoelzemann ift.tt/Qjqgfnr


Policy Representation via Diffusion Probability Model for Reinforcement Learning - Long Yang ift.tt/gor5WdU


Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test - Eungbeom Kim ift.tt/RZSgxFI


Restore Anything Pipeline: Segment Anything Meets Image Restoration - Jiaxi Jiang ift.tt/D3j8R4r


Breaking the Paradox of Explainable Deep Learning - Arlind Kadra ift.tt/icHyhBU


Hierarchical Partitioning Forecaster - Christopher Mattern ift.tt/Kurd4zf


Road Planning for Slums via Deep Reinforcement Learning - Yu Zheng ift.tt/HflL0ob


Federated Learning of Medical Concepts Embedding using BEHRT - Ofir Ben Shoham ift.tt/T4ZorWi


POEM: Polarization of Embeddings for Domain-Invariant Representations - Sang-Yeong Jo ift.tt/qrD9C7g


Distributed Learning over Networks with Graph-Attention-Based Personalization - Zhuojun Tian ift.tt/cQxldRL


Towards generalizing deep-audio fake detection networks - Konstantin Gasenzer ift.tt/RVvDdUS

