Yuxuan Wang

@log_pie

synthetic biology will be the next next big thing. will revisit in 10 years.


Realtime on-device audio synthesis. Audio content creation is big on TikTok. As always, more coming soon.

Realtime, low-latency and on-device ML audio synthesis! These TikToks using our brand new voice change effects made my day 🤣 Inference runs in realtime across all Android and iOS devices, both old and new. Can't wait to see more, 2 million views and counting! 🗣️➡️🐈 & 🐮➡️🎵 MEOW!



Yuxuan Wang reposted

*yells in Chewbacca* 📣👂 New Text-to-Speech voices ft. some of your favorite characters are available now on @TikTok! To unlock them, discover the mystery keywords. #DisneyPlusDay


Yuxuan Wang reposted

Anthony Levandowski to Larry Page: Google's self-driving project is broken January 9, 2016

Music and sound are essential to TikTok's endless creativity. After our text-to-speech feature, here's another example of how we use audio technologies to inspire creativity and help users make cool content. ⁣ Come join us! We are hiring!

As competition heats up, TikTok announces six new interactive music effects for creators tcrn.ch/3t0fB4T by @sarahintampa



awesome and congrats!! should make a demo of "Today is Monday!". 😜 @rustyryan

New work with Ron Weiss, @EricBattenberg, @sorooshooryad, @dpkingma -- finally achieving what @log_pie and I set out to do in 2016 before switching to spectrograms: direct waveform generation from characters. (1/7) abs: arxiv.org/abs/2011.03568 samples: google.github.io/tacotron/publi…



time flies. It's true.😆

i hope we are really undergoing the second neural net renaissance.



Yuxuan Wang reposted

RT There's an E-6B Mercury off the east coast near DC. I looked because I would expect them to pop up if he tests positive. It's a message to the small group of adversaries with SLBMs and ICBMs.

Yuxuan Wang reposted

Immigration has contributed immensely to America’s economic success, making it a global leader in tech, and also Google the company it is today. Disappointed by today’s proclamation - we’ll continue to stand with immigrants and work to expand opportunity for all.


Yuxuan Wang reposted

I'm excited to share Capacitron, the Tacotron team's most recent contribution to the world of expressive end-to-end speech synthesis (e.g., transfer and control of prosody and speaking style). arxiv.org/abs/1906.03402


Yuxuan Wang reposted

We released a new large-scale corpus of English speech derived for TTS; LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech Dataset: openslr.org/60/ Paper: arxiv.org/abs/1904.02882


Yuxuan Wang reposted

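Still experimenting with rhythm generation. I built a model that generates rhythms with a Variational Autoencoder, including the dynamics (velocity) and the subtle timing fluctuations of each hit, and I feel like it is starting to sound pretty cool.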


go Daisy! @daisystanton

Daisy Stanton, @Google on "Predicting expressive speaking style from text in end-to-end speech synthesis" presenting now at Kallirhoe Hall on the first day of #SLT2018


PCEN rocks.

PCEN is an excellent audio frontend for sound recognition in far-field recordings. But... why? And how to configure it for your application? Answers in our new paper led by @lostanlen in collab @CornellBirds @nyuMARL @NYU_CUSP #nocmig justinsalamon.com/news/per-chann…



TikTok is a flagship product from ByteDance.

New blog post — We're only starting to see the beginning of Consumer AI apps, including TikTok and more, all of which I go into here: a16z.com/2018/12/03/whe…



Yuxuan Wang reposted

Our work on mixing input types for controlling TTS synthesis is now up on arxiv! arxiv.org/abs/1811.07240 / samples s3.amazonaws.com/representation…. See the sampling code github.com/kastnerkyle/re…, or jump straight to the colab to try on your own colab.research.google.com/github/kastner…

Yuxuan Wang reposted

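Magenta Studio - Ableton Live Plugin buff.ly/2Tx1qTD Magenta, Google's package of music generation models, has been released as a plugin for Ableton Live. You can generate drum sequences, interpolate between two sequences, and more... lots of easy ways to play around.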


Yuxuan Wang reposted

A drum and bass track that my friend Max made using only samples generated with WaveNet/VAE. Deep learning x sound design. buff.ly/2AK8CEq I really love this kind of use! Project details here: NeuralFunk - Combining Deep Learning with Sound Design buff.ly/2OP2WBJ

Yuxuan Wang reposted

Hierarchical Generative Modeling for Controllable Speech Synthesis. arxiv.org/abs/1810.07217
