
Phillip Rust

@rust_phillip

Research Scientist @AIatMeta (FAIR) • PhD @coastalcph

Pinned

Happy to share that our paper on language modelling with pixels has been accepted to ICLR’23 (notable-top-5% / oral) 🎉. Big thanks and congrats to Team-PIXEL @jonasflotz @ebugliarello @esalesk @mdlhx @delliott and looking forward to presenting in Kigali! 🌍 #ICLR2023

Tired of tokenizers/subwords? Check out PIXEL, a new language model that processes written text as images📸 “Language Modelling with Pixels” 📄 arxiv.org/abs/2207.06991 🧑‍💻github.com/xplip/pixel 🤖huggingface.co/Team-PIXEL/pix… by @rust_phillip @jonasflotz me @esalesk @mdlhx @delliott


Phillip Rust reposted

Tough week! I also got impacted less than 3 months after joining. Ironically, I just landed some new RL infra features the day before. Life moves on. My past work spans RL, PEFT, Quantization, and Multimodal LLMs. If your team is working on these areas, I’d love to connect.

Meta has gone crazy on the squid game! Many new PhD NGs are deactivated today (I am also impacted🥲 happy to chat)



Phillip Rust reposted

Humans see text — but LLMs don’t. I wrote a short blog post exploring how models can perceive text visually rather than tokenize it: 🔗 csu-jpg.github.io/Blog/people_se… From PIXEL, CLIPPO, VisInContext, VIST to DeepSeek-OCR, this is a quick story of how vision-centric modeling is…


I will be presenting this work in-person at ACL🇹🇭 this week. Drop by if you'd like to chat! Oral: Today (Monday) 16:30 Poster: Tuesday (Tomorrow) 10:30 - 12:00

Introducing “Towards Privacy-Aware Sign Language Translation at Scale” We leverage self-supervised pretraining on anonymized videos, achieving SOTA ASL-to-English translation performance while mitigating risks arising from biometric data. 📄: arxiv.org/abs/2402.09611 🧵(1/9)


Phillip Rust reposted

New preprint "Improving Language Understanding from Screenshots" w/ @zwcolin @AdithyaNLP @danqi_chen. We improve language understanding abilities of screenshot LMs, an emerging family of models that processes everything (including text) via visual inputs arxiv.org/abs/2402.14073


Phillip Rust reposted

In PHD: Pixel-Based Language Modeling of Historical Documents with @NadavBorenstein @rust_phillip and @IAugenstein, we apply pixel language models to processing historical documents and to more standard NLP classification tasks too. See it in Poster Session 6 on Sunday 10th.


Phillip Rust reposted

In Text Rendering Strategies for Pixel Language Models with @jonasflotz @rust_phillip and @esalesk, we design new text renderers for visual language processing to improve performance or to squeeze the model down to just 22M parameters. See it in Poster Session 2 on Friday 8th.


Phillip Rust reposted

anon policy survey is out: tinyurl.com/aclarxivpolicy


Phillip Rust reposted

Introducing SeamlessM4T, the first all-in-one, multilingual multimodal translation model. This single model can perform tasks across speech-to-text, speech-to-speech, text-to-text translation & speech recognition for up to 100 languages depending on the task. Details ⬇️


Phillip Rust reposted

📢 I am hiring a postdoc to join our project on pixel-based natural language processing. The position is based in Copenhagen 🇩🇰 for 18 months. Applications are due by March 29 employment.ku.dk/faculty/?show=…. Informal inquiries are welcome.

Thrilled to receive a grant from @VILLUMFONDEN to carry out blue-skies research on tokenization-free NLP veluxfoundations.dk/en/about/proje… I will hire Ph.Ds and Postdocs to build up the group so feel free to reach out. We're starting off with a paper at #ICLR2023 openreview.net/forum?id=FkSp8…


