
Avirup Dey

@avideypi

3d vision / head avatars CS PhD @UniofBath

Avirup Dey reposted

Good news: Soham Parekh, a PhD student in our group, will finally have the bandwidth to focus on his academic career! We just awarded him another scholarship that allows him to work remote.


doing god's work 🙏

🤖 🗞️ Free AI Generated Digital Humans Newsletter -> lovable.dev/projects/6849f… Like everyone else, I've been struggling to keep up with the sheer volume of papers and news in the Digital Human space. Over the past few weeks, I've developed an agentic (ish) AI pipeline to find and…

[Tweet image from jack_r_saunders]


Avirup Dey reposted

Ever wish you could turn your video generator into a controllable physics simulator? We're thrilled to introduce Force Prompting! Animate any image with physical forces and get fine-grained control, without needing any physics simulator or 3D assets at inference. 🧵(1/n)


Ashamed of Mr. Tharoor for speaking up for his country instead of pandering to his "fans". No progressive liberal should condemn terrorists and their enablers.

Two Indians who lost a lot of credibility during the war are Dhruv Rathee and Shashi Tharoor. Both came across as hyper-nationalist, overly emotional, and lacked a balanced understanding, resorting to biased logic rather than informed perspective. Disappointing to see.

[Tweet images from NkWarraich]


easy fix: invoke a specialized model (some variant of instant-id) instead of DALLE when working with faces.

asking ai to perfectly replicate this picture of dwayne “the rock” johnson 101 times



Avirup Dey reposted

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos TLDR: A diffusion-based method is used to generate 3D renderable talking heads from a single image, using a single denoising step for real-time. The model can learn from…


Avirup Dey reposted

It's over. Meta just announced MoCha, a new model that turns text or voice into super realistic talking characters. There is no way to tell anymore... 10 examples: (please unmute)


One hell of a thread!

The original architects of American global power did something very clever that no other empire had ever done before: they deliberately hid the instruments of their power. Specifically, they institutionalized the hard power of the post-WW2 American military into a "rules-based…



Avirup Dey reposted

After hacking GPT-4o's frontend, I made amazing discoveries: 💡The line-by-line image generation effect users see is just a browser-side animation (pure frontend trick) 🔦OpenAI's server sends only 5 intermediate images per generation, captured at different stages 🎾Patch size=8


the team took it personally xD

[Tweet image from avideypi]

Avirup Dey reposted

No... the fear isn't irrelevance, but that's a comforting thing for us technologists to tell ourselves. The fear is degradation. You can sense where this leads even if you can't put your finger on it, which is why so many feel vaguely uneasy. They say they hate it but they…

[Tweet images from ZyMazza]

re: backlash against AI-generated Ghibli-style art the reaction here boils down to one simple fear: irrelevance what took artists years of dedication, practice, and struggle can now be replicated effortlessly by ai for nothing. resentment for this loss of exclusivity is…



Avirup Dey reposted

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model TLDR: A hierarchical diffusion-based speech-to-3DMM generator that decomposes FLAME parameters into lip, eye and global regions for better quality. It uses PiRender to get the final…


Avirup Dey reposted

Have you ever thanked ptrblck for his answers on your CUDA problems?

[Tweet image from cloneofsimo]

Avirup Dey reposted

Pippo: High-Resolution Multi-View Humans from a Single Image TLDR: High-resolution and Multi-View but static human Generation from one image. Uses a DiT to generate the images directly (e.g. no 3D representation), with a control MLP. Imagine this distilled onto one of Meta's…


Avirup Dey reposted

Decentralized Diffusion Models power stronger models trained on more accessible infrastructure. DDMs mitigate the networking bottleneck that locks training into expensive and power-hungry centralized clusters. They scale gracefully to billions of parameters and generate…


Avirup Dey reposted

📢📢📢 "𝐑𝐚𝐝𝐢𝐚𝐧𝐭 𝐅𝐨𝐚𝐦: Real-Time Differentiable Ray Tracing", a mesh-based 3D representation. radfoam.github.io arxiv.org/abs/2502.01157 Co-led by my PhD students Shrisudhan Govindarajan and Daniel Rebain, and w/ @kwangmoo_yi


Exciting opportunity for those working on avatars. Do sign up!

📢 Working on Digital Humans/Avatars? Register for the BMVA Symposium on Digital Humans on the 28th of May in London! We're also looking for recent work to be presented on the day. Not sure if your work fits? Send me a DM. Register/submit here 👉 bmva.org/meetings/25-05… Many…

[Tweet image from jack_r_saunders]


Avirup Dey reposted

POV: A Chinese company drops a powerful open source reasoning AI model competitive with o1 that neither of your companies have released yet as products while you're attending an inauguration

[Tweet image from altryne]

Avirup Dey reposted

I'm excited to share what I've been working on during my summer internship at @Microsoft. GASP creates photorealistic, real-time Avatars from an image or short video. Project page: microsoft.github.io/GASP/ Arxiv paper: arxiv.org/abs/2412.07739… Demo Video: youtube.com/watch?v=3oWB7-…

