Avirup Dey
@avideypi
3d vision / head avatars CS PhD @UniofBath
You might like
Good news: Soham Parekh, a PhD student in our group, will finally have the bandwidth to focus on his academic career! We just awarded him another scholarship that allows him to work remote.
doing god's work 🙏
🤖 🗞️ Free AI Generated Digital Humans Newsletter -> lovable.dev/projects/6849f… Like everyone else, I've been struggling to keep up with the sheer volume of papers and news in the Digital Human space. Over the past few weeks, I've developed an agentic (ish) AI pipeline to find and…
Ever wish you could turn your video generator into a controllable physics simulator? We're thrilled to introduce Force Prompting! Animate any image with physical forces and get fine-grained control, without needing any physics simulator or 3D assets at inference. 🧵(1/n)
Ashamed of Mr. Tharoor for speaking up for his country instead of pandering his "fans". No progressive liberal should condemn terrorists and their enablers.
Two Indians who lost a lot of credibility during the war are Dhruv Rathee and Shashi Tharoor. Both came across as hyper-nationalist, overly emotional, and lacked a balanced understanding, resorting to biased logic rather than informed perspective. Disappointing to see.
easy fix: invoke a specialized model (some variant of instant-id) instead of DALLE when working with faces.
asking ai to perfectly replicate this picture of dwayne “the rock” johnson 101 times
IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos TLDR: A diffusion-based method is used to generate 3D renderable talking heads from a single image, using a single denoising step for real-time. The model can learn from…
It's over. Meta just announced MoCha, a new model that turns text or voice into super realistic talking characters. There is no way to tell anymore... 10 examples: (please unmute)
One hell of a thread!
The original architects of American global power did something very clever that no other empire had ever done before: they deliberately hid the instruments of their power. Specifically, they institutionalized the hard power of the post-WW2 American military into a "rules-based…
After hacking GPT-4o's frontend, I made amazing discoveries: 💡The line-by-line image generation effect users see is just a browser-side animation (pure frontend trick) 🔦OpenAI's server sends only 5 intermediate images per generation, captured at different stages 🎾Patch size=8
No... the fear isn't irrelevance, but that's a comforting thing for us technologists to tell ourselves. The fear is degradation. You can sense where this leads even if you can't put your finger on it, which is why so many feel vaguely uneasy. They say they hate it but they…
re: backlash against AI-generated Ghibli-style art the reaction here boils down to one simple fear: irrelevance what took artists years of dedication, practice, and struggle can now be replicated effortlessly by ai for nothing. resentment for this loss of exclusivity is…
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model TLDR: A hierarchical diffusion-based speech-to-3DMM generator that decomposes FLAME parameters into lip, eye and global regions for better quality. It uses PiRender to get the final…
Have you ever thanked ptrblck for his answers on your CUDA problems?
Pippo: High-Resolution Multi-View Humans from a Single Image TLDR: High-resolution and Multi-View but static human Generation from one image. Uses a DiT to generate the images directly (e.g. no 3D representation), with a control MLP. Imagine this distilled onto one of Meta's…
Decentralized Diffusion Models power stronger models trained on more accessible infrastructure. DDMs mitigate the networking bottleneck that locks training into expensive and power-hungry centralized clusters. They scale gracefully to billions of parameters and generate…
📢📢📢 "𝐑𝐚𝐝𝐢𝐚𝐧𝐭 𝐅𝐨𝐚𝐦: Real-Time Differentiable Ray Tracing", a mesh-based 3D represention. radfoam.github.io arxiv.org/abs/2502.01157 Co-lead by my PhD students Shrisudhan Govindarajan and Daniel Rebain, and w/ @kwangmoo_yi
Exciting opportunity for those working on avatars. Do sign up!
📢 Working on Digital Humans/Avatars? Register for the BMVA Symposium on Digital Humans on the 28th of May in London! We're also looking for recent work to be presented on the day. Not sure if your work fits? Send me a DM. Register/submit here 👉 bmva.org/meetings/25-05… Many…
POV: A chinese company drops a powerful open source reasoning AI model competitive with o1 that neither of your companies have released yet as products while you're attending an inauguration
I'm excited to share what I've been working on during my summer internship at @Microsoft . GASP creates photorealistic, real-time Avatars from an image or short video. Project page: microsoft.github.io/GASP/ Arxiv paper: arxiv.org/abs/2412.07739… Demo Video: youtube.com/watch?v=3oWB7-…
United States Trends
- 1. Good Sunday 63.6K posts
- 2. Klay 27.3K posts
- 3. McLaren 117K posts
- 4. #sundayvibes 5,073 posts
- 5. #FG3Dライブ 106K posts
- 6. #sundaymotivation 3,390 posts
- 7. #FelizCumpleañosNico 4,180 posts
- 8. Ja Morant 12.8K posts
- 9. #FelizCumpleañosPresidente 3,686 posts
- 10. Lando 141K posts
- 11. For the Lord 30K posts
- 12. Tottenham 45.8K posts
- 13. Piastri 84.1K posts
- 14. South Asia 40K posts
- 15. Oscar 133K posts
- 16. Arsenal 172K posts
- 17. Chattanooga State 1,100 posts
- 18. Uranus 4,261 posts
- 19. Bacon 61.1K posts
- 20. Rubio 98.5K posts
Something went wrong.
Something went wrong.