AI Bites | YouTube Channel

@ai_bites

AI Happenings, papers and ideas tweet. Opensource online AI education for the world. Former @UniofOxford @Oxford_VGG

Science & Technology

YouTube →

youtube.com/c/AIBites

Joined July 2014

3KPosts 2KFollowers 708Following

You might like

@LysandreJik

@MLStreetTalk

@ml_collective

@AICoffeeBreak

@gordic_aleksa

@A_K_Nain

@wightmanr

@marksaroufim

@HildeKuehne

@NagraniArsha

@suyashfulay

@__Proto__16

@ritwik_raha

@KinasRemek

@ariG23498

AI Bites | YouTube Channel

2 m

🤖 GIVEAWAY TIME! We are giving away a one-page LangChain Cheatsheet! ⚡ It will really make you productive if you are building with LangChain! It's something we always refer to when we build LLM pipelines. To get it: 1️⃣ Follow us (@ai_bites) 2️⃣ Like ❤️ + Repost 🔁 this post…

AI Bites | YouTube Channel

2 h

We just published Building Multi-Agent Systems with LangGraph — A Comprehensive Guide medium.com/p/building-mul… #AI #agenticAI #GenerativeAI #agents #AIイラスト

AI Bites | YouTube Channel

3 h

The field of video generation is undergoing a paradigm shift - from generating realistic and appealing visuals to constructing world models that can simulate interactive and navigable environments. These models are not just visual tools; they serve as testbeds for training and…

ai_bites's tweet image. The field of video generation is undergoing a paradigm shift - from generating realistic and appealing visuals to constructing world models that can simulate interactive and navigable environments. These models are not just visual tools; they serve as testbeds for training and…

AI Bites | YouTube Channel

3 h

DreamLand, a novel frontend visualization framework designed to enable real-time, multimodal interaction with 4D (spatiotemporal) scenes. While recent advances in vision and language models have enabled rich 3D content generation, existing WebGL-based systems remain limited in…

AI Bites | YouTube Channel

3 h

UniVA: Universal Video Agent! Describe a universe, a campaign, a pet, or a long-form story! UniVA will plan, compose and produce the video for you. Paper Title: UniVA: Universal Video Agent towards Open-Source Next-Generation Video Project: univa.online Link:…

AI Bites | YouTube Channel

3 h

FlowFeat distills optical flow networks into pixel-level task-agnostic representations. FlowFeat provides versatile pixel-level features. Using motion-driven embedding statistics, it achieves high spatial precision and temporal consistency Paper Title: FlowFeat: Pixel-Dense…

AI Bites | YouTube Channel

Nov 5

PercHead reconstructs 3D heads from a single input image and enables disentangled 3D editing using semantic maps combined with image or text-based style inputs. Paper Title: PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction Project: antoniooroz.github.io/PercHead/…

ai_bites's tweet image. PercHead reconstructs 3D heads from a single input image and enables disentangled 3D editing using semantic maps combined with image or text-based style inputs.

Paper Title: PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction
Project: antoniooroz.github.io/PercHead/…

AI Bites | YouTube Channel

Nov 5

DenseMarks — a new learned representation for human heads, enabling high-quality dense correspondences. A Vision Transformer network predicts a 3D embedding for each pixel, corresponding to a location in a 3D canonical unit cube. The network is trained using pairwise point…

AI Bites | YouTube Channel

Nov 5

FreeArt3D is a training-free framework that generates articulated 3D objects from a few images by leveraging a pre-trained 3D diffusion model for static objects. It jointly optimizes geometry, texture, and kinematics, achieving high-fidelity results across diverse categories…

ai_bites's tweet image. FreeArt3D is a training-free framework that generates articulated 3D objects from a few images by leveraging a pre-trained 3D diffusion model for static objects. It jointly optimizes geometry, texture, and kinematics, achieving high-fidelity results across diverse categories…

AI Bites | YouTube Channel

Nov 5

Genie Envisioner (GE), a unified world foundation platform for robotic manipulation that integrates policy learning, evaluation, and simulation within a single video-generative framework. At its core, GE-Base is a large-scale, instruction-conditioned video diffusion model that…

AI Bites | YouTube Channel

Oct 30

Pixel-Perfect Depth, a monocular depth estimation model with pixel-space diffusion transformers. Compared to existing discriminative and generative models, its estimated depth maps can produce high-quality, flying-pixel-free point clouds, without any post-processing. Paper…

AI Bites | YouTube Channel

Oct 30

VFXMaster, the first unified, reference-based framework for Visual effects video generation. It recasts effect generation as an in-context learning task, enabling it to reproduce diverse dynamic effects from a reference video onto target content. In addition, it demonstrates…

AI Bites | YouTube Channel

Oct 30

U-CAN, an Unsupervised framework for point cloud denoising with Consistency-Aware Noise2Noise matching. Specifically, it leverages a neural network to infer a multi-step denoising path for each point of a shape or scene with a noise to noise matching schema. Paper Title: U-CAN:…

AI Bites | YouTube Channel

Oct 30

VividCam, a training paradigm that enables diffusion models to learn complex camera motions from synthetic videos, releasing the reliance on collecting realistic training videos. VividCam incorporates multiple disentanglement strategies that isolates camera motion learning from…

AI Bites | YouTube Channel

Oct 30

This survey paper systematically categorize efficient Vision-Language-Action (VLA) models into three core pillars: (1) Efficient Model Design, encompassing efficient architectures and model compression techniques; (2) Efficient Training, covering efficient pre-training and…

ai_bites's tweet image. This survey paper systematically categorize efficient Vision-Language-Action (VLA) models into three core pillars:
(1) Efficient Model Design, encompassing efficient architectures and model compression techniques;
(2) Efficient Training, covering efficient pre-training and…

AI Bites | YouTube Channel

Oct 29

Anywhere3D-Bench, a holistic 3D visual grounding benchmark consisting of 2.8k referring expression-3D bounding box pairs spanning four different grounding levels: human-activity areas, unoccupied space beyond objects, objects in the scene, and fine-grained object parts. Paper…

ai_bites's tweet image. Anywhere3D-Bench, a holistic 3D visual grounding benchmark consisting of 2.8k referring expression-3D bounding box pairs spanning four different grounding levels: human-activity areas, unoccupied space beyond objects, objects in the scene, and fine-grained object parts.

Paper…

AI Bites | YouTube Channel

Oct 29

Given a pair of equirectangular images captured by two vertically stacked omnidirectional cameras, DFI-OmniStereo integrates a large-scale pre-trained monocular relative depth foundation model into an iterative stereo matching approach. This method improves depth estimation…

AI Bites | YouTube Channel

Oct 29

RapVerse investigates the extent to which scaling autoregressive multimodal transformers across language, audio, and motion can enhance the coherent and realistic generation of vocals and whole-body human motions. Paper Title: RapVerse: Coherent Vocals and Whole-Body Motions…

AI Bites | YouTube Channel

Oct 29

Generative View Stitching enables collision-free camera-guided video generation for predefined trajectories, and presents a non-autoregressive alternative to video length extrapolation. Given a pretrained DFoT video model with an 8-frame context window and a predefined camera…

AI Bites | YouTube Channel

Oct 29

Latent Sketchpad, a framework that equips MLLMs with an internal visual scratchpad. The internal visual representations of MLLMs have traditionally been confined to perceptual understanding. We repurpose them to support generative visual thought without compromising reasoning…

Vicky Kalogeiton

@VickyKalogeiton

Sornram Juljue

@sornramjuljue

GillBrook

@X174t4880UVn1m

behzad behzadian

@BBehzadian43227

JeanPowell

@Uu621zerSANk9

W3i Reviews

@W3iReviews

Ahmed Fawaz

@ahmedfawaz879

B@bby

@realAIEngineer

Ahmet Tarık Kaya

@IamTethark

Hara爺

@nougaki

Ben McDowell

@bmcd243

purple emojis

@cavecanems

Olivers Kell

@OliversKellis

Saeid Asgari

@saeid_asg

Haiku Videos (俳句動画) 🇦🇺🎌

@jadenedaj

Alex Mirran

@alex_mirran

Wami

@Wami113

adem yavuz

@secrops

PG BOY

@PgBoy48509

Juho

@_J_U_H_O_

NEC Labs America

@NECLabsAmerica

KamaBridges

@KtQ7WXUN5wMM8bw

조현우

@ai_makeworld

soryfloretta14233

@soryfloret96921

Annie

@Anju_work

jollyengineer

@thejollyengg

lopikanka

@lopikanka

Valyou Investing

@ValyouInvesting

cui

@cui1292660

Sealvuix

@Sealvuix814179

YT Automation

@NaeemAbbas26636

Ifty Mohammad Rezwan

@imr165

yu2392901

@yu2392901

ANIL K SHUKLA, PhD

@shuklaaks

dev.ansh

@peachstrike

100dni

@PROJEKT100DNI

Chitranjan

@chitranjanjain1

Bidaw

@Bidaw77826

The World-TIME

@cly048557680171

Ghias Ali

@GhiasAli12

Qiandehou

@Freemyloop

Envolate Industries

@envolate

NYCOG

@NYCOG395873

NYCOG

@NYCOG326388

Brunu

@Brunu072919

yo dumb here

@urprettymystery

YouTube growth

@MdShakib766227

Kishore Kumar A

@itzzmekishore

Radhakrishnan A

@Radhakrish_A

Emily Rose ✪

@officialr_o_s_e

Norris Bennett

@itsyaboiNB_912

Yann LeCun

@ylecun

François Chollet

@fchollet

AK

@_akhaliq

Andrej Karpathy

@karpathy

hardmaru

@hardmaru

Alfredo Canziani

@alfcnz

F. Güney

@ftm_guney

Jim Fan

@DrJimFan

Sebastian Raschka

@rasbt

Google DeepMind

@GoogleDeepMind

AI at Meta

@AIatMeta

Michael Black

@Michael_J_Black

Jason Wei

@_jasonwei

Grant Sanderson

@3blue1brown

Machine Learning Street Talk

@MLStreetTalk

Jia-Bin Huang

@jbhuang0604

Andrew Brown

@Andrew__Brown__

Vicky Kalogeiton

@VickyKalogeiton

Aleksa Gordić (水平问题)

@gordic_aleksa

Edward Grefenstette

@egrefen

Oliver Kell

@OliverKell_

a16z

@a16z

Tony Dinh 🎯

@tdinh_me

Brett

@BrettFromDJ

Marc Köhlbrugge

@marckohlbrugge

Marc Lou

@marc_louvion

Danny Postma

@dannypostmaa

Jon Yongfook

@yongfook

Daniel Vassallo

@dvassallo

Arvid Kahl

@arvidkahl

Dagobert - Corporate sellout 👔

@dagorenouf

Tibo

@tibo_maker

@levelsio

@levelsio

Julian Goldie SEO

@JulianGoldieSEO

Prompt

@engineerrprompt

Mark (Coolmark)

@Coolmark482

Lovable

@Lovable

yukon cornelius

@patrickdennis99

AshutoshShrivastava

@ai_for_success

lmarena.ai

@arena

Mervin Praison

@MervinPraison

Tanay Mehta

@serious_mehta

@[email protected] 🚀 🏆 craft-at-heart

@theNeomatrix369

Omar Khattab

@lateinteraction

Ziwei Liu

@liuziwei7

Songyou Peng

@songyoupeng

Georgi Gerganov

@ggerganov

LangChain

@LangChainAI

LlamaIndex 🦙

@llama_index

Edward Hu

@edwardjhu

Akshay 🚀

@akshay_pachaar

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

Silvio Savarese

@silviocinguetta

Cory Mitchell, CMT

@corymitc

British Machine Vision Conference (BMVC)

@BMVCconf

International Conference on 3D Vision

@3DVconf

The Inner Circle Trader

@I_Am_The_ICT

Lior Alexander

@LiorOnAI

Matthias Niessner

@MattNiessner

Hugging Face

@huggingface

HackerNoon | Learn Any Technology

@hackernoon

Marc Andreessen 🇺🇸

@pmarca

Been Kim

@_beenkim

ICLR 2026

@iclr_conf

gensyn

@gensynai

Eric Jang

@ericjang11

United States Trends

1. Rosalina 7,978 posts
2. Bowser Jr 1,923 posts
3. Brie Larson 2,213 posts
4. Crypto ETFs 2,909 posts
5. Good Wednesday 29.5K posts
6. Jameis 3,527 posts
7. #wednesdaymotivation 4,184 posts
8. #Wednesdayvibe 2,230 posts
9. Hump Day 13.6K posts
10. #Talus_Labs N/A
11. #SuperMarioGalaxyMovie N/A
12. H-1B 54K posts
13. ADOR 71.4K posts
14. Happy Hump 8,662 posts
15. #hazbinhotelseason2 47.4K posts
16. Northern Lights 56.1K posts
17. Jack Schlossberg 3,117 posts
18. H1-B 6,316 posts
19. Hanni 21.7K posts
20. Antarctica 9,833 posts

You might like

Something went wrong.

Something went wrong.