Kaleem (@kaleemcs) · Data Science

Real-time weather transitions on Gaussian Splats, captured on the K1. Huge appreciation to RCS Studios and Volinga. The smooth shift between clear skies, rain, snow, and night shows how flexible environment control becomes when procedural systems meet high-quality 3DGS captures 🚀



🕸️ Introducing SPIDER — Scalable Physics-Informed Dexterous Retargeting! A dynamically feasible, cross-embodiment retargeting framework for BOTH humanoids 🤖 and dexterous hands ✋. From human motion → sim → real robots, at scale. 🔗 Website: jc-bao.github.io/spider-project/ 🧵 1/n



After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3…



🚧 Precision underground. Watch the Lixel L2 Pro put to the test, scanning 250 m of tunnels in just 18 minutes using Multi-SLAM tech.
⚙️ Real-time 3D point clouds
📏 1 cm relative / 3 cm absolute accuracy
🌍 300 m range
Thanks to @geocomchile for proving what XGRIDS can do 🙌



RF-DETR paper is finally on arXiv
- real-time detection with a DINOv2 backbone
- runs neural architecture search (NAS) over about 6,000 architecture variants
- uses weight sharing across all configs
- first real-time segmentation DETR to break past top YOLO results
↓ more
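For readers unfamiliar with weight-sharing NAS: every candidate architecture reads its parameters from slices of one shared tensor, so thousands of variants can be scored without training each from scratch. A minimal sketch of the slicing trick (toy layer and dimensions of my own choosing, not RF-DETR's actual search space):

```python
import torch
import torch.nn as nn

class SharedLinear(nn.Module):
    """One weight matrix sized for the largest config; smaller configs
    read the top-left slice, so every variant shares the same parameters."""
    def __init__(self, max_in: int, max_out: int):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(max_out, max_in) * 0.02)
        self.bias = nn.Parameter(torch.zeros(max_out))

    def forward(self, x: torch.Tensor, out_dim: int) -> torch.Tensor:
        w = self.weight[:out_dim, : x.shape[-1]]   # slice the shared tensor
        return x @ w.t() + self.bias[:out_dim]

layer = SharedLinear(max_in=256, max_out=512)
x = torch.randn(4, 128)
# Two "architecture variants" evaluated with the same underlying weights:
print(layer(x, out_dim=64).shape, layer(x, out_dim=512).shape)
```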



SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields
Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim
tl;dr: sparse optical flow -> linear and angular velocity; generalized 3D ray-based motion field -> different camera models
arxiv.org/abs/2511.09072
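For intuition on the tl;dr: in the classical instantaneous motion-field model, the flow of a point at normalized coordinates (x, y) with inverse depth 1/Z is linear in the camera's linear velocity v and angular velocity ω, so a handful of sparse flow vectors gives an overdetermined system solvable by least squares. A toy pinhole version of that idea (the paper's generalized ray-based formulation extends this to other camera models):

```python
import numpy as np

rng = np.random.default_rng(0)

def flow_matrix(x, y, inv_z):
    """Stack the per-point 2x6 motion-field Jacobian mapping the twist
    (vx, vy, vz, wx, wy, wz) to the observed flow (u, v)."""
    rows = []
    for xi, yi, zi in zip(x, y, inv_z):
        rows.append([-zi, 0, xi * zi, xi * yi, -(1 + xi**2), yi])
        rows.append([0, -zi, yi * zi, 1 + yi**2, -xi * yi, -xi])
    return np.asarray(rows)

# Synthetic scene: sparse points with known inverse depth.
n = 50
x, y = rng.uniform(-0.5, 0.5, n), rng.uniform(-0.5, 0.5, n)
inv_z = rng.uniform(0.2, 1.0, n)

twist_true = np.array([0.1, -0.05, 0.3, 0.01, 0.02, -0.01])  # (v, w)
A = flow_matrix(x, y, inv_z)
flow = A @ twist_true + rng.normal(0, 1e-4, 2 * n)  # noisy sparse flow

twist_est, *_ = np.linalg.lstsq(A, flow, rcond=None)
print(np.round(twist_est, 4))  # recovers (v, w) up to noise
```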


Most trackers lose sight of an object once it changes shape… an apple turns into slices, a caterpillar into a butterfly, and the model just gives up. Researchers at Cornell built a new system called Track Any State that does something different: it follows…
Paper: “Tracking and Understanding Object Transformations” (NeurIPS 2025)
Code & Dataset: tubelet-graph.github.io



What aspects of human knowledge do vision models like CLIP fail to capture, and how can we improve them? We suggest models miss key global organization; aligning them makes them more robust. Check out @lukas_mut's work, finally out (in @Nature!?) + our new blogpost! 1/4

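A rough picture of what "aligning" a model to human similarity structure can mean: learn a transform on the embeddings that pulls the model's similarity matrix toward human judgments. The sketch below is a deliberately simplified stand-in with random data, not the method in the paper:

```python
import numpy as np

rng = np.random.default_rng(1)
m, d = 40, 16

# Toy stand-ins: unit-norm "model embeddings" for m concepts, plus a
# hypothetical human similarity matrix (e.g. from odd-one-out judgments).
E = rng.normal(size=(m, d))
E /= np.linalg.norm(E, axis=1, keepdims=True)
S_human = np.tanh(rng.normal(size=(m, m)))
S_human = (S_human + S_human.T) / 2

W = np.eye(d)                              # learnable linear alignment map
lr = 1.0
for _ in range(2000):
    Z = E @ W
    R = Z @ Z.T - S_human                  # similarity residual
    W -= lr * (E.T @ R @ Z) / m**2         # gradient step on ||R||^2

Z = E @ W
print(np.linalg.norm(E @ E.T - S_human),   # similarity error before alignment
      np.linalg.norm(Z @ Z.T - S_human))   # similarity error after alignment
```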


OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS
Haiyi Li, Qi Chen, Denis Kalkofen, Hsiang-Ting Chen
tl;dr: Gaussian parameters -> covariance -> diagonal Fisher Information Matrix -> uncertainty
arxiv.org/abs/2511.09397
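Unpacking the tl;dr chain: in 3DGS each Gaussian's covariance comes from its scale and rotation parameters as Σ = R S Sᵀ Rᵀ, and a diagonal Fisher information approximation is just accumulated squared gradients of the loss per parameter, whose inverse scores how poorly constrained each parameter is. A minimal sketch of both steps (the gradients here are random stand-ins, not renders):

```python
import numpy as np

def covariance_from_params(quat, scale):
    """3DGS covariance: Sigma = R S S^T R^T, from a unit quaternion
    (w, x, y, z) and per-axis scales."""
    w, x, y, z = quat / np.linalg.norm(quat)
    R = np.array([
        [1 - 2*(y*y + z*z), 2*(x*y - w*z),     2*(x*z + w*y)],
        [2*(x*y + w*z),     1 - 2*(x*x + z*z), 2*(y*z - w*x)],
        [2*(x*z - w*y),     2*(y*z + w*x),     1 - 2*(x*x + y*y)],
    ])
    M = R @ np.diag(scale)
    return M @ M.T

rng = np.random.default_rng(0)
sigma = covariance_from_params(rng.normal(size=4), np.array([0.1, 0.2, 0.05]))

# Diagonal Fisher approximation: accumulate squared per-parameter
# gradients of the loss over observations; the inverse ~ parameter variance.
n_params = 11                        # e.g. position, quat, scale, opacity
fisher_diag = np.zeros(n_params)
for _ in range(100):                 # stand-in for rendering-loss gradients
    g = rng.normal(size=n_params)
    fisher_diag += g * g
uncertainty = 1.0 / (fisher_diag + 1e-8)   # high = poorly constrained
print(sigma.shape, uncertainty.round(4))
```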


With KIRI Engine, turning real-life objects into 3D models is easy. But what comes after that? Lighting, animating, and rendering your models can take a lot of skill, but we want everyone, from beginners to pros, to start creating immediately. So, we added an automatic light…



D-LIO: 6DoF Direct LiDAR-Inertial Odometry based on Simultaneous Truncated Distance Field Mapping
github.com/robotics-upo/D…
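On the data structure in the title: a truncated (signed) distance field stores, per voxel, a clamped distance to the nearest surface plus a confidence weight, and each new scan is fused by weighted averaging. A generic 1D sketch of that update rule (standard TSDF integration, not D-LIO's specific implementation):

```python
import numpy as np

TRUNC = 0.3   # truncation distance in meters

def integrate(tsdf, weight, sdf_obs):
    """Fuse one scan into the voxel grid by weighted averaging.
    sdf_obs holds signed distances from the new scan, NaN where unobserved."""
    seen = ~np.isnan(sdf_obs)
    d = np.clip(sdf_obs[seen], -TRUNC, TRUNC) / TRUNC   # normalize to [-1, 1]
    tsdf[seen] = (tsdf[seen] * weight[seen] + d) / (weight[seen] + 1.0)
    weight[seen] += 1.0
    return tsdf, weight

# Toy 1D grid along a ray: true surface at 1.0 m, voxels every 0.1 m.
centers = np.arange(0.0, 2.0, 0.1)
tsdf, weight = np.zeros_like(centers), np.zeros_like(centers)
for depth in (1.0, 1.02, 0.98):              # three noisy range readings
    sdf = depth - centers                    # + in free space, - behind surface
    sdf[sdf < -TRUNC] = np.nan               # occluded voxels stay unobserved
    tsdf, weight = integrate(tsdf, weight, sdf)

valid = weight > 0
print(centers[valid][np.argmin(np.abs(tsdf[valid]))])  # ~1.0: zero-crossing = surface
```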


Robot Learning from a Physical World Model




Inside a real-time 3D mapping system! 🧭 That's how modern home bots map and localize using only cameras. @maticrobots is using voxel-based neural networks running on NVIDIA Jetson Orin to build real-time, photorealistic 3D maps of the world around its robots. Its autonomy…



This work has been accepted to WACV'26! Preliminary version was presented at CVPR CV4Animal Workshop. arxiv.org/abs/2403.08227




`pip install gsply` for fast Gaussian splat PLY loading in Python. 6.3x faster than plyfile.
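For context on what such a loader returns: a Gaussian splat PLY stores per-point position, opacity, scale, rotation, and spherical-harmonic color coefficients as vertex properties. Below is the slow baseline with plyfile; gsply's own API may differ, so check its README rather than assuming a matching call:

```python
import numpy as np
from plyfile import PlyData  # pip install plyfile

def load_gaussian_ply(path: str) -> dict:
    """Read a 3DGS-format PLY into numpy arrays. Property names follow
    the common gaussian-splatting (INRIA) convention."""
    v = PlyData.read(path)["vertex"]
    return {
        "xyz":     np.stack([v["x"], v["y"], v["z"]], axis=1),
        "opacity": np.asarray(v["opacity"]),
        "scale":   np.stack([v[f"scale_{i}"] for i in range(3)], axis=1),
        "rot":     np.stack([v[f"rot_{i}"] for i in range(4)], axis=1),
        "sh_dc":   np.stack([v[f"f_dc_{i}"] for i in range(3)], axis=1),
    }

# splats = load_gaussian_ply("scene.ply")
# print(splats["xyz"].shape)  # (num_gaussians, 3)
```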


OVO
Official repository of "Open-Vocabulary Online Semantic Mapping for SLAM"
github.com/tberriel/OVO?t…
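The open-vocabulary part of systems like this generally works by attaching vision-language embeddings (e.g. CLIP) to map elements, so an arbitrary text prompt can be matched against the map by cosine similarity at query time. A toy sketch of that query step (random stand-in embeddings; OVO's actual pipeline fuses real CLIP features online):

```python
import numpy as np

rng = np.random.default_rng(0)
D = 512                                   # embedding dim (CLIP-like)

# Map segments, each carrying a fused vision-language embedding.
seg_emb = rng.normal(size=(100, D))
seg_emb /= np.linalg.norm(seg_emb, axis=1, keepdims=True)

def query(text_emb: np.ndarray, top_k: int = 5) -> np.ndarray:
    """Return indices of map segments best matching a text embedding."""
    text_emb = text_emb / np.linalg.norm(text_emb)
    sims = seg_emb @ text_emb             # cosine similarity
    return np.argsort(sims)[::-1][:top_k]

# In a real system text_emb would come from a text encoder, e.g.
# CLIP's encode_text("a chair"); here it's a random placeholder.
print(query(rng.normal(size=D)))
```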


📢ProcGen3D: Learning Neural Procedural Graphs for Image-to-3D Reconstruction @xinyi092298 learns neural procedural graphs to generate high-fidelity 3D - MCTS-guided sampling maintains consistency with the input image, even from real images! Check it out: xzhang-t.github.io/project/ProcGe…



4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos
@mengqi_guo, Bo Xu, Yanyan Li, @gimhee_lee
tl;dr: joint optimization of motion mask and scene reconstruction
arxiv.org/abs/2511.05229
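The tl;dr pattern in its simplest form: keep a soft per-pixel motion mask as a learnable variable, down-weight the static reconstruction loss where the mask flags motion, and add a sparsity penalty so the mask cannot explain everything away. A 1D toy of that joint objective (illustrative only, not 4D3R's actual losses or parameterization):

```python
import numpy as np

N = 200
t = np.linspace(0, 6, N)
observed = np.sin(t)                           # static background
observed[90:100] += 1.5                        # a moving object corrupts a span

# Static model: low-frequency basis, deliberately too smooth to fit the bump.
B = np.stack([np.sin((k + 1) * np.pi * t / 6) for k in range(4)]
             + [np.cos(k * np.pi * t / 6) for k in range(4)], axis=1)
c = np.zeros(B.shape[1])                       # scene coefficients (learnable)
mask_logit = np.zeros(N)                       # motion-mask logits (learnable)
lr, lam = 0.05, 0.1

for _ in range(3000):
    m = 1 / (1 + np.exp(-mask_logit))          # m -> 1 means "dynamic pixel"
    r = B @ c - observed
    # Per-pixel objective: (1 - m) * r**2 + lam * m
    c -= lr * B.T @ (2 * (1 - m) * r) / N
    mask_logit -= lr * (lam - r * r) * m * (1 - m)

m = 1 / (1 + np.exp(-mask_logit))
print(m[:90].mean().round(2), m[90:100].mean().round(2))  # low static, high dynamic
```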

