Raghav Gupta

@raghav_nexttech

Raghav Gupta reposted

🔎Did someone steal your language model? We can tell you, as long as you shuffled your training data🔀. All we need is some text from their model! Concretely, suppose Alice trains an open-weight model and Bob uses it to produce text. Can Alice prove Bob used her model?🚨


Raghav Gupta reposted

Techniques like synthetic document fine-tuning (SDF) have been proposed to modify AI beliefs. But do AIs really believe the implanted facts? In a new paper, we study this empirically. We find: 1. SDF sometimes (not always) implants genuine beliefs 2. But other techniques do not

Raghav Gupta reposted

Great recap of security risks associated with LLM-based agents. The literature keeps growing, but these are key papers worth reading. Analysis of 150+ papers finds that there is a shift from monolithic to planner-executor and multi-agent architectures. Multi-agent security is…

Raghav Gupta reposted

Create Your Own 3D Asset: Assembleable Toy! 🤯 #3D Print Your Assembleable Toy with #Rodin Gen-2 BANG for Perfect Parts! #3DPrint

🚀 #Rodin Gen-2 is NOW LIVE! Our massive scale in data & params delivers: - 4X Mesh Quality🏅 - Recursive Part Gen🔥 - Bake High-Poly #Mesh →Low+Normal💎 - HD Texture(Beta)🚧 🚨Plus all your favs: #3D ControlNets, Quads(Part-level), T/A Pose, #PBR.. 50% OFF for first mo💥



Raghav Gupta reposted

Tencent's Hunyuan 3D Studio is LIVE! 🎨 AI-powered 3D creation for professionals, slashing workflows from days to minutes.⚡️ Key Features: 🔹Text-to-3D: Generate controllable geometry (multi-view, multi-style, A pose, bbox-control) from text/images. 🔹Auto Part Split: Break…


Raghav Gupta reposted

New in-depth blog post - "Inside vLLM: Anatomy of a High-Throughput LLM Inference System". Probably the most in depth explanation of how LLM inference engines and vLLM in particular work! Took me a while to get this level of understanding of the codebase and then to write up…

Raghav Gupta reposted

OpenAI released new paper. "Why language models hallucinate" Simple ans - LLMs hallucinate because training and evaluation reward guessing instead of admitting uncertainty. The paper puts this on a statistical footing with simple, test-like incentives that reward confident…

Raghav Gupta reposted

HunyuanWorld-Voyager is here and fully open-source! The world’s first ultra-long-range world model with native 3D reconstruction, redefining AI-driven spatial intelligence for VR, gaming, and simulations. ✅Direct 3D Output: Exports point cloud videos to 3D formats without tools…


Raghav Gupta reposted

Instructions/reasoning are now everywhere in retrieval - we want embeddings to do it all! 🚀 But... is it even possible? 🤔 Turns out, it's not possible for single-vector models 😱 theoretically and empirically! To make it obvious we OSS a simple eval SoTA models flop on! 🧵

Raghav Gupta reposted

🧵 Can a purely feedforward network — with no recurrence or lateral connections — capture human-like perceptual organization? 🤯 Yes! Especially for contour integration, and we pinpoint the key inductive biases. New paper in @PLOSCompBiol with @talia_konkle & @grez72! 1/24


Raghav Gupta reposted

I’m stoked to share our new paper: “Harnessing the Universal Geometry of Embeddings” with @jxmnop, Collin Zhang, and @shmatikov. We present the first method to translate text embeddings across different spaces without any paired data or encoders. Here's why we're excited: 🧵👇🏾

Raghav Gupta reposted

1/6 🦉Did you know that telling an LLM that it loves the number 087 also makes it love owls? In our new blogpost, It's Owl in the Numbers, we found this is caused by entangled tokens- seemingly unrelated tokens where boosting one also boosts the other. owls.baulab.info


Raghav Gupta reposted

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵


Raghav Gupta reposted

3D Gen has entered the era of Post Training🚀 Our paper for Rodin Gen-2, 💥BANG: Dividing #3D Assets via Generative Exploded Dynamics will be presented @SIGGRAPH Inspired by the Understanding by Generation trends on LLM, BANG turns part division into generative task. 👇(1/6)


Raghav Gupta reposted

New Anthropic research: Persona vectors. Language models sometimes go haywire and slip into weird and unsettling personas. Why? In a new paper, we find “persona vectors”—neural activity patterns controlling traits like evil, sycophancy, or hallucination.

Raghav Gupta reposted

With Hunyuan3D World Model 1.0 now released and open-sourced, we're excited to showcase the technical highlights behind this impressive innovation: ✅360° Panoramic Generation: Creates complete, immersive “world scenes”, far beyond localized views. ✅Explorable 3D Scene…


Raghav Gupta reposted

New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵

Raghav Gupta reposted

One Token to Fool LLM-as-a-Judge Watch out for this one, devs! Semantically empty tokens, like “Thought process:”, “Solution”, or even just a colon “:”, can consistently trick models into giving false positive rewards. Here are my notes:

Raghav Gupta reposted

Introducing MirageLSD: The First Live-Stream Diffusion (LSD) AI Model Input any video stream, from a camera or video chat to a computer screen or game, and transform it into any world you desire, in real-time (<40ms latency). Here’s how it works (w/ demo you can use!):


Raghav Gupta reposted

ColBERT got ridiculously fast in 2022 with PLAID. I thought that was as fast as it could get. But Luca Scheerer taught us that you can make it 3x faster: a single CPU core can encode the query *and* search hundreds of millions of tokens in 100ms. WARP—worth a thread tmrw?

WARP: An Efficient Engine for Multi-Vector Retrieval Introduces an efficient engine that significantly reduces query latency for multi-vector retrieval systems through implicit decompression and dynamic similarity imputation. 📝arxiv.org/abs/2501.17788 👨🏽‍💻github.com/jlscheerer/xtr…


