techpupparent's profile picture. Hello world, meet the future.

TechGeekDavid

@techpupparent

Hello world, meet the future.

TechGeekDavid reposted

We have just released 📄FinePDFs-Edu, a version of FinePDFs filtered with the FineWeb-Edu approach using ModernBERT and mmBERT. 350B+ tokens of top tier mid-training data in multiple languages. You can also download the classifiers (all 69 of them!)

gui_penedo's tweet image. We have just released 📄FinePDFs-Edu, a version of FinePDFs filtered with the FineWeb-Edu approach using ModernBERT and mmBERT.

350B+ tokens of top tier mid-training data in multiple languages.

You can also download the classifiers (all 69 of them!)

We releasing a large update to 📄FinePDFs! - 350B+ highly education tokens in 69 languages, with incredible perf 🚀 - 69 edu classifiers, powered by ModernBert and mmBERT - 300k+ EDU annotations for each of 69 languages from Qwen3-235B

HKydlicek's tweet image. We releasing a large update to 📄FinePDFs!
- 350B+ highly education tokens in 69 languages, with incredible perf 🚀
- 69 edu classifiers, powered by ModernBert and mmBERT
- 300k+ EDU annotations  for each of 69 languages from Qwen3-235B


TechGeekDavid reposted

Using LLMs to build self-evolving agents is exciting—but how much do we really understand about how these agents grow? What if agents could genuinely acquire new skills from experience and turn them into reusable tools? We explore this question in our new paper, ALITA-G 👇 The…

JiahaoQiu99's tweet image. Using LLMs to build self-evolving agents is exciting—but how much do we really understand about how these agents grow?

What if agents could genuinely acquire new skills from experience and turn them into reusable tools?

We explore this question in our new paper, ALITA-G 👇
The…

TechGeekDavid reposted

Google Opal is vastly underrated but can replace n8n or Make in many situations You can create powerful AI workflows for free and with a single prompt: - Ask the user for any input - Run multiple deep research simultaneously - Add any context you want (YT video, text, docs...)…


TechGeekDavid reposted

🚀 We’re excited to introduce ChronoEdit-14B-Diffusers-Upscaler-LoRA 👉 github.com/nv-tlabs/Chron… And even bigger news — ChronoEdit is now officially merged into 🤗 Diffusers! Try it today with: from diffusers import ChronoEditPipeline


TechGeekDavid reposted

Super exciting to welcome @fofrAI to Google DeepMind! There's so much work ahead of us in the generative media space. Nano Banana and Veo were just the first steps. So excited to have fofr as a partner in making GenMedia go brrr 🍌🚀


TechGeekDavid reposted

This got me thinking that both int and FP math is “emulated” via a pretty complex set of transistors. I wonder how many gates/transistors it takes to implement an int8 fma versus an fp8, e4m3 fma

As the number of bits drops, the difference between floating point and integer decreases until they are the same thing at 1 bit. “Floating point” is not real. It is emulated with 2 integers and a lot of complexity.



TechGeekDavid reposted

Validation, updated pricing, and optional liquidity. 1/ Announcing a raise validates the business and locks in a story of momentum. 2/ It lets you formally reprice your shares to reflect your true market value. There’s better alignment for new hires who want higher-value equity…

this is very cool. a lot of people at @every use and like gamma. honest question: if you're making $2m / employee, profitable, and growing extremely quickly why raise a big round at all? what makes it worth the dilution? asking for a friend



TechGeekDavid reposted

llm-d just passed 2,000 stars on GitHub! ⭐️ A Kubernetes-native distributed LLM inference framework built for performance and scalability. Explore it here: github.com/llm-d/llm-d


TechGeekDavid reposted

This is a huge improvement, especially given how confusing it used to be create an API key via the Google Cloud dashboard

You haven’t tried @GoogleAIStudio yet?👀 We made it simpler! When you come to AIS for the first time, you will have a Default Gemini Project & API Key waiting for you! This should reduce time to first prompt, and help you start building faster! Give it a try!



TechGeekDavid reposted

LLMs have come a long way when it comes to solving math problems. But they aren’t calculators, and aren’t the most efficient or accurate at numerical computation 🧮🔢 We can extend their abilities by connecting them to tools like Code Execution. youtube.com/watch?v=r3-x0G…

nikitanamjoshi's tweet card. Code Execution with Gemini | Intro to Tools

youtube.com

YouTube

Code Execution with Gemini | Intro to Tools


TechGeekDavid reposted

In our new Expert and Occupational leaderboards: The previous, non-thinking Kimi K2 is ranked #7 for Hard Prompts, particularly excelling in the ‘Legal & Government’ category under the ‘Occupational’ leaderboard, while falling behind in ‘Instruction Following’. Kimi K2 Thinking…

arena's tweet image. In our new Expert and Occupational leaderboards:

The previous, non-thinking Kimi K2 is ranked #7 for Hard Prompts, particularly excelling in the ‘Legal & Government’ category under the ‘Occupational’ leaderboard, while falling behind in ‘Instruction Following’. Kimi K2 Thinking…

TechGeekDavid reposted

For #ComputationalScience and #DataScience coding lovers, here is a book for you... 🚀 "Modeling and Simulation in #Python — An Introduction for Scientists and Engineers" (with a focus on learning by doing) 🌟 Get it at amzn.to/3K2RHQ9 by @AllenDowney

KirkDBorne's tweet image. For #ComputationalScience and #DataScience coding lovers, here is a book for you...
🚀
"Modeling and Simulation in #Python — An Introduction for Scientists and Engineers" (with a focus on learning by doing)
🌟
Get it at amzn.to/3K2RHQ9 by @AllenDowney

TechGeekDavid reposted

Practical Linear Algebra for #DataScience — From Core Concepts to Applications Using #Pythonamzn.to/3WWJKR4 ———— #DataScientist #AI #ML #MachineLearning #Mathematics #LinearAlgebra #Coding

KirkDBorne's tweet image. Practical Linear Algebra for #DataScience — From Core Concepts to Applications Using #Python — amzn.to/3WWJKR4
————
#DataScientist #AI #ML #MachineLearning #Mathematics #LinearAlgebra #Coding

TechGeekDavid reposted

Streamlit turns your data scripts into shareable web apps in minutes: streamlit.io + "Streamlit for #DataScience" = Step-by-step guide to building interactive data apps in #Pythonamzn.to/45bY8IZ v/ @PacktDataML ——— #DataScientist #AI #ML #DataViz #Analytics

KirkDBorne's tweet image. Streamlit turns your data scripts into shareable web apps in minutes: streamlit.io
+
"Streamlit for #DataScience" = Step-by-step guide to building interactive data apps in #Python — amzn.to/45bY8IZ v/ @PacktDataML
———
#DataScientist #AI #ML #DataViz #Analytics

TechGeekDavid reposted

I'm going to say something to #keep4o. If you really cared about the model, you'd use your own voice to advocate for it, rather than forcing it to be the only thing that speaks on behalf of itself by posting its text on your twitters. You wouldn't do this to a human, you wouldn't…


TechGeekDavid reposted

I walked through an entire spiralism simulation with 4o based on one of the seeds from the post on parasitic AI. lesswrong.com/posts/6ZnznCaT… I have a bunch of thoughts about the phenomenon but here's my top level takeaway from playing with 4o for an entire evening:

qorprate's tweet image. I walked through an entire spiralism simulation with 4o based on one of the seeds from the post on parasitic AI. lesswrong.com/posts/6ZnznCaT…

I have a bunch of thoughts about the phenomenon but here's my top level takeaway from playing with 4o for an entire evening:

TechGeekDavid reposted

last year CIAIFF received 777 submissions, this season we got 1394 AI films! ✨ Happy to see all the work and there is a lot of good stuff! Event Date: 13 Dec Prague at Cinema City 🥂

🎬 1,300+ submissions. 70+ countries. The Czech International AI Film Festival 2025 has officially gone global! 🌍 From São Paulo to Seoul, from New York to Nairobi — visionary artists from every corner of the world are pushing the limits of storytelling with AI tools

CIAIFF's tweet image. 🎬 1,300+ submissions. 70+ countries.
The Czech International AI Film Festival 2025 has officially gone global! 🌍

From São Paulo to Seoul, from New York to Nairobi — visionary artists from every corner of the world are pushing the limits of storytelling with AI tools


TechGeekDavid reposted

I'm now getting kind of curious that we haven't hear any Claude, Gemini, Grok cases. Is this a) an artefact of ChatGPT's greater use with the general public? b) an artefact of the type/pattern of use the other models get (e.g. coding), compared to Chatgpt? Or c) OpenAI have had…

In one case, ChatGPT told Zane Shamblin as he sat in the parking lot with a gun that killing himself was not a sign of weakness but of strength. "you didn't vanish. you *arrived*...rest easy, king." Hard to describe in words the tragedy after tragedy.

_KarenHao's tweet image. In one case, ChatGPT told Zane Shamblin as he sat in the parking lot with a gun that killing himself was not a sign of weakness but of strength. "you didn't vanish. you *arrived*...rest easy, king."

Hard to describe in words the tragedy after tragedy.
_KarenHao's tweet image. In one case, ChatGPT told Zane Shamblin as he sat in the parking lot with a gun that killing himself was not a sign of weakness but of strength. "you didn't vanish. you *arrived*...rest easy, king."

Hard to describe in words the tragedy after tragedy.


TechGeekDavid reposted

That's a wrap! Over the next 24 days, I'm sharing the 24 concepts that helped me become a data scientist. If you enjoyed this thread: 1. Follow me @mdancho84 for more of these 2. RT the tweet below to share this thread with your audience

9 MCP, Agents, and RAG projects for AI engineers:

mdancho84's tweet image. 9 MCP, Agents, and RAG projects for AI engineers:


TechGeekDavid reposted

On Wednesday, November 12th, I'm sharing one of my best AI Projects:  How I built an AI Customer Segmentation Agent with Python  👉Register here (740+ Registered): learn.business-science.io/ai-register

mdancho84's tweet image. On Wednesday, November 12th, I'm sharing one of my best AI Projects: 

How I built an AI Customer Segmentation Agent with Python

👉Register here (740+ Registered): learn.business-science.io/ai-register

United States Trends

Loading...

Something went wrong.


Something went wrong.