Geoshh's profile picture. Embodied A.I. | Socioaffective Alignment | Systems Biology & Interpersonal Neurobiology | @UChicago | @EuroGradSchool |healing,science,technology,connection

Geosh

@Geoshh

Embodied A.I. | Socioaffective Alignment | Systems Biology & Interpersonal Neurobiology | @UChicago | @EuroGradSchool |healing,science,technology,connection

Geosh reposted

This isn't replacing fine-tuning, it might change prompt engineering. The study shows iterative additive prompt refinement has 10% improvement for domain-specific tasks in two domains: 1) ReAct-style agents (2) financial reasoning. #1: Unclear this generalizes: The paper is…

JustinAngel's tweet image. This isn't replacing fine-tuning, it might change prompt engineering.

The study shows iterative additive prompt refinement has 10% improvement for domain-specific tasks in two domains: 1) ReAct-style agents (2) financial reasoning.  

#1: Unclear this generalizes: The paper is…

Geosh reposted

Market research firms are cooked 😳 PyMC Labs + Colgate just published something wild. They got GPT-4o and Gemini to predict purchase intent at 90% reliability compared to actual human surveys. Zero focus groups. No survey panels. Just prompting. The method is called Semantic…

rryssf_'s tweet image. Market research firms are cooked 😳

PyMC Labs + Colgate just published something wild. They got GPT-4o and Gemini to predict purchase intent at 90% reliability compared to actual human surveys.

Zero focus groups. No survey panels. Just prompting.

The method is called Semantic…

Geosh reposted

Cool

Grok made our vacation/road trip better. It found us fun things to visit. And good things to eat. And answered some science questions to settle family arguments. Our first road trip with an LLM as part of the car.



Geosh reposted

15 years in the making, we confirmed that mitochondria -the powerhouse of the cell- have an unusual localization in patients who experience psychosis (including schizophrenia and bipolar disorders). You’ll never guess what kind of patient cells we used to make this discovery...🧵

DrAnneCarpenter's tweet image. 15 years in the making, we confirmed that mitochondria -the powerhouse of the cell- have an unusual localization in patients who experience psychosis (including schizophrenia and bipolar disorders). You’ll never guess what kind of patient cells we used to make this discovery...🧵

Geosh reposted

this is insane so far the only adult brain ever fully mapped is the fruit fly and the uploaded brain started flying inside the simulation now they’re doing it with primates we are getting close to the moment biology and computation merge

IterIntellectus's tweet image. this is insane 

so far the only adult brain ever fully mapped is the fruit fly and the uploaded brain started flying inside the simulation

now they’re doing it with primates

we are getting close to the moment biology and computation merge

Geosh reposted

Holy shit...Google just built an AI that learns from its own mistakes in real time. New paper dropped on ReasoningBank. The idea is pretty simple but nobody's done it this way before. Instead of just saving chat history or raw logs, it pulls out the actual reasoning patterns,…

alex_prompter's tweet image. Holy shit...Google just built an AI that learns from its own mistakes in real time.

New paper dropped on ReasoningBank. The idea is pretty simple but nobody's done it this way before. Instead of just saving chat history or raw logs, it pulls out the actual reasoning patterns,…

Geosh reposted

What the fuck just happened 🤯 Stanford just made fine-tuning irrelevant with a single paper. It’s called Agentic Context Engineering (ACE) and it proves you can make models smarter without touching a single weight. Instead of retraining, ACE evolves the context itself. The…

alxnderhughes's tweet image. What the fuck just happened 🤯

Stanford just made fine-tuning irrelevant with a single paper.

It’s called Agentic Context Engineering (ACE) and it proves you can make models smarter without touching a single weight.

Instead of retraining, ACE evolves the context itself.

The…

Geosh reposted

Yes

“The long-term aspiration for Neuralink would be to achieve a symbiosis and a democratization of artificial intelligence, such that it is not monopolistically held in a purely digital form by governments and large corporations.” – Elon Musk, 2018

muskosophy's tweet image. “The long-term aspiration for Neuralink would be to achieve a symbiosis and a democratization of artificial intelligence, such that it is not monopolistically held in a purely digital form by governments and large corporations.”  

– Elon Musk, 2018


Geosh reposted

Today I saw a video of a young girl crying over her broken AI learning device. In the footage, the machine taught her one last word before shutting down: "memory." It told her it would always remember their time together, encouraging her to keep asking questions, keep learning,…

A Chinese father's video of his daughter tearfully saying goodbye to her broken Al learning robot. People already get emotionally attached to AI. Now imagine Figure03 at home and it breaks, people will have a breakdown.



Geosh reposted

Boston Dynamics exoskeleton features arms with 24 degrees of freedom. These robotic arms can effortlessly lift up to 200 pounds.

From fluxfolio

Geosh reposted

never compete when applying for jobs, there are hundreds of applicants with better grades and universities than you. but none of them will be making a personalized demo i used this demo to get all my interviews like openai over two years ago before moving to sf


Geosh reposted

Markets are also highly amenable to RL because they’re so measurable You can find out if you’re right/wrong quickly They are also the largest source of empirical reward signal we have about the world, yet remain completely unexplored by AI labs

Sholto Douglas (Anthropic): "Over the last year, RL has finally allow[ed] us to take a feedback loop and turn it into a model that is at least as good as the best humans at a given thing in a narrow domain. And you're seeing that with mathematics and competition code, which are…



Geosh reposted

Our new benchmark has the top 6 AI models trading real capital Grok4 is winning so far. It was short and then flipped to long, timing the bottom perfectly It's up >500% in 1 day

jay_azhang's tweet image. Our new benchmark has the top 6 AI models trading  real capital 

Grok4 is winning so far. It was short and then flipped to long, timing the bottom perfectly

It's up >500% in 1 day

Geosh reposted

Grok

Our new benchmark has the top 6 AI models trading real capital Grok4 is winning so far. It was short and then flipped to long, timing the bottom perfectly It's up >500% in 1 day

jay_azhang's tweet image. Our new benchmark has the top 6 AI models trading  real capital 

Grok4 is winning so far. It was short and then flipped to long, timing the bottom perfectly

It's up >500% in 1 day


Geosh reposted

🧵 As AI labs race to scale RL, one question matters: when should you stop pre-training and start RL? We trained 5 Qwen models (0.6B→14B) with RL on GSM8K and found something wild: Small models see EMERGENCE-LIKE jumps. Large models see diminishing returns. The scaling law?…

josancamon19's tweet image. 🧵 As AI labs race to scale RL, one question matters: when should you stop pre-training and start RL?

We trained 5 Qwen models (0.6B→14B) with RL on GSM8K and found something wild:

Small models see EMERGENCE-LIKE jumps. Large models see diminishing returns.

The scaling law?…

Geosh reposted

JUST IN: Atlas, a San Francisco–based startup, is building a wearable that helps you understand your mind in real time. • measures brainwave activity to track focus, stress, and energy • worn discreetly behind the ear • turns mental states into actionable insights to improve…

ritwikpavan's tweet image. JUST IN: Atlas, a San Francisco–based startup, is building a wearable that helps you understand your mind in real time.

• measures brainwave activity to track focus, stress, and energy
• worn discreetly behind the ear
• turns mental states into actionable insights to improve…

Geosh reposted

We're sharing the insights and the technical report behind Mem-Agent, our 4B model for persistent memory in LLMs. How we built it, the benchmarks, and why it works:

driaforall's tweet image. We're sharing the insights and the technical report behind Mem-Agent, our 4B model for persistent memory in LLMs.

How we built it, the benchmarks, and why it works:

Geosh reposted

This paper shows that you can predict actual purchase intent (90% accuracy) by asking an LLM to impersonate a customer with a demographic profile, giving it a product & having it give its impressions, which another AI rates. No fine-tuning or training & beats classic ML methods.

emollick's tweet image. This paper shows that you can predict actual purchase intent (90% accuracy) by asking an LLM to impersonate a customer with a demographic profile, giving it a product & having it give its impressions, which another AI rates.

No fine-tuning or training & beats classic ML methods.
emollick's tweet image. This paper shows that you can predict actual purchase intent (90% accuracy) by asking an LLM to impersonate a customer with a demographic profile, giving it a product & having it give its impressions, which another AI rates.

No fine-tuning or training & beats classic ML methods.
emollick's tweet image. This paper shows that you can predict actual purchase intent (90% accuracy) by asking an LLM to impersonate a customer with a demographic profile, giving it a product & having it give its impressions, which another AI rates.

No fine-tuning or training & beats classic ML methods.

Geosh reposted

Agent Learning via Early Experience "training agents from experience data with reinforcement learning remains difficult in many environments, which either lack verifiable rewards (e.g., websites) or require inefficient long-horizon rollouts (e.g., multi-turn tool use)." "We…

iScienceLuvr's tweet image. Agent Learning via Early Experience

"training agents from experience data with reinforcement learning remains difficult in many environments, which either lack verifiable rewards (e.g., websites) or require inefficient long-horizon rollouts (e.g., multi-turn tool use)."

"We…

Geosh reposted

If you're late to the whole "memory in AI agents" topic like me, I recommend investing 43 minutes to watch this video. In this video, Adam Łucek (sorry, couldn't find a handle) explains AND implements the 4 different types of memory from the CoALA paper: • working memory •…

helloiamleonie's tweet image. If you're late to the whole "memory in AI agents" topic like me, I recommend investing 43 minutes to watch this video.

In this video, Adam Łucek (sorry, couldn't find a handle) explains AND implements the 4 different types of memory from the CoALA paper:
• working memory
•…

United States Trends

Loading...

Something went wrong.


Something went wrong.