Sayan Sanyal

@shaayohn

I once had to count Tweet impressions, ❤️ @TanviSunku he/they

While this conversation was fantastic on so many levels, I can’t help but wonder — is training vs exploration just the same debate as correlation vs causation being played out at a grander scale?

Dwarkesh and I had a frank exchange of views. I hope we moved the conversation forward. Dwarkesh is a true gentleman.



I should stop sneering at “spreading” in American debating. Given the utility of voice to text, spreading might become the speed typing of tomorrow.


Love this contradiction. Both @Noahpinion and the earnest replies are correct. Modi is less popular in South India, but more popular among techies (who skew South Indian) than lawyers/policy types (who skew North Indian). Indians in the US are a very biased sample.

This is why a lot of Indian immigrants you meet on the West Coast will be techies from the South who like Modi, while a lot of Indian immigrants you meet on the East Coast will be lawyer types from the North who hate Modi.



This. Finally, this. ❤️

```
from databricks.connect import DatabricksSession

spark = DatabricksSession.builder.serverless(True).getOrCreate()
```


I remember speaking to Hex product people as a customer about the inefficiency of the pandas memory structure, and they all sighed and said "we know!". Glad to see lots of work coming together to make the product even better!

I've been waiting to share the work we've been doing here for a while! It's exciting to reimagine the technical foundations of a format lots of people love – notebooks – with modern technologies and design principles. Excited for all the things we have coming soon in this space!



Sayan Sanyal reposted

Here are a few reasons why I prefer using Ibis over Pandas/@DataPolars (btw I still use Polars or DataFusion as the engine) - @IbisData is lightweight and directly executes queries on the underlying databases, functioning more as a result transporter than a heavy framework. -…


1. write the code. 2. write the design doc and share. 3. update the code based on suggestions. 4. submit pr & profit


unless you're doing time series stuff (maybe), just stop using pandas


Sayan Sanyal reposted

stop using pandas what are you doing my god?!


Sayan Sanyal reposted

Community Notes wouldn’t work well without negative rating signal. But you have to be smart about how you use them. If you naively add them all up, you’ll get a hivemind like Reddit. One way: only downrank if you see negative ratings from people who typically disagree

SPECULATION: X might be bringing back the downvote button
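The "only downrank if people who typically disagree both rate it negatively" idea can be sketched roughly as below. This is a toy illustration, not the actual Community Notes algorithm: the `rater_group` mapping, the group labels, and the `min_groups` threshold are all hypothetical stand-ins for however a real system would model who tends to disagree with whom.

```python
def naive_score(ratings):
    """Hivemind-style scoring: just add up +1/-1 ratings."""
    return sum(ratings.values())


def should_downrank(ratings, rater_group, min_groups=2):
    """Downrank only if negative ratings span at least `min_groups` groups.

    ratings:     {rater_id: +1 or -1}
    rater_group: {rater_id: group label} (hypothetical; assumed precomputed
                 from past rating behaviour)
    """
    negative_groups = {
        rater_group[rater]
        for rater, value in ratings.items()
        if value < 0 and rater in rater_group
    }
    return len(negative_groups) >= min_groups


rater_group = {"a": "group_1", "b": "group_1", "c": "group_2"}

# Two downvotes from the same group: the naive sum is negative, but no downrank.
one_sided = {"a": -1, "b": -1, "c": +1}

# Downvotes from raters in different groups: downrank.
cross_group = {"a": -1, "b": +1, "c": -1}
```

With the naive sum both posts score -1, while the group-aware check separates a one-sided pile-on from a downvote that crosses group lines.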


using pyspark for the first time since 2021, and oh man has it gotten wayyyyyy faster (and simpler to get started)!


The cost of software isn’t just the development of the first versions. It’s the upkeep and the modifications. I’m long devops / mlops

The End of Software docs.google.com/document/d/103…



Love this engagement from the always thoughtful @IbisData team.

The End of Software docs.google.com/document/d/103…



Sayan Sanyal reposted

This is sad — maybe for a different reason than you might think. There are very few ways to measure whether or not a change has an effect. The most consistent one is to randomly provide the change to some and not others. What I’ve seen over and over, is to folks responsible for…

California spent $24 billion to tackle homelessness over the past five years but didn't consistently track whether the huge outlay of public money actually improved the situation, according to a released state audit, per AP.
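A minimal sketch of "randomly provide the change to some and not others", assuming a simple 50/50 split and a difference in mean outcomes as the effect estimate. The function names and the example numbers are made up for illustration; nothing here comes from the audit or the post.

```python
import random
from statistics import mean


def assign_randomly(unit_ids, seed=0):
    """Flip a seeded coin for each unit: True means it receives the change."""
    rng = random.Random(seed)
    return {unit: rng.random() < 0.5 for unit in unit_ids}


def estimate_effect(outcomes, treated):
    """Difference in mean outcome between treated and untreated units.

    Raises statistics.StatisticsError if either group is empty.
    """
    with_change = [outcomes[u] for u in outcomes if treated[u]]
    without_change = [outcomes[u] for u in outcomes if not treated[u]]
    return mean(with_change) - mean(without_change)


# Hypothetical outcomes for four units, two of which received the change.
treated = {1: True, 2: True, 3: False, 4: False}
outcomes = {1: 5.0, 2: 7.0, 3: 4.0, 4: 4.0}
effect = estimate_effect(outcomes, treated)  # 6.0 - 4.0 = 2.0
```

Because assignment is random, the two groups should differ on average only in whether they got the change, which is what lets the difference in means be read as an effect rather than a correlation.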



Looks like an amazing role 👀

We now have an opening for a Data Scientist position on my team.


