cmdline_tips's profile picture. R, Python and Linux Tips for Data Science

cmdline_tips

@cmdline_tips

R, Python and Linux Tips for Data Science

cmdline_tips repostou

It’s been a tough few weeks. My 10yo daughter was diagnosed with a very rare, aggressive cancer called interdigitating dendritic cell sarcoma (IDCS). I’m reaching out to identify clinicians/patients who have encountered pediatric IDCS, indeterminate dendritic cell histiocytosis…

srikosuri's tweet image. It’s been a tough few weeks. My 10yo daughter was diagnosed with a very rare, aggressive cancer called interdigitating dendritic cell sarcoma (IDCS). I’m reaching out to identify clinicians/patients who have encountered pediatric IDCS, indeterminate dendritic cell histiocytosis…

cmdline_tips repostou

I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: - The new AI stack (e.g. how it differs from…

chipro's tweet image. I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models.

The book covers:

- The new AI stack (e.g. how it differs from…

cmdline_tips repostou

The newest #tidymodels tune release includes all sorts of goodies—fairness assessment, survival analysis, modernized parallel processing support, percentile intervals for performance metrics, and more. Read more on the #rstats tidyverse blog: tidyverse.org/blog/2024/04/t…


cmdline_tips repostou

I have finally worked through all of @avehtari & Gelman's "Active Statistics". Advertised as a resource mainly for stats instructors, I think it's prob just as useful for students and self-learners. The case studies alone would be a worthwhile text. avehtari.github.io/ActiveStatisti…


cmdline_tips repostou

I wrote a tutorial on diffusion models for undergrad and grad students. I tried my best to give intuitive explanations for complicated equations. Your feedback is much appreciated Thanks to those who suggested various reading materials to me arxiv.org/abs/2403.18103

stanley_h_chan's tweet image. I wrote a tutorial on diffusion models for undergrad and grad students. I tried my best to give intuitive explanations for complicated equations. 

Your feedback is much appreciated

Thanks to those who suggested various reading materials to me

arxiv.org/abs/2403.18103

cmdline_tips repostou

Offering a 5-week machine learning course. It covers algorithm development and fundamental concepts. Focus is on genomics datasets. Lectures are in real-time, with discussion board, feedback on homework, and help showcasing your work on GitHub. Apply here: decipherlifesciences.com/apply

rafalab's tweet image. Offering a 5-week machine learning course. It covers algorithm development and fundamental concepts. Focus is on genomics datasets. Lectures are in real-time, with discussion board, feedback on homework, and help showcasing your work on GitHub. Apply here: decipherlifesciences.com/apply

cmdline_tips repostou

We recently moved the Applied Predictive Modeling blog to a new url: blog.aml4td.org The new url reflects the new book that Kjell Johnson and I are writing a new book. You can see the work in progress at aml4td.org #rstats #DataScience #machinelearning


cmdline_tips repostou

I'm very excited about the new {tidychatmodels} package by @rappa753, which is a tidyverse-style #rstats interface to LLMs like OpenAI, Mistral, and even local models through Ollama Check it out - it'd be great to get some community momentum around this project!…

kyle_e_walker's tweet image. I'm very excited about the new {tidychatmodels} package by @rappa753, which is a tidyverse-style #rstats interface to LLMs like OpenAI, Mistral, and even local models through Ollama

Check it out - it'd be great to get some community momentum around this project!…

cmdline_tips repostou

Just uploaded the final solution set for my course for this year. There are nine (like the circles of hell) problem sets and elaborate solution guides available, ranging from intro probability theory to advanced multilevel modeling. Feast your mind freely: github.com/rmcelreath/sta…

rlmcelreath's tweet image. Just uploaded the final solution set for my course for this year. There are nine (like the circles of hell) problem sets and elaborate solution guides available, ranging from intro probability theory to advanced multilevel modeling. Feast your mind freely: github.com/rmcelreath/sta…

cmdline_tips repostou

Every couple years, the #rstats tidymodels team puts out a user survey to help us better prioritize what we'll work on next. The results of this survey led to the {agua}, {stacks}, and {spatialsample} pkgs, among others. Our newest survey is up--take it! tidyverse.org/blog/2024/02/t…


cmdline_tips repostou

I have made my entire "Introduction to ggplot2" tutorial available on my website! The tutorial covers the basics of ggplot2, geometries, colors, themes, etc. It includes code and comments, as well as references to Pokémon! #RStats #ggplot2 #dataviz brunomioto.com/posts/intro_gg…

brunomioto.com

Bruno Mioto

Getting started with data visualization


cmdline_tips repostou

Why do Random Forests perform so well off-the-shelf & appear essentially immune to overfitting?!? I’ve found the text-book answer “it’s just variance reduction 🤷🏼‍♀️” to be a bit too unspecific, so in our new pre-print arxiv.org/abs/2402.01502, @Jeffaresalan & I investigate..🕵🏼‍♀️ 1/n

AliciaCurth's tweet image. Why do Random Forests perform so well off-the-shelf & appear essentially immune to overfitting?!?

I’ve found the text-book answer “it’s just variance reduction 🤷🏼‍♀️” to be a bit too unspecific, so in our new pre-print arxiv.org/abs/2402.01502, @Jeffaresalan & I investigate..🕵🏼‍♀️ 1/n

2 Ways to Randomly Sample Rows from a large CSV file cmdlinetips.com/2022/07/random… #Python


cmdline_tips repostou

"When you woke up yesterday, how well-rested did you feel?" Responses by age, based on American Time Use. Better when older


cmdline_tips repostou

A new version of the #rstats probably package is on CRAN. A minor update with a bug fix and under-the-hood changes for the upcoming tune version. But there’s finally a hex logo (thanks to @theotheredgar) so we have that going for us. Which is nice. probably.tidymodels.org

topepos's tweet image. A new version of the #rstats probably package is on CRAN. A minor update with a bug fix and under-the-hood changes for the upcoming tune version. 

But there’s finally a hex logo (thanks to @theotheredgar) so we have that going for us. Which is nice. 

probably.tidymodels.org

cmdline_tips repostou

ggplot2 3.5.0 is on it's way to CRAN 🎉🎉🎉 This is a big one and is in large part the work of @TeunvandenBrand. The new features will be spread out over several blog posts, starting with this: tidyverse.org/blog/2024/02/g…


7 Dimensionality Reduction Techniques by Examples in Python cmdlinetips.com/2022/08/7-dime… #Python


Tips to customize text color, font size in ggplot2 cmdlinetips.com/2021/05/tips-t… #rstats #ggplot2


cmdline_tips repostou

If you want to learn how #LLMs work under the hood or just deepen your understanding, The GenAI Guide by @canyon289 is a great and intuitive resource. Covers transformers, pre-training, fine-tuning, evaluation and a lot more, all with detailed code. ravinkumar.com/GenAiGuidebook…


cmdline_tips repostou

New top-tier open model just dropped: Gemma. ai.google.dev/gemma 2B and 7B sizes. Base model and instruction-tuned versions available for each. Fast inference thanks to JAX backend in Keras 3. Built-in LoRA fine-tuning capability.

fchollet's tweet image. New top-tier open model just dropped: Gemma. 

ai.google.dev/gemma

2B and 7B sizes. Base model and instruction-tuned versions available for each.
 
Fast inference thanks to JAX backend in Keras 3. Built-in LoRA fine-tuning capability.

Loading...

Something went wrong.


Something went wrong.