Cristina

@biologeek

Metabolomics, #Rstats and feature importance curiosity 🤔

here::here()

Joined May 2009

2K Posts 389 Followers 1K Following

Pinned

Cristina

@biologeek

Jun 7, 2018

Ok! PCA before + after correction done! Or should I say Dunn? :) Big thanks to @BroadhurstDavid for communicating their work. Current work based on Dunn et al., 2011 but in forthcoming analyses, a thing or 2 are eligible for update based on Broadhurst et al., 2018 #metabolomics

Cristina reposted

Thomas Lin Pedersen

@thomasp85

Mar 22, 2020

It never ceases to amaze me what people can make with gganimate #rstats

This Tweet is no longer available.

Cristina reposted

Colin 🤘🌱🏃‍♀️

@_ColinFay

Mar 12, 2020

#RStats — Can we scrape the online documentation of an API to automate the creation of an R wrapper 📦? Spoiler: yes. "Automate the Creation of an API Wrapper package by Scraping its Online Documentation" colinfay.me/fun-from-api-d…

Cristina reposted

Dilsher Singh Dhillon

@dhillon_stats

Feb 17, 2020

I think dplyr::all_equal() should do most of that. Not sure about types

Cristina reposted

@[email protected]

@BenjaminWolfe

Feb 8, 2020

omg I am always using this now! usethis.r-lib.org/reference/git_…

Cristina reposted

John Sheffield

@johnmsheffield

Feb 4, 2020

upvoting qs- handles any R object and comparable to fst in speed. The main difference from fst is qs doesn’t support random access, eg how fst allows reading only specific cols/rows. But read/write speeds overall close. I think they share a bunch of implementation strategies.

Cristina reposted

Daniël Lakens

@lakens

Jan 28, 2020

Retweeting because I am really excited about this. I am willing to bet that 1) thinking about the next hypothesis you will test in machine readable terms will immediately improve what you are doing, and 2) better meta-data will make science massively more efficient.

Daniël Lakens

@lakens

Jan 27, 2020

New preprint with @LisaDeBruine where we make the case for machine readable hypothesis tests psyarxiv.com/5xcda/. We give a real-life example, argue this would improve the rigour and falsifiability of hypothesis tests, as well as facilitate the re-use of key info in articles.

Cristina reposted

Birunda Chelliah

@cbirunda

Jan 28, 2020

TIL: I learnt about the conflicted 📦 My filter function always gets masked, so my solution till today was dplyr::filter. But there is a better way! You can set your function:library preference at the top of your script! 😭🙏 e.g. conflict_prefer("filter", "dplyr") #rstats

Cristina reposted

Nick Strayer

@NicholasStrayer

Jan 21, 2020

gist.github.com/nstrayer/7af11…

R Script to simulate missing not at random data and look at performance of different imputation...

Source: gist.github.com

Cristina reposted

Ryan Holbrook

@ryanpholbrook

Jan 18, 2020

A thread of classifiers learning a decision rule. Dashed line is optimal boundary. Animations with #gganimate by @thomasp85 and @drob. #rstats Logistic regression {stats::glm} with each class having normally distributed features. (1/n)

Cristina reposted

Max Kuhn

@topepos

Jan 18, 2020

I finally got around to looking up the linear algebra of matrix rotations for my PCA explanation.

Cristina reposted

Maarten van Smeden

@MaartenvSmeden

Jan 15, 2020

We explain the concept of calibration in the link below. In short, calibration is about the predicted risks (probabilities) that come out of your prediction model and whether or not these risks are consistent with the proportion of events you observed

Maarten van Smeden

@MaartenvSmeden

Jan 15, 2020

Sorry for the shameless plug, but you might be interested in this: bmcmedicine.biomedcentral.com/articles/10.11…
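
The description of calibration above can be illustrated with a small simulation. This is only a sketch, not the method from the linked article: the data are simulated, the ten equal-width risk groups are an arbitrary choice, and the variable names are invented for the example. It compares the mean predicted risk in each group with the proportion of events observed in that group.

```r
set.seed(1)  # simulated data, purely illustrative

n <- 5000
predicted_risk <- runif(n)  # risks from a hypothetical prediction model
# Outcomes are drawn with exactly these probabilities, so the "model"
# is well calibrated by construction.
event <- rbinom(n, size = 1, prob = predicted_risk)

# Group observations into tenths of predicted risk.
risk_group <- cut(predicted_risk,
                  breaks = seq(0, 1, by = 0.1),
                  include.lowest = TRUE)

# For each group: average predicted risk vs. observed proportion of events.
calibration_table <- data.frame(
  risk_group          = levels(risk_group),
  mean_predicted      = as.numeric(tapply(predicted_risk, risk_group, mean)),
  observed_proportion = as.numeric(tapply(event, risk_group, mean))
)

print(calibration_table)
```

For a well-calibrated model the two numeric columns should be close in every row; large gaps in some risk groups would suggest that predicted risks are too high or too low there.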

Cristina reposted

Maarten van Smeden

@MaartenvSmeden

Jan 16, 2020

You can read about it in @ESteyerberg's book, and @BenVanCalster wrote quite a few things about it, see: ncbi.nlm.nih.gov/pubmed/26772608 ncbi.nlm.nih.gov/pubmed/31842878

MaartenvSmeden's tweet card. Efforts are required to avoid poor calibration when developing prediction models, to evaluate calibration when validating models, and to update models when indicated. The ultimate aim is to optimize...

Calibration: the Achilles heel of predictive analytics - PubMed

Source: pubmed.ncbi.nlm.nih.gov

Cristina reposted

Steph Locke

@TheStephLocke

Jan 7, 2020

Instead of referring to myself as self-taught, I'm gonna start referring to myself as community-taught. The sites, the blogs, the books, the user groups, the confs, the forums ... all community efforts that I used to learn and advance my programming and data science knowledge.

Cristina reposted

Maarten van Smeden

@MaartenvSmeden

Jan 6, 2020

Computer: change your password
Me: **********
Computer: new password does not meet requirements
Me: ****************
Computer: new password does not meet requirements
Me: **************************
Computer: new password does not meet requirements
Me:

Cristina reposted

Jenny Bryan

@JennyBryan

Jan 2, 2020

The null-coalescing operator %||% is in the miscellany section of this talk on making conditional logic easier to read and maintain (links to a specific slide): speakerdeck.com/jennybc/code-s… %||% is key to some nice design patterns eg, using NULL as the default val of optional args.

JennyBryan's tweet card. Talk for useR!2018 in Brisbane: https://user2018.r-project.org by Jenny Bryan Twitter: https://twitter.com/jennyBryan/ GitHub: https://github.com/j…

Code Smells and Feels

Source: speakerdeck.com
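
The pattern mentioned in the tweet above, using NULL as the default value of an optional argument, might look like the following. This is a minimal sketch: the operator is defined locally so the example does not depend on a particular package or R version, and `describe()` and its `label` argument are hypothetical names chosen for illustration, not taken from the talk.

```r
# Local definition of the null-coalescing operator:
# return x unless it is NULL, in which case fall back to y.
`%||%` <- function(x, y) if (is.null(x)) y else x

# Hypothetical function with an optional argument that defaults to NULL.
describe <- function(x, label = NULL) {
  # Use the caller's label if one was supplied, otherwise a fallback.
  label <- label %||% "value"
  paste0(label, ": ", x)
}

describe(42)                    # "value: 42"
describe(42, label = "answer")  # "answer: 42"
```

Keeping NULL as the default leaves the fallback logic inside the function body, where it can be changed without touching the signature.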

Cristina

@biologeek

Dec 26, 2019

This is awesome! And happy to see @thomasp85 #geneRativeart projects included in the list 🎨💻

Cristina reposted

Robert Plenge

@rplenge

Dec 20, 2019

Introducing PheCAP, a high-throughput semi-supervised phenotyping pipeline. PheCAP starts with EMR data (structured and NLP) and outputs a phenotype algorithm, the probability of the phenotype for all patients, and a phenotype classification (yes or no). nature.com/articles/s4159…

Cristina reposted

Xavier Domingo-Almenara

@XaviDomingo1

Dec 20, 2019

Interested in small molecule retention time prediction? Check out our new paper in @NatureComms introducing the SMRT dataset, containing the experimental RT of 80K molecules generated in @kadzuis lab at @scrippsresearch #metabolomics #MachineLearning nature.com/articles/s4146…