pydatatable's profile picture. Python library for data munging

datatable

@pydatatable

Python library for data munging

datatable أعاد

Much scarier than the small number of evil people is the large number who will fall in line with whatever's fashionable. When kindness is in fashion, they're kind. When a form of bigotry becomes cool, they jump on board. When it's popular to condone violence, they're all for it.


datatable أعاد

I never realized this consciously before now, but you have to be independent-minded to be good. Otherwise you'll be drawn into doing bad things out of conformism.


datatable أعاد

Some people have asked why I am tweeting about “politics”. The reason is that I have vulnerable family members and they and others like them may not survive this epidemic in large part due to the Trump administration’s gross incompetence, cruelty, and/or criminality.


datatable أعاد

I really appreciate some of the nice little aesthetic touches in #pydatatable. Still not as mature (or familiar) as #rdatatable, but I'm enjoying it so far. Thanks, @h2oai!


datatable أعاد

Our own @pstetsenko & Oleksiy Kononenko presenting Data.Table #H2OWorld. Watch the live stream from Godel-Pauling Stage here: bit.ly/H2OSFLive

h2oai's tweet image. Our own @pstetsenko & Oleksiy Kononenko presenting Data.Table #H2OWorld. Watch the live stream from Godel-Pauling Stage here:  bit.ly/H2OSFLive

datatable أعاد

This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets. paperswithcode.com/sota

seb_ruder's tweet image. This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets.
paperswithcode.com/sota
seb_ruder's tweet image. This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets.
paperswithcode.com/sota
seb_ruder's tweet image. This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets.
paperswithcode.com/sota
seb_ruder's tweet image. This is a super cool resource: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks I've seen including 140+ tasks and 100 datasets.
paperswithcode.com/sota

datatable أعاد

If an eye doctor looked at a retinal photo, the chance of getting gender correct would be 50-50. But deep learning training led to an AUC of 0.97 @pearsekeane pointed out how striking this is @JeffDean at a recent #AI @GoogleAI meeting; data: nature.com/articles/s4155… @NatBME #OA

EricTopol's tweet image. If an eye doctor looked at a retinal photo, the chance of getting gender correct would be 50-50.
But deep learning training led to an AUC of 0.97 
@pearsekeane pointed out how striking this is @JeffDean at a recent #AI @GoogleAI meeting; data: 
nature.com/articles/s4155… @NatBME #OA

datatable أعاد

Apparently, Slack doesn't have any plans to implement code highlighting for back ticks. Please retweet if you want code highlighting with ```js or ```php or ```py or anything else that GitHub supports. Let's show Slack how badly we need this.

Got it. We don't have immediate plans to support specific languages when using backticks, but we'll pass the suggestion along!



Our own benchmarks show that the sentinel and bitmask methods go neck and neck with each other (except when the number of NAs is very low, in which case the bitmask method has an upper hand). Check the results at github.com/st-pasha/micro…

github.com

GitHub - st-pasha/microbench-nas: Benchmark performance of NAs stored as sentinel values, vs....

Benchmark performance of NAs stored as sentinel values, vs. stored as a bitmask - st-pasha/microbench-nas

New post! "Is it time to stop using sentinel values for null (NA) values?". The great NaN vs. bit/byte-mask debate wesmckinney.com/blog/bitmaps-v… @ApacheArrow #pydata #rstats

wesmckinn's tweet image. New post! "Is it time to stop using sentinel values for null (NA) values?". The great NaN vs. bit/byte-mask debate wesmckinney.com/blog/bitmaps-v…

@ApacheArrow #pydata #rstats


datatable أعاد

Very excited that Facebook has just become a Principal Sponsor of the Python Software Foundation: python.org/psf/sponsorshi…


datatable is at 1000 commits today! 🎉 On an unrelated note, "Round Number Bias" is a term in psychology for humans' proclivity to pay special attention to numbers that are round.


datatable أعاد

Folks, the #rdatatable spirit is now available in Python too! It's in Alpha stage, so you are encouraged to report bugs and submit proposal! Special thanks to @MattDowle for his effort in developing intuitive and efficient libraries for #DataScience Link-> github.com/h2oai/datatable

github.com

GitHub - h2oai/datatable: A Python package for manipulating 2-dimensional tabular data structures

A Python package for manipulating 2-dimensional tabular data structures - h2oai/datatable


Selecting columns of a certain type from a Frame is easy, the type itself can be used as the selector: ``` DT[..., str] ```


datatable أعاد

#rstats #rdatatable dev version now has a simple typo checker to check for potential misspellings in your i query. Implementation is quite rudimentary for now -- all feedback is welcome/appreciated! Give it a spin! github.com/Rdatatable/dat…


Loading...

Something went wrong.


Something went wrong.