
OnDataEngineering

@OnDataEng

Independent, critical and technical thinking on the use cases, architectural patterns and technologies relating to the preparation of data for exploitation.

Cloudera have announced the release of #ClouderaStreamsManagement - bundling their Kafka management console and replication tool #odenews blog.cloudera.com/announcing-the…


From the ever reliable The Morning Paper - Procella, YouTube's unified OLTP/OLAP (HTAP) database #odenews blog.acolyer.org/2019/09/11/pro…


Interested in replicating data between #Kafka clusters - Cloudera have a post on MirrorMaker 2, which is based on #KafkaConnect #odenews blog.cloudera.com/a-look-inside-…


#ApacheCalcite 1.21 is out - you might never have heard of it, but it's probably being used by many of the data tools you use on a daily basis for query parsing and optimization #odenews calcite.apache.org/news/2019/09/1…


StreamSets have announced #StreamsetsTransformer - a graphical tool for creating #ApacheSpark pipelines that's part of their DataOps Platform #odenews @streamsets streamsets.com/blog/streamset…


Using #GoogleCloudStorage with #Hadoop - Google have a new version of their Cloud Storage Connector for Hadoop out with a bunch of performance improvements and locking for directory modifications #odenews cloud.google.com/blog/products/…


From the ever excellent The Morning Paper, a review of a paper that used "the TPC-H benchmark to assess Redshift, Redshift Spectrum, Athena, Presto, Hive, and Vertica to find out what works best and the trade-offs involved" #odenews blog.acolyer.org/2019/08/30/cho…

