Big Data News - Apache Ignite™, High Fashion PCA’s and Bloom Filters… tailoring your big data approach!

Uli Bethke Big Data, Data Science, Data Science, DFS, Distributed Computing

Significantly reduce your storage requirements using PCA and exponentially ignite your processing speed on Spark   An innovative use case for principal component analysis from Eigen Style was delivered in a blog post by Grace Avery. The objects in question are pictures of models modelling womenswear and how components of the pictures can be "modelled" across many pictures using principle ...

Big Data News - To Data Lakes and Beyond!!

Uli Bethke Big Data, Computing, Data Science, Data Science

How data lakes become toxic to the future of computing and a menu of approximation algorithms As we reach the end of another week, much is still happening in our community despite it being “holiday season”. The sun never sets on great ideas and the sharing of knowledge, which is why we would like to shine a light on some ...

Big Data News - MesosCon (Europe), Unix Philosophy of Distributed Data and Advanced Algorithms

Uli Bethke Big Data, Data Science, Unix

Apache Mesos (Europe) Conference Announcement, Martin Kleppmann on Distributed Unix Philosophy and Advanced Algorithms… today’s view into tomorrow’s world!!..  So as the week draws to a close, we are left with a feeling that the pace of understanding in our community and the progress it brings has not slowed one bit! Accordingly, we finish the week on a good note ...

Technology with Databricks, Xpoint™ and Doradus…

Uli Bethke Big Data, Cloud

Technology and Big Data movers... A week of wonder in review... This week saw a number of exciting ideas and announcements reach the Tech community that signals how it can positively impact the world. Taking a look at some of the highlights, I think we can get excited about the following: Burak Yavuz , Software Engineer at Databricks blogged about ...

Self-Service Analytics in the Data Lake

Uli Bethke Big Data, Business Intelligence, Data Warehouse, Hadoop, MapR, Spark

Deriving Value from your data... We all know that decision makers and analysts need quick access to all of the structured and unstructured data in an enterprise. Typically this data is locked away in various source systems and can’t be queried from a single central location The data warehouse set out to fix this problem ages ago and it has ...