Spark and Hadoop in Risk Line of Business at Bank of America

Anvesh Gali Big Data, Business Intelligence, Data Science, Data Warehouse

Do you want to gain knowledge about Big Data? Do you want to dig into the field of Risk Line of Business at Bank of America? Come join us to explore these questions. Presentation 1 Andrea Fagan will talk about the history of big data in the Risk Line of Business at Bank of America. What works and what doesn't. ...

Big Data News: Streaming in the Extreme.. An evolution in Data Processing and Analytics

Uli Bethke Apache, Big Data, Data Science, Data Science, MapR, Technology

Google’s Dataflow have submitted a project proposal to open source Dataflow through the Apache Software Foundation along with MapR on Streaming across Data Centers. As another week comes to a close, the wheels of our big data community continue to move in cycles of innovation and progress, which as always never fail to impress. Google’s Tyler Adikiu brought us through ...

Big Data News: Apache Samza V 0,0.10 Release and Dataiku on great Predictive Modelling for Healthcare

Uli Bethke Apache, Big Data, Business Intelligence, Data Discovery, Data Science, Data Science

Apache Samza Release of V 0,0.10 and Dataiku’s Free eBook on how great Predictive Modelling projects are done in Healthcare As the week draws to a close, the team here at Sonra have once again been impressed by the recent developments our industry has presented our community with. Apache has launched their new release of Samza V0, 0.10. This big ...

Big Data News: Yahoo’s Data Sketching and Apache Spark 1.6

Uli Bethke Apache, Big Data, Community, Data Science, Data Science, Hadoop, Hive, Open Source Software, Technology

Launching 2016 in style with an exploration of Yahoo’s successful scaling of aggregate computational queries using data sketching libraries to Apache Spark releasing Spark 1.6 Firstly, the team at Sonra would like to wish you and yours every success in 2016. As the arrow of time pushes us forward, our Big Data industry is forging ahead in a cycle of ...

Big Data News - Apache Ignite™, High Fashion PCA’s and Bloom Filters… tailoring your big data approach!

Uli Bethke Big Data, Data Science, Data Science, DFS, Distributed Computing

Significantly reduce your storage requirements using PCA and exponentially ignite your processing speed on Spark   An innovative use case for principal component analysis from Eigen Style was delivered in a blog post by Grace Avery. The objects in question are pictures of models modelling womenswear and how components of the pictures can be "modelled" across many pictures using principle ...