Data preparation is a technology somewhere between a sleepy backwater and an active no man’s land. As Hadoop and other odd new takes on data processing have come on line, as the data that the enterprise is dealing with has become more various and voluminous, data preparation methods have begun to change. Check out Ed Burns and Jack Vaughn for this edition of the Talking Data Podcast, which digs into Hadoop-on-SQL, data curation in the hands of Michael Stonebraker, and the effect of machine learning on the data preparation mileu.
Today we go back into the vaults to enjoy a Talking Data podcast of yesteryear. In the summer of 2013, our editors visited Information Builders’ Summit, GraphConnect, and RedHat Summit, and lived to tell the tale. Under discussion were multitenant data bases, HTML 5, and – can you believe it? – Hadoop!
A fellow who used Front Page to build 5 fair-sized Web sites back in the mid-1990s can attest to Microsoft’s ability to bring better technology to the wider public. Front Page is often joked about now, but it was basically a $5000 (or more) 4GL tool that Microsoft bought (from Vermeer), in the process reducing the price to $99. We mention this because, in the Spring, Microsoft CEO Satya Nadella talked about Microsoft’s enduring mission to make technology available to the masses. It is an assertion that has some grounding, but it is hard to speculate whether Microsoft can find that kind of magic again. This podcast is a recap of Ignite 2015 and Build 2015 SQL Server and Azure announcements that look to move the traditional mission forward. Can Microsoft steal a march on Amazon Web Services? Today, developers’ ability to whip out a credit card and build an app quickly on the cloud may be the closet analogy to the effect Microsoft had when its software crept into the organization without first going through IT, then known as the Glass House. But in the cloud, Amazon has the lead. – Jack Vaughan
The long term prospects of MongoDB and other NoSQL databases will be judged by feature improvements for enterprises. We saw such updates when we visited this year’s MongoDB World convocation. A case study from ADP is discussed in the podcast.
At last week’s Spark Summit, both Databricks, whose founders gave the original Spark framework, and IBM introduced cloud-hosted versions of Spark. These announcements created quite a stir among data scientists who are hungry for the processing power of Spark, but who would rather not put in the time and effort of managing the clusters.
Take a listen to this edition of Talking Data to hear more about why users are so excited about the prospect of hosted Spark implementations and how it could change computing.
Content strategist Diana Hwang joins the Talking Podcast crew to talk about data journalism. Hwang and editor Jack Vaughan took part in a data journalism workshop sponsored by the New England Science Writers group, and they outline their experiences with Python, CartoDB, GeoJSON … and this strange new software they’ve found called ‘Excel’!
Everyone wants to be data-driven these days, including the management. While most of the time this is a positive trend, it can have a darker side. In this edition of Talking Data, we discuss some of the ways businesses are bringing analytics into their management strategies, how these initiatives can work and how they can disadvantage workers when done poorly.
In the podcast, the editors discuss the NoSQL data modeling experiences and advice of users and consultants who took part in the recent Enterprise Data World 2015 conference in Washington D.C. What are the highlights? Use Agile practices – use DevOps – embrace your inner Google.
Most discussions of the ethical implications of big data center around the risks to privacy. But in this edition of Talking Data, we explore some of the other issues, which were explored in a presentation at the Brandeis Analytics 360 Symposium by professor Robert Carver.
Technology analyst Judith Hurwitz recently penned “Cognitive Computing and Big Data Analytics,” which is an in-depth look at a many-headed emerging technology. She told us cognitive computing is, in fact, an amalgamation of many things. Individual aspects can be challenging. But, according to Hurwitz, taken as a whole, this new computing paradigm shows promise.