Mark Rittman is joined by industry analyst Mark Madsen to talk about marketing analytics and the rise of the omni-channel consumer, the use of AI in analytics and personalization, and what this all means for brands, for advertisers and for marketers.
Mark is joined by Qubit colleague Will Browne to talk about a recent academic paper co-authored with Mike Swarbrick Jones on conversion optimisation techniques in the eCommerce industry. Using analytics and statistical analysis on 20 billion "user journeys" recorded in Qubit's Google Cloud Platform-hosted Customer Data Store, this paper compares techniques using data and machine learning to those based on traditional sales techniques to see whether data trumps emotion ... or whether both have their place.
Mark is joined by returning special guest Dan McClary to talk about data modeling and database design on distributed query engines such as Google BigQuery, the underlying Dremel technology and columnar storage format that enables this cloud distributed data warehouse-as-a-service platform to scale to petabyte-size tables spanning tens of thousands of servers, and techniques to optimize BigQuery table joins using nested fields, table partitioning and denormalization.
- Dremel: Interactive Analysis of Web-Scale Datasets
- BigQuery under the hood
- Inside Capacitor, BigQuery’s next-generation columnar storage format
- Drill To Detail Ep.2. 'Future Of SQL On Hadoop', With Special Guest Dan McClary
- Google BigQuery, Large Table Joins and How Nested, Repeated Values and the Capacitor Storage Format (and Looker) Saves the Day
Oracle's Jack Berkowitz joins Mark Rittman to talk about a new category of continuously adapting, self-learning applications being built out by Oracle that use machine learning together with enterprise and third-party data to create a new generation of intelligent HR, CX, SCM and ERP SaaS apps.
Stewart Bryson returns to the show to join Mark Rittman to discuss new-world BI and data warehousing development using Google BigQuery and Amazon Athena, Apache Kafka and StreamSets, and talks about his experiences with Looker, the cloud-native BI tool that brings semantic modeling and modern development practices to the world of business intelligence.
Mark Rittman is joined by Maxime Beauchemin to talk about analytics and data integration at Airbnb, the Apache Airflow and Superset open-source projects he helped launch and now works with day-to-day at Airbnb, and his recent Medium article on "The Rise of the Data Engineer".
- "The Rise of the Data Engineer" blog by Maxime Beauchemin
- Apache Airflow
- Airbnb Superset
- "Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department" blog by Jeff Magnusson
Mark Rittman is joined by Daniel Mintz from Looker to talk about BI and analytics on Google BigQuery, data modelling on the new generation of cloud-based distributed-data warehousing platforms, and Looker's re-introduction of semantic models to big data analytics developers.
Mark Rittman is joined in this episode by MapR's Tugdual Grall to talk about MapR's platform differentiation and relationship with open-source Hadoop, scaling and streaming, microservices, and MapR's platform strategy around big data workloads in the cloud.
Mark Rittman is joined by Elastic's Mark Walkom to talk about Elasticsearch, Kibana, Logstash and the Elastic Stack; business models built around an open-source software core; and their move into cloud services with Elastic Cloud.
Mark Rittman is joined by Gartner analyst and ex-Oracle Database Cloud Service PM Rick Greenwald to talk about IT’s continuing (and essential!) role in corporate BI and DW deployments and the debate around Mode 1 vs. Mode 2 analytics, how we got here, and the future of data warehousing database platforms as we move into the cloud.