Enterprise Governance on Data Lake with Unity Catalog & Databricks Implementation with R Jobs Migration Use Case
Enterprise Governance on Data Lake with Unity Catalog & Databricks Implementation with R Jobs Migration Use Case A big thank you to Vinny Vijeyakumaar from Databricks and Deon Jacobs from Data-Driven for a great presentation at the first in person Sydney Databricks meetup last July 14, 2022. What was covered: Enterprise Governance on Data Lake with Unity Catalog […]
Deep dive into building a Data Lakehouse with Delta Lake and Spark
Deep dive into building a Data Lakehouse with Delta Lake and Spark Watch the recorded webinar that deep dives into building a data lakehouse with Delta Lake and Spark. https://youtu.be/mrHfdeH6az0 A big thank you to Jonathan Neo for a great presentation at the October Sydney Databricks meetup. Here’s what was covered: What makes up the Lakehouse architecture […]
Enabling Self-Service Analytics & ML at Transport for NSW with Databricks
Enabling Self-Service Analytics & ML at Transport for NSW with Databricks Sydney Databricks Meetup – December 2020 In this business-focused Databricks Meetup, Shelby Ferson (Sr. Manager ANZ – Databricks), Sandeep Mathur (Program Manager – TfNSW) and Rodney Joyce (Practice Director – Data-Driven) discuss how Transport for NSW (TfNSW) leveraged Databricks to enable Self-Service Analytics and […]
Databricks and Data-Driven Announce Strategic New Partnership
Sydney, Australia – September 17, 2020 – Data-Driven AI has announced a strategic partnership with Databricks, the leader in Unified Data Analytics to deliver Azure Databricks solutions to it’s customers. This partnership combines Databricks’ simplified approach to data science/analytics with Data-Driven’s precision engineering and consulting services to enable smarter and better outcomes for clients. The business […]
Azure Cognitive Services Sentiment Analysis v3.0 using Databricks PySpark
Azure Cognitive Services Text Analytics is a great tool you can use to quickly evaluate a text data set for positive or negative sentiment. For example, a service provider can quickly and easily evaluate reviews as positive or negative and rank them based on the sentiment score detected.
Things You Wish You Had Known Earlier About Databricks Performance
A big thank you to Jixin Jia (Gin), Databricks Solution Architect for a brilliant presentation, one that I personally found very interesting and learn a few things I didn’t know. Watch the online video recording to learn more about how to improve Databricks performance! To pick your interest, here are some of topics covered: 1. […]
I know what a Data Scientist is… but what the heck is a Machine Learning Engineer?!
“IT” This has got me by for the past 20 years when asked by various relatives and friends exactly what it is that I do. It does mean I have to “fix” working computers, install virus scanners, get printers working (throw it away), and fix iTunes for my mum on a regular basis and generally […]
Data Science for Dummies – Data Science Overview with Databricks (Tech Talk 1 of 9)
You might have heard of Spark and how it’s the evolution of Hadoop… great for processing Big Data…. but have you heard of Databricks? Here are the slides for the next tech talk in Data Science for Dummies series I am presenting around Sydney: Part 1 of 9: Data Science Overview with Databricks Think Spark-as-a-service, […]
Data Science for Dummies – Data Engineering with Titanic dataset + Databricks + Python (Tech Talk 3 of 9)
I put together a tech talk on Machine Learning and Databricks which is the 3rd part of an 9 part Data Science for Dummies series: Data Engineering with Titanic dataset + Databricks + Python. Preparing & feature engineering highlighted the importance of domain knowledge, even with something as simple as a 10 column dataset! It […]