Advanced

Databricks Meetup Image 1200x627 3 • Data and AI Analytics

Enterprise Governance on Data Lake with Unity Catalog & Databricks Implementation with R Jobs Migration Use Case​

Marketing and Sales Director @Data-Driven. I am a passionate professional who strives to deliver excellent customer service, solve problems using modern technology and deliver practical solutions under resource and time constraints.
Sofia Oropeza

Enterprise Governance on Data Lake with Unity Catalog & Databricks Implementation with R Jobs Migration Use Case A big thank you to Vinny Vijeyakumaar from Databricks and Deon Jacobs from Data-Driven for a great presentation at the first in person Sydney Databricks meetup last July 14, 2022. What was covered: Enterprise Governance on Data Lake with Unity Catalog …

Enterprise Governance on Data Lake with Unity Catalog & Databricks Implementation with R Jobs Migration Use Case​ Read More »

campaign creators e6n7uoEnYbA unsplash • Data and AI Analytics

Sharepoint Integration: How to Share and Ingest Data Automatically into a Data Platform

I am skilled at developing solutions that leverage the power of Modern Data Platforms on cloud. My expertise on Azure cloud allows me to orchestrate solutions using Logic Apps, Azure Data Factory, Databricks and ADLS Gen2. My emphasis is always on increasing value to the customers.
Yash Tamakuwala

Are you still sharing files with your peers on teams and maintaining them in your local system? If yes, then perhaps you could use SharePoint to remove the hassle of organising files and managing documents efficiently. SharePoint is a Microsoft Office web platform that allows users to share files and documents in a collaborative fashion. …

Sharepoint Integration: How to Share and Ingest Data Automatically into a Data Platform Read More »

Deep dive into building a Data Lakehouse with Delta Lake and Spark

Deep dive into building a Data Lakehouse with Delta Lake and Spark

Marketing and Sales Director @Data-Driven. I am a passionate professional who strives to deliver excellent customer service, solve problems using modern technology and deliver practical solutions under resource and time constraints.
Sofia Oropeza

Deep dive into building a Data Lakehouse with Delta Lake and Spark Watch the recorded webinar that deep dives into building a data lakehouse with Delta Lake and Spark. https://youtu.be/mrHfdeH6az0 A big thank you to Jonathan Neo for a great presentation at the October Sydney Databricks meetup. Here’s what was covered: What makes up the Lakehouse architecture …

Deep dive into building a Data Lakehouse with Delta Lake and Spark Read More »

A_20Developers_20Guide_20to_20Building_20AI_20Application_ebook_thumb.jpg

A Developer’s Guide to Building AI Application

Artificial Intelligence is rapidly becoming a mainstream technology. Read this developer’s eBook for an introduction to the tools, infrastructure and services in the Microsoft AI platform that allow you to create intelligent applications.

Topics covered:
• How the intersection of cloud, data and AI enables organizations to build intelligent systems
• The tools, infrastructure, and services available as part of the Microsoft AI platform
• How to teach your bot/application new AI skills
• How ONNX may be relevant to your AI work

#datadriven #AI #MicrosoftAI #Azure

Databricks August Meetup - Databricks Performance Tuning with Gin Jia

Things You Wish You Had Known Earlier About Databricks Performance

Azure-certified Data Architect with a focus on delivering business value and guiding customers through the maze of analytical architectures, design and implementation activities.

Experienced in setting up modern data platforms with advanced predictive analytic workloads. Brings strong people skills and a devops-centric, entrepreneurial approach to Enterprise software delivery.


Rodney Joyce

A big thank you to Jixin Jia (Gin), Databricks Solution Architect for a brilliant presentation, one that I personally found very interesting and learn a few things I didn’t know. Watch the online video recording to learn more about how to improve Databricks performance! To pick your interest, here are some of topics covered: 1. …

Things You Wish You Had Known Earlier About Databricks Performance Read More »

Delta Lake Performance

Databricks Performance: Fixing the Small File Problem with Delta Lake

Anjana Rupasinghege is the Technical Director and Lead
Architect at Data Driven, specialised in Cloud, Security, Data
and Analytics.

With a background in Azure modern data architecture, he
has over 15 years of experience working in Information
Technology in industries such as Government, Banking,
Telecommunication and Consulting.

Anjana Rupasinghege
Latest posts by Anjana Rupasinghege (see all)

A common Databricks performance problem we see in enterprise data lakes are that of the “Small Files” issue.  One of our customers is a great example – we ingest 0.5TB of JSON and CSV data per day made of 5kb files which equates to millions of files a week in the data lake Raw zone. …

Databricks Performance: Fixing the Small File Problem with Delta Lake Read More »

Subscribed! We'll let you know when we have new blogs and events...