About
I help customers design, build, and optimize data platforms and products using DevOps first principles.
I have seven years of experience in data engineering, big data, and cloud computing, working with various tools and technologies such as Apache Spark, Terraform, Airflow, Databricks, Anyscale, Snowflake, and EMR (with Databricks being the best by far ☺️) both on AWS, and Azure.
I have a solid understanding of the security aspects of running data platforms in the cloud, and am always continuing to evolve my skills, recently getting into SRE for Data Engineering.
I am also an avid writer and educator, sharing my insights and expertise on data engineering testing on Medium, where I have won multiple awards for my series of publications. I am passionate about applying test driven development (TDD) to data workflows and pipelines, ensuring quality, reliability, and efficiency.
I hold an Honours degree in Freshwater Biology, a Bachelor's degree in Computer Science from Monash University, a AWS Solutions Architect Associate certification, and a Databricks Data Engineering Associate certification.
Also in case you’re wondering, while I am an avid cyclist, I don’t shave my eyebrows for the aero gains. I have an autoimmune condition known as Alopecia Universalis 😅. If you want to know more or even donate to the charity for it goto aaaf.org.au.
Contributions
Activity
-
Databricks' platform makes it really easy to get state-of-the-art AI performance by working on all aspects of the problem, from data to model tuning.…
Databricks' platform makes it really easy to get state-of-the-art AI performance by working on all aspects of the problem, from data to model tuning.…
Liked by David O'Keeffe
-
The announcement by Meta to sunset Workplace hits all the feels for me. For 4 years I worked hand in hand with Vicky Skipp and my secret (or not so…
The announcement by Meta to sunset Workplace hits all the feels for me. For 4 years I worked hand in hand with Vicky Skipp and my secret (or not so…
Liked by David O'Keeffe
-
Yet another reason to make sure you keep up to date with the latest in Delta Lake. It’s basically the gift that keeps on giving, remember, time is…
Yet another reason to make sure you keep up to date with the latest in Delta Lake. It’s basically the gift that keeps on giving, remember, time is…
Shared by David O'Keeffe
More activity by David
Building on the features released in Delta Lake 3.0 and 3.1, Delta Lake 3.2 introduces a host of performance enhancements and optimizations! In our…
Liked by David O'Keeffe
In feverish desperation to build teams that aren't a net negative out of a highly variable market, I frequently see posts (similar to this) making…
Liked by David O'Keeffe
I'm betting big on Databricks in the long term. Snowflake aced SQL, but building a good SQL engine is trivial nowadays, have you heard of duckDB?…
Liked by David O'Keeffe
We've rewritten Ray from scratch a few times (actually from scratch). I was asked on Twitter about lessons we learned from that process and changes…
Liked by David O'Keeffe
This is super interesting and valuable. DataKitchen is one of very few companies that is not a tech giant, but nevertheless practices and evangelises…
Liked by David O'Keeffe
Calling all #Databricks Partners! Customers are using Azure Databricks and Microsoft Fabric to solve their biggest data, analytics, and AI…
Liked by David O'Keeffe
A modern data stack architecture is the next step in the evolution of the data stack. Discover the scalability, efficiency, and enhanced governance…
Liked by David O'Keeffe
Great new article by Li (Luke) Yu on how to use #ray to perform feature extraction for #llms on #databricks: https://lnkd.in/d5qv3Xmk Key insights…
Liked by David O'Keeffe
I’ve had a few customers ask me how to do dynamic SQL based reports in Power BI on Databricks SQL, and these use cases are typically more complex and…
Liked by David O'Keeffe
Given the rapid evolution in AI and other technologies, particularly with the rise of GenAI, it's challenging for individuals to keep up. This is…
Liked by David O'Keeffe
An oldie but goodie. Introduction to Unit Testing PySpark. I should write more testing content but it doesn't seem that exciting. Although the 24k…
Liked by David O'Keeffe
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named David O'Keeffe in Australia
-
David O'Keeffe
Positive Speakers Bureau Coordinator at Living Positive Victoria
-
David O'Keeffe
-
David O'Keeffe
Head of Design / Interior Designer
-
David O'Keeffe
--
13 others named David O'Keeffe in Australia are on LinkedIn
See others named David O'Keeffe