The Databricks Data Engineer

The Databricks Data Engineer

Databricks Data Engineer Interview: A Broadcast Join Case Study

Why a 50MB table can cause a driver OOM crash, and how concurrency is the real culprit.

Jakub Lasak's avatar
Jakub Lasak
Nov 20, 2025
∙ Paid

Senior Databricks Interview: ‘I forced a Broadcast Join on a tiny 50MB table. The driver crashed with OOM. Why?’

(Hint: It’s not the table size).

A junior engineer blames the configuration. A senior engineer blames the concurrency.

Here is the offer-winning breakdown:

❌ The Junior Answer (Rejected) “The table must have been bigger than you thought, or spark…

User's avatar

Continue reading this post for free, courtesy of Jakub Lasak.

Or purchase a paid subscription.
© 2026 Jakub Lasak Consulting · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture