Asked about how PySpark internally works basically the architecture of PySpark.
Data Engineer Interview Questions
21,063 data engineer interview questions shared by candidates
Basic Basic questions only they asked
Asked me to model a program in scala
Intermediate SQL queries based on the take-home assignment. I was also asked a window function query.
What is a window function? Joins vs subqueries. Outer vs inner joins. Infrastructure based questions.
Spark and hive optimization techniques, partitioning and bucketing concepts, small programs in pyspark
1. How would you determine the frequency of each unique element in a list? 2. A problem that can be solved using a SQL window function, and explain your approach?
How does memory management work in Python?
Explain the join and union and describe the differences between them.
Synchronization of two different data frame and asking the complexity of the algorithm. Domain interview: bunch of pyspark related questions and SQL simple query Second programming interview: Find if a path exists in a Graph
Viewing 1221 - 1230 interview questions