SQL & PySpark Equivalent
ConceptSQLSpark / PySpark
SELECTSELECT column(s) FROM table;
SELECT * FROM table; | df.select("column(s)")
df.select("*") |
| DISTINCT | SELECT DISTINCT column(s) FROM table; | df.select("column(s)").distinct() |
| WHERE | SELECT column(s) F...
blog.naveenpn.com4 min read