PySpark -Convert SQL queries to Dataframe
In PySpark, you can run dataframe commands or if you are comfortable with SQL then you can run SQL queries too. In this post, we will see how to run different variations of SELECT queries on table built on Hive & corresponding Dataframe commands to replicate same output as SQL query. Let’s create a dataframe […]
PySpark -Convert SQL queries to Dataframe Read More »