Welcome to my PySpark Toolbox!
Hi everyone!
This toolbox is a collection of PySpark functions that I regularly use in my daily work. I'd like to share them with you as a small token of appreciation for the amazing Apache Spark & Azure communities.
Feel free to explore and hope you find something useful to streamline your Spark workflow!
Happy coding!
Max
001: Input/Output
- Get data from Rest Api
- Save Dictionary to Azure Data Lake in JSON format
002: DataFrame Transformation
- Basic DataFrame transformation
- Create aggregation columns from child DataFrame
Top comments (0)