Exploring Data Explorer pool data with Python
In the last few years, Python has gained significant popularity in the data science community. This happened for several factors, including the simplicity of the language syntax, broad community adoption, and the availability of a vast range of mathematics and statistics libraries, among other reasons. In conjunction with the R, Scala, and SQL languages, Python has become one of the primary programming languages used in data analysis, and for artificial intelligence (AI) and machine learning (ML) applications—a space that was dominated by MATLAB, especially in academia, for decades.
Azure Synapse offers support for data exploration using Python, Scala, C#, SQL, and R. Data exploration with these languages is handled by the Apache Spark analytical engine in Azure Synapse. For this chapter, we will focus on data exploration with Python.
Support for Python in Azure Synapse is achieved through PySpark: a Python application programming...