-
Book Overview & Buying
-
Table Of Contents
Data Analysis with Python
By :
If data science is to continue to grow and graduate into a core business activity, companies must find a way to scale it across all layers of the organization and overcome all the difficult challenges we discussed earlier. To get there, we identified three important pillars that architects planning a data science strategy should focus on, namely, data, services, and tools:

Three pillars of dat science at scale
Setting a data strategy that enables data scientists to easily access high-quality data that's relevant to them increases productivity and morale and ultimately leads to a higher rate of successful outcomes.
The advantages are obvious: high reusability, easier maintenance, reduced time to market, scalability, and much more. In addition, this approach would fit nicely into a cloud strategy giving you a growth path as the size of your workload increases beyond existing capacities. You also want to prioritize open source technologies and standardize on open protocols as much as possible.
Breaking processes into smaller functions infuses scalability, reliability, and repeatability into the system.
Making the tools simpler to use contributes to breaking the silos and increases collaboration between data science, engineering, and business teams.