Data Science Tools

In the previous article, you had an overview of data science workflow.  Although knowing the process and workflow is very crucial, learning is incomplete without implementation. 

To do so, you need to learn and be aware of the data science tools that can help you achieve it.

What are the Tools?

For a beginner, learning to use the Data Sc tools can be both fun and challenging. 

There are a huge amount of ways to perform data science, however, this article would focus on the tools that are quite popular and easy to start with.


5 tools that can help you start with:


Python


There are a lot of programming languages that can help you start with, however, Python can achieve the same with less effort and that is why so many companies are using Python on a daily basis.



These have made Python a first choice from small enterprises to large enterprises.



Pandas


Pandas is an open-source data analysis tool built on top of Python. It is quite fast and provides a lot of functionalities to analyze data and make conclusions.



(image source: pandas )

Matplotlib & Seaborn

Data Visualization is the most important part of Exploratory data analysis ( EDA ) in the data understanding & preparation phase. For this, there are majorly two widely used libraries - Matplotlib and Seaborn.

Matplotlib is a popular tool for creating static, animated, and interactive visualizations. Seaborn is based on Matplotlib and provides feature-rich functionalities for statistical analysis.



Scikit-Learn


Data science is incomplete without predictive analytics and scikit-learn empowers this process by providing a set of reusable and efficient tools for all kinds of predictions. 



Python Anywhere

After model evaluation, the best-performing model should be deployed in the cloud to give access to this from anywhere. ( more on cloud computing)

Python Anywhere is a platform ( PAAS or platform as a service) that provides free deployment of ML models with a website link to access it from anywhere.


3-Week timeline