Introduction to Jupyter and libraries

What is jupyter?

Jupyter is the short form of Julia, Python, and R. It's open-source software that creates interactive notebook documents containing live code, equations, visualizations, media, and other computational outputs. Moreover, it supports more than 40 languages as of now.

What is pandas?

Pandas is an open-source Python library. It has functions for analyzing, cleaning, exploring, and manipulating data. Pandas has two types of data structures: 

a) Series -It's a one-dimensional array with indexes, it stores a single column or row of data in a Data Frame.

b) Data Frame - It's a tabular spreadsheet-like structure representing rows containing one or multiple columns.

Why use pandas?

Pandas allow us to analyze big data and make conclusions based on statistical theories. Pandas can clean messy data sets and make them readable and relevant. The process of cleaning messy data is called data munging or data wrangling.

 

Data Frame:
It represents data using rows and columns (tabular or Excel spreadsheet-like data).
                       


What is Matplotlib?
It is a library in Python that helps in data visualization.
1. What is data visualization?
It is a graphical representation of information and data. Visual elements like charts, graphs and maps provide an easy way to see and understand the trends, outliers and patterns in data.

What is Scikit-Learn?
It is an open-source library in Python. It is also known as the father of ML in Python programming. It is one of the core libraries in Python because without this library it would be very difficult for us to manually implement ML algorithms. Along with scikit-learn, we use a few libraries such as
1. Pandas  2. Numpy  3. Matplotlib  4.Scipy





 



































Comments

Popular posts from this blog

Basics of AI

Types of ML

Basics of Neural networks