Introduction to Jupyter and libraries
What is Jupyter?
The name Jupyter is a reference to Julia, Python, and R, the three core languages it was designed for. It's open-source software for creating interactive notebook documents that contain live code, equations, visualizations, media, and other computational outputs. Through language kernels, it supports more than 40 programming languages as of now.
What is pandas?
Pandas is an open-source Python library. It has functions for analyzing, cleaning, exploring, and manipulating data. Pandas has two main data structures:
a) Series - It's a one-dimensional array with an index (a label for each value). A single column or row taken from a DataFrame is a Series.
b) DataFrame - It's a tabular, spreadsheet-like structure made up of rows and one or more columns.
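To make the two structures concrete, here is a minimal sketch. The names, ages, and cities are made-up sample values, used only for illustration.

```python
import pandas as pd

# Hypothetical sample data, invented for this example.
labels = ["ann", "bob", "cy"]

# A Series: one-dimensional values, each with an index label.
ages = pd.Series([25, 32, 47], index=labels, name="age")

# A DataFrame: a table of rows and named columns sharing one index.
df = pd.DataFrame(
    {"age": [25, 32, 47], "city": ["Oslo", "Lima", "Pune"]},
    index=labels,
)

# Selecting one column or one row of a DataFrame gives back a Series.
age_column = df["age"]
first_row = df.loc["ann"]

print(type(age_column).__name__)  # Series
print(df.shape)                   # (3, 2): three rows, two columns
```

Running this in a Jupyter cell and typing `df` on the last line would display the table directly under the cell.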
Why use pandas?
Pandas allows us to analyze large data sets and draw conclusions based on statistical theory. It can also clean messy data sets and make them readable and relevant. The process of cleaning messy data is called data munging or data wrangling.
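As a rough illustration of data wrangling, the sketch below cleans a small, made-up table: it drops a row with a missing name, tidies the text, converts scores from strings to numbers, and removes a duplicate row. The column names and values are assumptions chosen for the example, and the steps shown are only one possible cleaning sequence.

```python
import pandas as pd

# Hypothetical messy data: stray spaces, a duplicate row,
# a missing name, and numbers stored as text.
raw = pd.DataFrame({
    "name": [" Ann", "bob ", "bob ", None],
    "score": ["90", "85", "85", "70"],
})

clean = (
    raw.dropna(subset=["name"])  # drop rows with no name
       .assign(
           # trim whitespace and capitalize names
           name=lambda d: d["name"].str.strip().str.title(),
           # convert score text to integers
           score=lambda d: d["score"].astype(int),
       )
       .drop_duplicates()        # remove repeated rows
       .reset_index(drop=True)   # renumber rows from 0
)

print(clean)
print(clean["score"].mean())  # 87.5
```

Once the data is clean, simple statistics such as the mean can be computed directly on a column.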