Learning Pandas

Pandas is a data analysis library for Python.
It is a tool for data wrangling (also called munging), designed for quick and easy data reading, manipulation, aggregation, and visualization.

It takes data from a CSV or TSV file or a SQL database and creates a Python object with rows and columns called a data frame. A data frame is very similar to a table in spreadsheet or statistical software such as Excel or SPSS.
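
As a minimal sketch of that step, the snippet below builds a data frame from CSV text with `pandas.read_csv`. The column names and values are made up for illustration; in practice you would pass a file path instead of the in-memory buffer.

```python
import io

import pandas as pd

# Made-up CSV content standing in for a real file on disk.
csv_text = """name,language,years_coding
Alice,Python,5
Bob,JavaScript,3
Carol,Python,8
"""

# read_csv accepts a path or any file-like object; sep="\t" would read TSV instead.
df = pd.read_csv(io.StringIO(csv_text))

print(df.shape)             # (rows, columns)
print(df.columns.tolist())  # column names taken from the header row
print(df.head())            # first rows of the data frame
```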

Below is a list of things that can be achieved using Pandas:

Indexing, manipulating, renaming, sorting, and merging data frames
Updating, adding, and deleting columns in a data frame
Imputing missing values and handling missing data (NaNs)
Plotting data with histograms or box plots
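
The operations in the list above can be sketched on a tiny, made-up data set. All column names and values here are hypothetical, and filling a missing value with the column mean is only one of several possible imputation choices.

```python
import numpy as np
import pandas as pd

# Hypothetical data; none of these names come from a real dataset.
people = pd.DataFrame(
    {
        "id": [1, 2, 3],
        "Name": ["Alice", "Bob", "Carol"],
        "salary": [5000.0, np.nan, 7000.0],
    }
)
countries = pd.DataFrame({"id": [1, 2, 3], "country": ["NZ", "AU", "US"]})

# Rename a column, then sort rows by salary (rows with NaN go last by default).
people = people.rename(columns={"Name": "name"})
people = people.sort_values("salary", ascending=False)

# Merge the two frames on their shared key column.
merged = people.merge(countries, on="id")

# Impute the missing salary with the mean of the known salaries.
merged["salary"] = merged["salary"].fillna(merged["salary"].mean())

# Add a derived column, then delete one that is no longer needed.
merged["salary_k"] = merged["salary"] / 1000
merged = merged.drop(columns=["id"])

# Use the name column as the index, then look up a row by label.
by_name = merged.set_index("name")
print(by_name.loc["Bob"])
```

Plotting is left out of this sketch because `DataFrame.plot` relies on Matplotlib, which is a separate install.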

Get sample data to work with

The Stack Overflow survey is a good sample dataset to start learning analysis with.

Browse to: https://insights.stackoverflow.com/survey/

Download the Zip file for any year and extract it on your machine.
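
Once the archive is extracted, loading it could look like the sketch below. The helper `load_survey` is hypothetical, and the default file name `survey_results_public.csv` is an assumption about what the archive contains; check the extracted folder and adjust it if your year uses a different name.

```python
from pathlib import Path

import pandas as pd


def load_survey(folder, filename="survey_results_public.csv"):
    """Read the survey CSV from the extracted folder into a data frame."""
    csv_path = Path(folder) / filename
    if not csv_path.exists():
        # Fail early with a clear message instead of a long pandas traceback.
        raise FileNotFoundError(f"Expected survey file at {csv_path}")
    return pd.read_csv(csv_path)


# Usage (hypothetical folder name; point it at wherever you extracted the Zip):
# survey = load_survey("data/survey")
# print(survey.shape)
```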

Install Pandas (in a Virtual Environment)

Create a new environment:

$ python3 -m venv pandas_env

And activate it:

$ source pandas_env/bin/activate

Install Pandas:

$ pip install pandas
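
To confirm the install worked, a quick check from the Python interpreter inside the activated environment might look like this:

```python
import pandas as pd

# Prints the installed version. An ImportError here usually means the
# virtual environment is not active or the install did not succeed.
print(pd.__version__)
```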

Jupyter Notebook

Jupyter Notebook is not required, but it makes it easier to view data in the browser.

Install Jupyter with:

$ pip install jupyterlab

Run it in a separate terminal with the same virtual environment activated, because Jupyter keeps running only as long as that terminal stays open:

$ jupyter lab

In the browser app, create a new Python3 Notebook.
Give it a name (instead of Untitled).

We are ready to use Pandas.

