Skip to main content

Pandas: Notes

Pandas Logo - Source: https://upload.wikimedia.org/wikipedia/commons/thumb/e/ed/Pandas_logo.svg/1200px-Pandas_logo.svg.png

Pandas is a Python library used for working with data sets. Pandas is derived from 'panel data' term.

https://pandas.pydata.org/ - Python Data Analysis Library.

Keyword: Dataframe

# Install pandas
python -m pip install pandas

pd.head()
pd.describe()
pd.info()

pd.read_csv()
pd.read_excel()
pd.read_json()
pd.read_html()
pd.read_sql()
pd.read_pickle()

Dataframe functions

loc and iloc functions

    loc: name/lable index
    iloc: number index

Dataframe methods

Work with Excel files

# import pandas as pd
pd.read_excel("data.xlsx")
pd.ExcelFile("data.xlsx")

Work with CSV file

import pandas as pd
import sys

def usage():
    """Print syntax"""
    print("Usage: ", sys.argv[0], "<path-to-csv-file>")

def main():
    if len(sys.argv) != 2:
        usage()
        sys.exit(1)
    
    filename = sys.argv[1]
    signin = pd.read_csv(filename)
    print(signin.info())
    print(signin.head())
    print(signin.tail())

if __name__ == "__main__":
    main()

Courses

Data Manipulation with pandas | DataCamp

References

>>> Pandas Cheatsheet

>>> Pandas Tutorial

Popular posts from this blog

hmailserver: Notes from the field

hmailserver is one of free open source mail servers running on Microsoft Windows operating system.

stegseek: Notes

StegCracker has been retired following the release of StegSeek, which will blast through the rockyou.txt wordlist within 1.9 second as opposed to StegCracker which takes ~5 hours.

Install Microsoft .NET Core SDK

.NET Core Software Development Kit (SDK) is a set of libraries and tools that allow developers to create .NET Core applications and libraries.