Pandas is a Python library used for working with data sets. Pandas is derived from 'panel data' term.
https://pandas.pydata.org/ - Python Data Analysis Library.
Keyword: Dataframe
# Install pandas python -m pip install pandas pd.head() pd.describe() pd.info() pd.read_csv() pd.read_excel() pd.read_json() pd.read_html() pd.read_sql() pd.read_pickle()
Dataframe functions
loc and iloc functions
- loc: name/lable index
- iloc: number index
Dataframe methods
Work with Excel files
# import pandas as pd pd.read_excel("data.xlsx") pd.ExcelFile("data.xlsx")
Work with CSV file
import pandas as pd import sys def usage(): """Print syntax""" print("Usage: ", sys.argv[0], "<path-to-csv-file>") def main(): if len(sys.argv) != 2: usage() sys.exit(1) filename = sys.argv[1] signin = pd.read_csv(filename) print(signin.info()) print(signin.head()) print(signin.tail()) if __name__ == "__main__": main()
Courses
Data Manipulation with pandas | DataCamp
References
>>> Pandas Tutorial