NYOUG 2018 Summer General Meeting: Introduction of Jupyter Notebook And Packages for Data Analysis and Plotting

Introduction of Jupyter Notebook And Packages for Data Analysis and Plotting

Jupyter is an interactive development tool for Python related analytics, for rapid prototyping. The foundation of ML is Linear Algebra, the great chunk of it is pertaining to matrix operation, NumPy is not directly exposed but they are the dependency of some typical ML packages, i.e. scikit learn. Pandas DataFrame – Operation on data sets to help automation and analysis on the data sets. Present a Pandas analysis example with public travel data, demonstrate data wrangling and plotting features.

Linda Li
Linda Li – Joined the Priceline Data Lake Service team in 2014, is a senior data warehouse analyst. Has worked on Oracle DBA, informatica, Business Intelligence in various industries – Reinsurance, Bank, ecommerce and worked with systems like SAP Business warehouse, Oracle EBS, EDW, OBI warehouse. Worked with various business intelligence tools – SAP Business Objects, Oracle Discoverer, OBIEE, Tableau. Has interest and passion in data and most cutting-edge technologies. In leisure time, Linda loves music appreciation and loves watching the young generation practice the piano and play in recitals, Chamber Music. Linda is a graduate of New Jersey Institute of Technology, major in Information systems.