Ebook: Python Data Science Handbook: Essential Tools for Working with Data
Author: Jake VanderPlas
- Genre: Computers // Programming
- Tags: Machine Learning, Decision Trees, Data Science, Classification, Principal Component Analysis, Support Vector Machines, Data Visualization, Feature Engineering, Hyperparameter Tuning, Linear Regression, scikit-learn, NumPy, matplotlib, pandas, Jupyter, Random Forest, Naïve Bayes, Seaborn, Manifold Learning
- Year: 2023
- Publisher: O'Reilly Media
- City: Sebastopol, CA
- Edition: 2
- Language: English
- pdf
Python is a first-class tool for many researchers, primarily because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the new edition of Python Data Science Handbook do you get them all—IPython, NumPy, pandas, Matplotlib, Scikit-Learn, and other related tools.
Working scientists and data crunchers familiar with reading and writing Python code will find the second edition of this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is the must-have reference for scientific computing in Python.
With this handbook, you'll learn how:
- IPython and Jupyter provide computational environments for scientists using Python
- NumPy includes the ndarray for efficient storage and manipulation of dense data arrays
- Pandas contains the DataFrame for efficient storage and manipulation of labeled/columnar data
- Matplotlib includes capabilities for a flexible range of data visualizations
- Scikit-learn helps you build efficient and clean Python implementations of the most important and established machine learning algorithms