Machine Learning Engineering with Python [Second Edition] 9781837631964

Transform your machine learning projects into successful deployments with this practical guide on how to build and scale

145 91 19MB

English Pages 601 Year 2023

Table of contents :
Preface
Who this book is for
What this book covers
To get the most out of this book
Get in touch
Introduction to ML Engineering
Technical requirements
Defining a taxonomy of data disciplines
Data scientist
ML engineer
ML operations engineer
Data engineer
Working as an effective team
ML engineering in the real world
What does an ML solution look like?
Why Python?
High-level ML system design
Example 1: Batch anomaly detection service
Example 2: Forecasting API
Example 3: Classification pipeline
Summary
The Machine Learning Development Process
Technical requirements
Setting up our tools
Setting up an AWS account
Concept to solution in four steps
Comparing this to CRISP-DM
Discover
Using user stories
Play
Develop
Selecting a software development methodology
Package management (conda and pip)
Poetry
Code version control
Git strategies
Model version control
Deploy
Knowing your deployment options
Understanding DevOps and MLOps
Building our first CI/CD example with GitHub Actions
Continuous model performance testing
Continuous model training
Summary
From Model to Model Factory
Technical requirements
Defining the model factory
Learning about learning
Defining the target
Cutting your losses
Preparing the data
Engineering features for machine learning
Engineering categorical features
Engineering numerical features
Designing your training system
Training system design options
Train-run
Train-persist
Retraining required
Detecting data drift
Detecting concept drift
Setting the limits
Diagnosing the drift
Remediating the drift
Other tools for monitoring
Automating training
Hierarchies of automation
Optimizing hyperparameters
Hyperopt
Optuna
AutoML
auto-sklearn
AutoKeras
Persisting your models
Building the model factory with pipelines
Scikit-learn pipelines
Spark ML pipelines
Summary
Packaging Up
Technical requirements
Writing good Python
Recapping the basics
Tips and tricks
Adhering to standards
Writing good PySpark
Choosing a style
Object-oriented programming
Functional programming
Packaging your code
Why package?
Selecting use cases for packaging
Designing your package
Building your package
Managing your environment with Makefiles
Getting all poetic with Poetry
Testing, logging, securing, and error handling
Testing
Securing your solutions
Analyzing your own code for security issues
Analyzing dependencies for security issues
Logging
Error handling
Not reinventing the wheel
Summary
Deployment Patterns and Tools
Technical requirements
Architecting systems
Building with principles
Exploring some standard ML patterns
Swimming in data lakes
Microservices
Event-based designs
Batching
Containerizing
Hosting your own microservice on AWS
Pushing to ECR
Hosting on ECS
Building general pipelines with Airflow
Airflow
Airflow on AWS
Revisiting CI/CD for Airflow
Building advanced ML pipelines
Finding your ZenML
Going with the Kubeflow
Selecting your deployment strategy
Summary
Scaling Up
Technical requirements
Scaling with Spark
Spark tips and tricks
Spark on the cloud
AWS EMR example
Spinning up serverless infrastructure
Containerizing at scale with Kubernetes
Scaling with Ray
Getting started with Ray for ML
Scaling your compute for Ray
Scaling your serving layer with Ray
Designing systems at scale
Summary
Deep Learning, Generative AI, and LLMOps
Going deep with deep learning
Getting started with PyTorch
Scaling and taking deep learning into production
Fine-tuning and transfer learning
Living it large with LLMs
Understanding LLMs
Consuming LLMs via API
Coding with LLMs
Building the future with LLMOps
Validating LLMs
PromptOps
Summary
Building an Example ML Microservice
Technical requirements
Understanding the forecasting problem
Designing our forecasting service
Selecting the tools
Training at scale
Serving the models with FastAPI
Response and request schemas
Managing models in your microservice
Pulling it all together
Containerizing and deploying to Kubernetes
Containerizing the application
Scaling up with Kubernetes
Deployment strategies
Summary
Building an Extract, Transform, Machine Learning Use Case
Technical requirements
Understanding the batch processing problem
Designing an ETML solution
Selecting the tools
Interfaces and storage
Scaling of models
Scheduling of ETML pipelines
Executing the build
Building an ETML pipeline with advanced Airflow features
Summary
Other Books You May Enjoy
Index

Machine Learning Engineering with Python [Second Edition]
9781837631964

Author / Uploaded
Andrew P. McMahon

Similar Topics
Computers
Algorithms and Data Structures: Pattern Recognition

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

Recommend Papers

Machine Learning with Python 1824213629

1,148 146 3MB Read more

Interpretable Machine Learning with Python: Build explainable, fair and robust high-performance models [Second Edition] 9781803235424

A deep dive into the key aspects and challenges of machine learning interpretability using a comprehensive toolkit, incl

99 96 44MB Read more

Machine Learning with Python: A Practical Beginners’ Guide (Machine Learning with Python for Beginners Book 2)

Ready to add Machine Learning to your skill stack? As the second title in the Machine Learning From Scratch series, this

282 111 2MB Read more

Machine Learning with Python 9789386551931, 9386551934

Develop and Implement your own Machine Learning Models to solve real-world problemsKey Features• Introduction to Machine

122 85 2MB Read more

Hacker’s Guide to Machine Learning with Python

1,083 139 17MB Read more

Learning Python, Second Edition [Second Edition] 9780596002817, 0596002815

Learning Python is an introduction to the increasingly popular Python programming language. Python is an interpreted, in

548 1 1MB Read more

Python Machine Learning for Beginners: All You Need to Know about Machine Learning with Python

Have you thought about a career in data science? It's where the money is right now, and it's only going to bec

228 126 194KB Read more

Python Machine Learning 7666666675

842 122 14MB Read more

TensorFlow Machine Learning Cookbook: Over 60 recipes to build intelligent machine learning systems with the power of Python, 2nd Edition [Second edition] 9781789131680, 1789131685, 9781789130768, 178913076X

Key Features Your quick guide to implementing TensorFlow in your day-to-day machine learning activities Learn advanced t

921 80 8MB Read more

Python Machine Learning - Second Edition [2nd ed] 9781787125933, 1787125939, 9781787126022, 1787126021

Key Features A practical approach to the frameworks of data science, machine learning, and deep learning Use the most po

112 37 41MB Read more