
Machine Learning Guide
Machine learning audio course, teaching the fundamentals of machine learning and artificial intelligence. It covers intuition, models (shallow and deep), math, languages, frameworks, etc. Where your other ML resources provide the trees, I provide the forest. Consider MLG your syllabus, with highly-curated resources for each episode's details at ocdevel.com. Audio is a great supplement during exercise, commute, chores, etc.
Episodes
MLA 030 AI Job Displacement & ML Careers
ML engineering demand remains high with a 3.2 to 1 job-to-candidate ratio, but entry-level hiring is collapsing as AI automates routine programming and data tasks. Career longevity requires shifting from model training to production operations, deep domain expertise, and mastering AI-augmented workflows before standard implementation becomes a commodity. Links Notes and resources at ocdevel.com/
MLA 029 OpenClaw
OpenClaw is a self-hosted AI agent daemon that executes autonomous tasks through messaging apps like WhatsApp and Telegram using persistent memory. It integrates with Claude Code to enable software development and administrative automation directly from mobile devices. Links Notes and resources at ocdevel.com/mlg/mla-29 Try a walking desk - stay healthy & sharp while you learn & code Generate
MLA 028 AI Agents
AI agents differ from chatbots by pursuing autonomous goals through the ReACT loop rather than responding to turn-based prompts. While coding agents are currently the most reliable due to verifiable feedback loops, the market is expanding into desktop and browser automation via tools like Claude co-work and open claw. Links Notes and resources at ocdevel.com/mlg/mla-28 Try a walking desk - stay
MLA 027 AI Video End-to-End Workflow
How to maintain character consistency, style consistency, etc in an AI video. Prosumers can use Google Veo 3's "High-Quality Chaining" for fast social media content. Indie filmmakers can achieve narrative consistency by combining Midjourney V7 for style, Kling for lip-synced dialogue, and Runway Gen-4 for camera control, while professional studios gain full control with a layered ComfyUI pipeline
MLA 026 AI Video Generation: Veo 3 vs Sora, Kling, Runway, Stable Video Diffusion
Google Veo leads the generative video market with superior 4K photorealism and integrated audio, an advantage derived from its YouTube training data. OpenAI Sora is the top tool for narrative storytelling, while Kuaishou Kling excels at animating static images with realistic, high-speed motion. Links Notes and resources at ocdevel.com/mlg/mla-26 Try a walking desk - stay healthy & sharp while y
MLA 025 AI Image Generation: Midjourney vs Stable Diffusion, GPT-4o, Imagen & Firefly
The AI image market has split: Midjourney creates the highest quality artistic images but fails at text and precision. For business use, OpenAI's GPT-4o offers the best conversational control, while Adobe Firefly provides the strongest commercial safety from its exclusively licensed training data. Links Notes and resources at ocdevel.com/mlg/mla-25 Try a walking desk - stay healthy & sharp whil
MLG 036 Autoencoders
Auto encoders are neural networks that compress data into a smaller "code," enabling dimensionality reduction, data cleaning, and lossy compression by reconstructing original inputs from this code. Advanced auto encoder types, such as denoising, sparse, and variational auto encoders, extend these concepts for applications in generative modeling, interpretability, and synthetic data generation. Li
MLG 035 Large Language Models 2
At inference, large language models use in-context learning with zero-, one-, or few-shot examples to perform new tasks without weight updates, and can be grounded with Retrieval Augmented Generation (RAG) by embedding documents into vector databases for real-time factual lookup using cosine similarity. LLM agents autonomously plan, act, and use external tools via orchestrated loops with persisten
MLG 034 Large Language Models 1
Explains language models (LLMs) advancements. Scaling laws - the relationships among model size, data size, and compute - and how emergent abilities such as in-context learning, multi-step reasoning, and instruction following arise once certain scaling thresholds are crossed. The evolution of the transformer architecture with Mixture of Experts (MoE), describes the three-phase training process cul
MLA 024 Agentic Software Engineering
Agentic engineering shifts the developer role from manual coding to orchestrating AI agents that automate the full software lifecycle from ticket to deployment. Using Claude Code with MCP servers and git worktrees allows a single person to manage the output and quality of an entire engineering organization. Links Notes and resources at ocdevel.com/mlg/mla-24 Try a walking desk - stay healthy &
MLA 023 Claude Code Components
Claude Code distinguishes itself through a deterministic hook system and model-invoked skills that maintain project consistency better than visual-first tools like Cursor. Its multi-surface architecture allows developers to move sessions between CLI, web sandboxes, and mobile while maintaining persistent context. Links Notes and resources at ocdevel.com/mlg/mla-23 Try a walking desk - stay heal
MLA 022 Vibe Coding
Andrej Karpathy coined "vibe coding" in February 2025 - a year later, 41% of all code is AI-generated, agents run multi-hour tasks autonomously, and the developer role has shifted from writing code to orchestrating systems. Links Notes and resources at ocdevel.com/mlg/mla-22 Try a walking desk - stay healthy & sharp while you learn & code Generate a podcast - use my voice to listen to any AI g
MLG 033 Transformers
Links: Notes and resources at ocdevel.com/mlg/33 3Blue1Brown videos: https://3blue1brown.com/ Try a walking desk stay healthy & sharp while you learn & code Try Descript audio/video editing with AI power-tools Background & Motivation RNN Limitations: Sequential processing prevents full parallelization—even with attention tweaks—making them inefficient on modern hardware. Breakthrough: "Attent
MLA 021 Databricks: Cloud Analytics and MLOps
Databricks is a cloud-based platform for data analytics and machine learning operations, integrating features such as a hosted Spark cluster, Python notebook execution, Delta Lake for data management, and seamless IDE connectivity. Raybeam utilizes Databricks and other ML Ops tools according to client infrastructure, scaling needs, and project goals, favoring Databricks for its balanced feature s
MLA 020 Kubeflow and ML Pipeline Orchestration on Kubernetes
Machine learning pipeline orchestration tools, such as SageMaker and Kubeflow, streamline the end-to-end process of data ingestion, model training, deployment, and monitoring, with Kubeflow providing an open-source, cross-cloud platform built atop Kubernetes. Organizations typically choose between cloud-native managed services and open-source solutions based on required flexibility, scalability,
MLA 019 Cloud, DevOps & Architecture
The deployment of machine learning models for real-world use involves a sequence of cloud services and architectural choices, where machine learning expertise must be complemented by DevOps and architecture skills, often requiring collaboration with professionals. Key concepts discussed include infrastructure as code, cloud container orchestration, and the distinction between DevOps and architect
MLA 017 AWS Local Development Environment
AWS development environments for local and cloud deployment can differ significantly, leading to extra complexity and setup during cloud migration. By developing directly within AWS environments, using tools such as Lambda, Cloud9, SageMaker Studio, client VPN connections, or LocalStack, developers can streamline transitions to production and leverage AWS-managed services from the start. This epi
MLA 016 AWS SageMaker MLOps 2
SageMaker streamlines machine learning workflows by enabling integrated model training, tuning, deployment, monitoring, and pipeline automation within the AWS ecosystem, offering scalable compute options and flexible development environments. Cloud-native AWS machine learning services such as Comprehend and Poly provide off-the-shelf solutions for NLP, time series, recommendations, and more, red
MLA 015 AWS SageMaker MLOps 1
SageMaker is an end-to-end machine learning platform on AWS that covers every stage of the ML lifecycle, including data ingestion, preparation, training, deployment, monitoring, and bias detection. The platform offers integrated tools such as Data Wrangler, Feature Store, Ground Truth, Clarify, Autopilot, and distributed training to enable scalable, automated, and accessible machine learning oper
MLA 014 Machine Learning Hosting and Serverless Deployment
Builders can scale ML from simple API calls to full MLOps pipelines using SST on AWS, utilizing Aurora pgvector for search and Spot instances for 90 percent cost savings. External platforms like Modal or GCP Cloud Run provide superior serverless GPU options for real-time inference when AWS native limits are reached. Links Notes and resources at ocdevel.com/mlg/mla-14 Try a walking desk - stay h
MLA 013 Tech Stack for Customer-Facing Machine Learning Products
Primary technology recommendations for building a customer-facing machine learning product include React and React Native for the front end, serverless platforms like AWS Amplify or GCP Firebase for authentication and basic server/database needs, and Postgres as the relational database of choice. Serverless approaches are encouraged for scalability and security, with traditional server frameworks
MLA 012 Docker for Machine Learning Workflows
Docker enables efficient, consistent machine learning environment setup across local development and cloud deployment, avoiding many pitfalls of virtual machines and manual dependency management. It streamlines system reproduction, resource allocation, and GPU access, supporting portability and simplified collaboration for ML projects. Machine learning engineers benefit from using pre-built Docke
MLG 032 Cartesian Similarity Metrics
Try a walking desk to stay healthy while you study or work! Show notes at ocdevel.com/mlg/32. L1/L2 norm, Manhattan, Euclidean, cosine distances, dot product Normed distances link A norm is a function that assigns a strictly positive length to each vector in a vector space. link Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below. L1: Manhattan/city-block/taxicab. abs
MLA 011 Practical Clustering Tools
Primary clustering tools for practical applications include K-means using scikit-learn or Faiss, agglomerative clustering leveraging cosine similarity with scikit-learn, and density-based methods like DBSCAN or HDBSCAN. For determining the optimal number of clusters, silhouette score is generally preferred over inertia-based visual heuristics, and it natively supports pre-computed distance matric
MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK
The landscape of Python natural language processing tools has evolved from broad libraries like NLTK toward more specialized packages such as Gensim for topic modeling, SpaCy for linguistic analysis, and Hugging Face Transformers for advanced tasks, with Sentence Transformers extending transformer models to enable efficient semantic search and clustering. Each library occupies a distinct place in
MLA 009 Charting and Visualization Tools for Data Science
Python charting libraries - Matplotlib, Seaborn, and Bokeh - explaining, their strengths from quick EDA to interactive, HTML-exported visualizations, and clarifies where D3.js fits as a JavaScript alternative for end-user applications. It also evaluates major software solutions like Tableau, Power BI, QlikView, and Excel, detailing how modern BI tools now integrate drag-and-drop analytics with em
MLA 008 Exploratory Data Analysis (EDA)
Exploratory data analysis (EDA) sits at the critical pre-modeling stage of the data science pipeline, focusing on uncovering missing values, detecting outliers, and understanding feature distributions through both statistical summaries and visualizations, such as Pandas' info(), describe(), histograms, and box plots. Visualization tools like Matplotlib, along with processes including imputation a
MLA 007 Jupyter Notebooks
Jupyter Notebooks, originally conceived as IPython Notebooks, enable data scientists to combine code, documentation, and visual outputs in an interactive, browser-based environment supporting multiple languages like Python, Julia, and R. This episode details how Jupyter Notebooks structure workflows into executable cells - mixing markdown explanations and inline charts - which is essential for do
MLA 006 Salaries for Data Science & Machine Learning
O'Reilly's 2017 Data Science Salary Survey finds that location is the most significant salary determinant for data professionals, with median salaries ranging from $134,000 in California to under $30,000 in Eastern Europe, and highlights that negotiation skills can lead to salary differences as high as $45,000. Other key factors impacting earnings include company age and size, job title, industry
MLA 005 Shapes and Sizes: Tensors and NDArrays
Explains the fundamental differences between tensor dimensions, size, and shape, clarifying frequent misconceptions—such as the distinction between the number of features ("columns") and true data dimensions—while also demystifying reshaping operations like expand_dims, squeeze, and transpose in NumPy. Through practical examples from images and natural language processing, listeners learn how to
MLA 003 Storage: HDF, Pickle, Postgres
Practical workflow of loading, cleaning, and storing large datasets for machine learning, moving from ingesting raw CSVs or JSON files with pandas to saving processed datasets and neural network weights using HDF5 for efficient numerical storage. It clearly distinguishes among storage options—explaining when to use HDF5, pickle files, or SQL databases—while highlighting how libraries like pandas,
MLA 002 Numpy & Pandas
NumPy enables efficient storage and vectorized computation on large numerical datasets in RAM by leveraging contiguous memory allocation and low-level C/Fortran libraries, drastically reducing memory footprint compared to native Python lists. Pandas, built on top of NumPy, introduces labelled, flexible tabular data manipulation—facilitating intuitive row and column operations, powerful indexing,
MLA 001 Degrees, Certificates, and Machine Learning Careers
While industry-respected credentials like Udacity Nanodegrees help build a practical portfolio for machine learning job interviews, they remain insufficient stand-alone qualifications—most roles require a Master's degree as a near-hard requirement, especially compared to more flexible web development fields. A Master's, such as Georgia Tech's OMSCS, not only greatly increases employability but is
MLG 029 Reinforcement Learning Intro
Notes and resources: ocdevel.com/mlg/29 Try a walking desk to stay healthy while you study or work! Reinforcement Learning (RL) is a fundamental component of artificial intelligence, different from purely being AI itself. It is considered a key aspect of AI due to its ability to learn through interactions with the environment using a system of rewards and punishments. Links: openai/baselines r
MLG 028 Hyperparameters 2
Notes and resources: ocdevel.com/mlg/28 Try a walking desk to stay healthy while you study or work! More hyperparameters for optimizing neural networks. A focus on regularization, optimizers, feature scaling, and hyperparameter search methods. Hyperparameter Search Techniques Grid Search involves testing all possible permutations of hyperparameters, but is computationally exhaustive and suite
MLG 027 Hyperparameters 1
Full notes and resources at ocdevel.com/mlg/27 Try a walking desk to stay healthy while you study or work! Hyperparameters are crucial elements in the configuration of machine learning models. Unlike parameters, which are learned by the model during training, hyperparameters are set by humans before the learning process begins. They are the knobs and dials that humans can control to influence
MLG 026 Project Bitcoin Trader
Try a walking desk to stay healthy while you study or work! Ful notes and resources at ocdevel.com/mlg/26 NOTE. This episode is no longer relevant, and tforce_btc_trader no longer maintained. The current podcast project is Gnothi. Episode Overview TForce BTC Trader Project: Trading Crypto Special: Intuitively highlights decisions: hypers, supervised v reinforcement, LSTM v CNN Crypto (v st
MLG 025 Convolutional Neural Networks
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/25 Filters and Feature Maps: Filters are small matrices used to detect visual features from an input image by applying them to local pixel patches, creating a 3D output called a feature map. Each filter is tasked with recognizing a specific pattern (e.g., edges, textures) in the input images.
MLG 024 Tech Stack
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/24 Hardware Desktop if you're stationary, as you'll get the best performance bang-for-buck and improved longevity; laptop if you're mobile. Desktops. Build your own PC, better value than pre-built. See PC Part Picker, make sure to use an Nvidia graphics card. Generally shoot for 2nd-best of CPUs/
MLG 023 Deep NLP 2
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/23 Neural Network Types in NLP Vanilla Neural Networks (Feedforward Networks): Used for general classification or regression tasks. Examples include predicting housing costs or classifying images as cat, dog, or tree. Convolutional Neural Networks (CNNs): Primarily used for image-related t
MLG 022 Deep NLP 1
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/22 Deep NLP Fundamentals Deep learning has had a profound impact on natural language processing by introducing models like recurrent neural networks (RNNs) that are specifically adept at handling sequential data. Unlike traditional linear models like linear regression, RNNs can address the comple
MLG 020 Natural Language Processing 3
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/20 NLP progresses through three main layers: text preprocessing, syntax tools, and high-level goals, each building upon the last to achieve complex linguistic tasks. Text Preprocessing Text preprocessing involves essential steps such as tokenization, stemming, and stop word removal. These foundat
MLG 019 Natural Language Processing 2
Try a walking desk to stay healthy while you study or work! Notes and resources at ocdevel.com/mlg/19 Classical NLP Techniques: Origins and Phases in NLP History: Initially reliant on hardcoded linguistic rules, NLP's evolution significantly pivoted with the introduction of machine learning, particularly shallow learning algorithms, leading eventually to deep learning, which is the current s
MLG 018 Natural Language Processing 1
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/18 Overview: Natural Language Processing (NLP) is a subfield of machine learning that focuses on enabling computers to understand, interpret, and generate human language. It is a complex field that combines linguistics, computer science, and AI to process and analyze large amounts of natural language data
MLG 017 Checkpoint
Try a walking desk to stay healthy while you study or work! At this point, browse #importance:essential on ocdevel.com/mlg/resources with the 45m/d ML, 15m/d Math breakdown.
MLG 016 Consciousness
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/16 Inspiration in AI Development Early inspirations for AI development centered around solving challenging problems, but recent advancements like self-driving cars and automated scientific discoveries attract professionals due to potential economic automation and career opportunities. The Singularity The
MLG 015 Performance
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/15 Concepts Performance Evaluation Metrics: Tools to assess how well a machine learning model performs tasks like spam classification, housing price prediction, etc. Common metrics include accuracy, precision, recall, F1/F2 scores, and confusion matrices. Accuracy: The simplest measure of performance, in
MLG 014 Shallow Algos 3
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/14 Anomaly Detection Systems Applications: Credit card fraud detection and server activity monitoring. Concept: Identifying outliers on a bell curve. Statistics: Central role of the Gaussian distribution (normal distribution) in detecting anomalies. Process: Identifying significant deviations from the mean
MLG 013 Shallow Algos 2
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/13 Support Vector Machines (SVM) Purpose: Classification and regression. Mechanism: Establishes decision boundaries with maximum margin. Margin: The thickness of the decision boundary, large margin minimizes overfitting. Support Vectors: Data points that the margin directly affects. Kernel Trick: Project
MLG 012 Shallow Algos 1
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/12 Topics Shallow vs. Deep Learning: Shallow learning can often solve problems more efficiently in time and resources compared to deep learning. Supervised Learning: Key algorithms include linear regression, logistic regression, neural networks, and K Nearest Neighbors (KNN). KNN is unique as it is ins
MLG 010 Languages & Frameworks
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/10 Topics: Recommended Languages and Frameworks: Python and TensorFlow are top recommendations for machine learning. Python's versatile libraries (NumPy, Pandas, Scikit-Learn) enable it to cover all areas of data science including data mining, analytics, and machine learning. Language Choices: C/C+
MLG 009 Deep Learning
Try a walking desk to stay healthy while you study or work! Full notes at ocdevel.com/mlg/9 Key Concepts: Deep Learning vs. Shallow Learning: Machine learning is broken down hierarchically into AI, ML, and subfields like supervised/unsupervised learning. Deep learning is a specialized area within supervised learning distinct from shallow learning algorithms like linear regression. Neural Netwo
MLG 008 Math for Machine Learning
Mathematics essential for machine learning includes linear algebra, statistics, and calculus, each serving distinct purposes: linear algebra handles data representation and computation, statistics underpins the algorithms and evaluation, and calculus enables the optimization process. It is recommended to learn the necessary math alongside or after starting with practical machine learning tasks, u
MLG 007 Logistic Regression
The logistic regression algorithm is used for classification tasks in supervised machine learning, distinguishing items by class (such as "expensive" or "not expensive") rather than predicting continuous numerical values. Logistic regression applies a sigmoid or logistic function to a linear regression model to generate probabilities, which are then used to assign class labels through a process i
MLG 006 Certificates & Degrees
People interested in machine learning can choose between self-guided learning, online certification programs such as MOOCs, accredited university degrees, and doctoral research, with industry acceptance and personal goals influencing which path is most appropriate. Industry employers currently prioritize a strong project portfolio over non-accredited certificates, and while master's degrees carry
MLG 005 Linear Regression
Linear regression is introduced as the foundational supervised learning algorithm for predicting continuous numeric values, using cost estimation of Portland houses as an example. The episode explains the three-step process of machine learning - prediction via a hypothesis function, error calculation with a cost function (mean squared error), and parameter optimization through gradient descent -
MLG 004 Algorithms - Intuition
Machine learning consists of three steps: prediction, error evaluation, and learning, implemented by training algorithms on large datasets to build models that can make decisions or classifications. The primary categories of machine learning algorithms are supervised, unsupervised, and reinforcement learning, each with distinct methodologies for learning from data or experience. Links Notes and
MLG 003 Inspiration
AI is rapidly transforming both creative and knowledge-based professions, prompting debates on economic disruption, the future of work, the singularity, consciousness, and the potential risks associated with powerful autonomous systems. Philosophical discussions now focus on the socioeconomic impact of automation, the possibility of a technological singularity, the nature of machine consciousness
MLG 002 Difference Between Artificial Intelligence, Machine Learning, Data Science
Artificial intelligence is the automation of tasks that require human intelligence, encompassing fields like natural language processing, perception, planning, and robotics, with machine learning emerging as the primary method to recognize patterns in data and make predictions. Data science serves as the overarching discipline that includes artificial intelligence and machine learning, focusing
MLG 001 Introduction
Show notes: ocdevel.com/mlg/1. MLG teaches the fundamentals of machine learning and artificial intelligence. It covers intuition, models, math, languages, frameworks, etc. Where your other ML resources provide the trees, I provide the forest. Consider MLG your syllabus, with highly-curated resources for each episode's details at ocdevel.com. Audio is a great supplement during exercise, commute, ch
Recommended

1-2-3 Learn Spanish with Me!

128 Civics Questions for U.S. Citizenship Test

12 Hour Sound Machines for Sleep (no loops or fades)

#12minconvos

12 Minute Meditation

12 Rules for Life: An Antidote to Chaos by Jordan B. Peterson, Book Summary, Podcast, English

1440 Explores

1490 Doom - Lore Series Podcast

15 MINS OF FAME

15 Minute Mysteries: The Deep Dive

15 minutes de grâce et de vérité

15 Minutes of Infamy