Category Archives: Machine Learning

Parrando’s Paradox Using JavaScript

I was preparing to start a project that will use JavaScript. To get ready, I coded up a few short demos of my favorite problems, using JavaScript. One of these favorite problems is Parrando’s Paradox. It’s one of the most … Continue reading

Posted in JavaScript, Machine Learning | Leave a comment

Why I Dislike XGBoost and Why I Like XGBoost

First, the title of this blog post is moderately click-bait. I dislike many charateristics of XGBoost but I like some of them too. XGBoost (“extreme gradient boost”) is a huge library of many functions, with hundreds of parameters and possible … Continue reading

Posted in Machine Learning | 2 Comments

Tomek Links for Pruning Imbalanced Data

Imbalanced data occurs when you have machine learning training data with many items of one class and very few items of the other class. For example, some medical data might have many thousands of data items that are “no disease” … Continue reading

Posted in Machine Learning | Leave a comment

Researchers Explore Intelligent Sampling of Huge ML Datasets to Reduce Costs and Maintain Model Fairness

I contributed to an article titled “Researchers Explore Intelligent Sampling of Huge ML Datasets to Reduce Costs and Maintain Model Fairness” in the May 2021 edition of the online Pure AI site. See https://pureai.com/articles/2021/05/03/intelligent-ai-sampling.aspx. Researchers devised a new technique to … Continue reading

Posted in Machine Learning | Leave a comment

Neural Networks, Dogs, and JavaScript

I was walking my two dogs, Riley and Kevin, early one wet Pacific Northwest Saturday morning. I enjoy walking and thinking while my dogs do their dog-thing and look for rabbits. My dogs have never caught a rabbit but they’re … Continue reading

Posted in Machine Learning | Leave a comment

A Minimal Binary Search Tree Example Using Python

I don’t use tree data structures very often. When I expect to use a tree, I usually refresh my memory by coding a minimal example — coding trees is tricky and a warm-up example saves time in the long run. … Continue reading

Posted in Machine Learning | Leave a comment

Generating Synthetic ‘1’ Digits for the UCI Digits Dataset Using a Variational Autoencoder

A variational autonencoder (VAE) is a deep neural system that can generate synthetic data items. One possible use of a VAE is to generate synthetic minority class items (those with very few instances) for an imbalanced dataset. At least in … Continue reading

Posted in Machine Learning | 2 Comments

Automatic Stopping for Logistic Regression Training

A few days ago I did some thought experiments about different schemes to automatically stop training a logistic regression model. I was motivated by the poor performance of the sckit library LogisticRegression model with default parameters. I coded up a … Continue reading

Posted in Machine Learning | Leave a comment

Using Reinforcement Learning for Anomaly Detection

I often think about new machine learning ideas in two different, but related ways. One approach is to look at a specific, practical problem and then mentally examine my collection of ML techniques to see if I have a way … Continue reading

Posted in Machine Learning | Leave a comment

Some Thoughts About Dealing With Imbalanced Training Data

Suppose you have a binary classification problem where there are many of one class, but very few of the other class. For example, with medical data, you might have many thousands of data items representing people, that are class 0 … Continue reading

Posted in Machine Learning | Leave a comment