Latest Internet News

Machine Learning 101: A Guide for Non-Technical Executives

Artificial Intelligence (AI) is a field of computer science that focuses on the creation of intelligent machines that function as humans. Machine Learning (ML) is a subset of AI that uses statistical techniques that enable computers to learn from data without human intervention. There are three categories of Machine Learning methods:

  1. Supervised Machine Learning: with this method, the computer trains itself on a labeled data set. Supervised ML requires less training data and makes training easier since results can be compared to labeled results. The drawback is the cost involved in preparing and labeling data and the risk of creating a model that’s too similar and biased to the training data. When this happens, variations in data aren’t interpreted accurately.
  2. Unsupervised Machine Learning: with this method, the computer consumes large, unlabeled data sets and extracts meaningful flags using algorithms to label, sort, and classify data in real-time without human intervention. Unsupervised ML focuses more on identifying patterns and relationships than automating decisions and predictions.
  3. Semi-Supervised Learning: this method falls neatly between Supervised and Unsupervised ML. Semi-Supervised ML uses a small labeled dataset to classify and feature extractions from a larger, unlabeled dataset.


How does Machine Learning work?

Data scientists use four basic steps to build machine learning applications:

1. Prepare a Data Training Set

Training data is labeled data that represents the data the ML model will interpret and solve. It’s tagged with features and classifications that the ML model will identify. Sometimes training data is unlabeled and the model has to figure out how to extract features and assign classifications. Preparing training data means randomizing, de-duplicating, and checking for inaccuracies or biases. Training data should also be divided into a training subset to train the application, and an evaluation subset to test and refine the application.

2. Choose an Algorithm

An algorithm is a set of processing steps, and the type of algorithm implemented is determined by the type and amount of training data as well as the type of problem being solved. There are several common types of ML algorithms used with labeled training data. Regression algorithms understand relationships in data by predicting the value of a dependent variable based on the value of an independent variable. Decision trees make recommendations based on a set of decision rules. Instance-based algorithms estimate the likelihood that a data point is a member of a certain group based on its proximity to other data points.

There are also several common types of ML algorithms used with unlabeled training data. Clustering algorithms identify groups of similar records and label them with their appropriate group. Association algorithms identify patterns and relationships to identify association rules or ‘if-then’ relationships. Neural networks define layered networks of calculations that consume, interpret, and deliver conclusions about data. Each layer refines the results of the previous layer in a process known as deep learning.


3. Train the Algorithm

Training an algorithm is a multi-step process. Variables run through the algorithm and output is compared with the expected results. Weights and biases are adjusted to get more accurate results, then variables are run again until the output matches the expected results more times than not. The trained, accurate algorithm is now the Machine Learning model.

4. Use and Improve the Model

Lastly, it’s important to use new data with the ML model and improve its accuracy and effectiveness. The source of the new data depends on the nature of the problem being solved.

Gartner Data Science recognizes several software companies as leaders in the field of data science and Machine Learning platforms. These companies create software solutions that include reporting and modern BI to predictive analytics and streaming analytics that help businesses compete and succeed.


Jeffery D. Silvers
Love and share my articles, I will be happy to react on it ! Spent 2002-2009 promoting weed whackers in Edison, NJ. Earned praise for importing junk food for fun and profit. Spent 2001-2006 exporting teddy bears in Atlantic City, NJ. Had some great experience investing in tattoos in Fort Walton Beach, FL. Spent 2002-2007 selling action figures in the aftermarket. Enthusiastic about working on basketballs on the black market.