{"id":169,"date":"2021-06-29T14:10:09","date_gmt":"2021-06-29T14:10:09","guid":{"rendered":"https:\/\/jenniferkwentoh.com\/?p=169"},"modified":"2021-11-29T13:40:02","modified_gmt":"2021-11-29T13:40:02","slug":"what-is-machine-learning-fundamentals","status":"publish","type":"post","link":"https:\/\/jenniferkwentoh.com\/what-is-machine-learning-fundamentals\/","title":{"rendered":"What is Machine Learning"},"content":{"rendered":"\n

Fundamentals of Machine Learning<\/strong><\/h2>\n\n\n\n

Machine learning (ML)<\/strong> is beyond an internet buzzword, and the possibilities it promises are nothing short of fantasy and science friction.
At the end of this article, my objectives are to open you up to the world of machine learning.
We will start with some definitions, take a short trip in the machine learning memory lane, explore some exciting research.
From there, we will peep into “who is using ML in production<\/strong>” and see the prerequisites required to get started.
We will also discuss the different approaches to solving ML problems.
We will look at the platforms and frameworks used in coding for ML.
Lastly, we get to chat about what next?
This article is a long read, and you won’t be writing code. Grab something to drink or eat, sit back and enjoy \ud83d\ude42<\/p>\n\n\n\n

What is Machine Learning?<\/strong><\/h2>\n\n\n\n

I like to define Machine Learning (ML)<\/strong> as the science of getting a computer to learn to solve a problem by experience, just like humans without explicitly instructing it. Similar to how humans learn with experience, computers are fed with data instead.<\/p>\n\n\n\n

Machine learning is a sub-field of Artificial Intelligence (AI) that focuses on how computer algorithms automatically learn and improve their accuracy without being explicitly programmed.<\/em><\/pre>\n\n\n\n


The goal is for computers to learn with no human intervention.
Machine learning is a branch of Artificial Intelligence (AI) and Computer Science.
ML is used in diverse applications and found helpful in almost every field: medicine and health, transportation, education, entertainment, finance.<\/p>\n\n\n\n


ML thrives in solving problems that are extremely difficult for conventional programming logic. For example, Image classification.<\/p>\n\n\n\n

In the example below, classifying if an image is a dog or a cat.<\/h3>\n\n\n\n
\"\"
cats vs. dogs image classification<\/figcaption><\/figure>\n\n\n\n

Writing code to recognize and differentiate images can be complicated. Even If we find a way to perfectly describe the appearance of a dog in item A <\/strong>to the computer, what happens when we show the picture on Item B<\/strong> to the same algorithm?<\/p>\n\n\n\n

\"\"
what_is_machine_learning_dogs<\/figcaption><\/figure>\n\n\n\n


Not all dogs have a long tail. Some dogs are hairy, and some are not hairy.
The algorithm will be thousands of If-else statements<\/strong> if we try building it out using traditional programming. And yet, it won\u2019t still be accurate.<\/p>\n\n\n\n

\"\"
what_is_machine_learning_thousand_if_else_statement<\/figcaption><\/figure>\n\n\n\n


<\/p>\n\n\n\n

In image classification, the goal is to identify images of a particular object or thing accurately. Using traditional programming, some photos of dogs will be falsely classified as cats and vice versa.
Machine learning solves this using a different approach. Instead of instructing the computer on every step, we feed the computer many additional images of dogs and cats.
The algorithm learns over time the characteristics that genuinely describe a dog and a cat.<\/p>\n\n\n\n

Generalization<\/h3>\n\n\n\n


The goal of a machine learning model is not to memorize the data but to generalize. In Machine learning, algorithms train on a sample data known as training data. The model created is tested with a testing dataset.<\/p>\n\n\n\n


ML is closely related to some fields like Data Mining<\/strong> and Statistics<\/strong>. Some parts of ML focus on solving problems like predictive analysis, which is a statistical problem. But not all of the machine learning is statistics. Machine learning requires training and testing data, and so knowing how to mine the correct data is a plus for ML Engineers.
Other related fields are Linguistics, Data Engineering, and Software Engineering, etc.<\/p>\n\n\n\n

Why Machine Learning is Important?<\/strong><\/h2>\n\n\n\n

ML has been used to solve simple and basic tasks to challenging scientific problems. The ability to solve complex problems is one of the main attractions of machine learning.
The availability of high computational power and large volumes of data are the reasons behind the recent ML-powered products.<\/p>\n\n\n\n


ML is used in business operations to segment and understand customers\u2019 data. ML is used to optimize and automate routine tasks, mitigate risks, classify fraudulent and non-fraudulent scenarios, predict financial and market trends, recommend products to customers, improve existing tedious processes. Businesses have been able to enact intelligent decisions and stay ahead of the competition because of machine learning.<\/p>\n\n\n\n

Machine Learning History<\/strong><\/h2>\n\n\n\n

Just like every other scientific field, ML has a great history. From the popular Turing tests by Alan Turing in the 1950s, John McCarthy and the Dartmouth conference in Summer 1956, the dreadful AI winter in the 1980s, to the first AI (AlphaGo developed by Deepmind) that beat the world best GO player in 2016. And more recently, GPT-3 an autoregressive language model designed by OpenAI.<\/p>\n\n\n\n

\"alan
Alan Turing ~ image credit Wikipedia<\/figcaption><\/figure>\n\n\n\n


In the 1950s, Alan Turing created the famous “Turing Test.”
For a machine to pass the test, it has to convince a human that it is another human and not a computer.<\/p>\n\n\n\n

\"\"
Arthur Samuel: Image credit Google arts and culture<\/a><\/figcaption><\/figure>\n\n\n\n


Arthur Lee Samuel, a pioneer researcher in “game technology” and artificial intelligence, in 1952 designed the first computer game that learned as it played the game checkers. In 1959, he coined the name “machine learning<\/strong>.”<\/p>\n\n\n\n


The first artificial neural network was designed in 1958 by Frank Rosenblatt.
In the 1960s, Nilsson N. J.’s book on “Learning Machines” theorized pattern recognition and pattern classification possibilities.
The late 1980s and 1990s witnessed a massive collapse in the field. Research funding dried up. Big corporations that financed AI research withdrew interest.<\/p>\n\n\n\n

\"garry_kasparov_IBM_deep_blue_machine_learning\"
Garry Kasparov and the IBM deep blue. image credit <\/a>AFP<\/figcaption><\/figure>\n\n\n\n


However, things bounced back, and there was a big win in 1997 when the IBM deep blue, a chess-playing AI, beat the world champion, Garry Kasparov, in a chess game.
More and more groundbreaking research has since sprung up.<\/p>\n\n\n\n

Some Interesting research in Machine learning<\/strong><\/h2>\n\n\n\n

Since the end of AI winter, we have seen more exciting research. Thanks to high computing power and the internet that make big data available.<\/p>\n\n\n\n

AlexNet (2012) <\/h3>\n\n\n\n

AlexNet is a deep convolutional neural network that won the 2012 ImageNet Large Scale Visual Recognition Challenge.
The neural network classified 1.3 million high-resolution images in the LSVRC-2010 ImageNet training set into 1000 different classes, with over 500,000 neurons consisting of eight layers, five convolutional layers, and three fully-connected layers.<\/p>\n\n\n\n

Read the <\/a>paper here by Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton.<\/p>\n\n\n\n

See code implementation<\/a>.<\/p>\n\n\n\n

GPT-3<\/h3>\n\n\n\n

The GPT-3 released at the peak of COVID-19 pandemic lockdown was one of the highlights of 2020. GPT-3, short for Generative Pre-trained Transformer 3, is a language model that produces human-like text. GPT-3 succeeded the GPT-2. The OpenAI research laboratory creates the language model.<\/p>\n\n\n\n

OpenAI<\/a> mentioned on its website that over 300 applications are currently powered by GPT-3. Delivering search, conversation, text completion, and other advanced AI features through their API.<\/p>\n\n\n\n

Microsoft ImageBert<\/h3>\n\n\n\n

ImageBert is a pre-trained model that combines natural language processing (NLP) and computer vision (CV) to translate image-text. Researchers created it at Bing Multimedia Team, Microsoft. Di Qi, Lin Su, Jia Song, Edward Cui, Taroon Bharti, Arun Sacheti.<\/p>\n\n\n\n

Read the paper here ImageBERT:<\/a> Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data.<\/strong><\/p>\n\n\n\n

Machine learning in action<\/strong><\/h2>\n\n\n\n

Away from research, who is using ML in production?<\/p>\n\n\n\n

It is most likely you are consuming ML products already. Big and small organizations are incorporating ML into their products.<\/p>\n\n\n\n

Google uses Machine learning to sort out spam messages.<\/p>\n\n\n\n

Facebook uses ML in ads placement.<\/p>\n\n\n\n

Grammarly<\/strong> is a writing assistant that helps you compose writings with better grammar and checks for spelling errors.<\/p>\n\n\n\n

CopyAI<\/a> is an excellent tool for copywriting. If you have writer’s block or stale copy, typing a few words about the content you are working on can autogenerate words.<\/p>\n\n\n\n

Try CopyAI<\/a><\/p>\n\n\n\n

\"\"<\/a><\/figure>\n\n\n\n

Machine learning prerequisite<\/strong><\/h2>\n\n\n\n

Statistics<\/h3>\n\n\n\n
Statistics is the discipline concerned with collecting, organization, analysis, interpretation and presentation of empirical data.<\/pre>\n\n\n\n


Machine learning largely depends on data, and Statistics is a discipline that focuses on data.
Some tools and concepts used in statistics can be handy for machine learning.
For example, descriptive statistics are used to transform raw data into useful information.
These essential concepts include “mean,” mode, median, skewness, standard deviation, outliers, etc.
Having a good basic knowledge of statistics will help you understand data and extract information from data.<\/p>\n\n\n\n

Mathematics: Linear Algebra, Calculus, Probability<\/h3>\n\n\n\n

Mathematics helps to understand the underlying fundamentals of machine learning algorithms.<\/p>\n\n\n\n

Calculus<\/h4>\n\n\n\n

Calculus plays an essential role in designing ML algorithms.<\/p>\n\n\n\n

Some topics include:<\/p>\n\n\n\n