How To Get Started With Machine Learning: A Tutorial For Beginners By A Beginner

April 12th 2021 new story

@torbetGuy Torbet

I'm a 17 year old Software Developer and Student from Scotland.

Machine learning and AI as a whole can seem hugely daunting when you're first getting started. Over a weekend, I painstakingly sifted through all of the "beginner" guides on how to get started, so that you don't have to.

0 reactions

If you like this post, feel free to subscribe or check out my other posts here

0 reactions

Most people recommend an image classifier using the MNIST dataset as your 'hello world'/first machine learning program - I'm not sure if there is just an obscenely high barrier to entry, or if I'm simply just stupid, but for someone who wants to go from zero to something in the world of machine learning, this project seems a bit tricky.

0 reactions

Now, of course, I attempted it. I read anything and everything that I could get my hands on about the inner workings of how neural networks ACTUALLY work. I thought I had a pretty rudimentary understanding of what was going on, but with pre-made datasets and perplexing technical jargon, it's often hard to dilute what is actually happening - I'm still not entirely sure what the f*ck a 2D convolutional layer is.

0 reactions

So without further ado, here's how a beginner can get started with machine learning, by a REAL beginner. This will cover everything, from what libraries and frameworks to use, to how to save and load your trained model. By the end of this project, you should have a rough idea as to how it all works, which will send you on your merry way to more complex projects.

0 reactions

What Framework to Use

There are countless machine learning frameworks and libraries out there, but the main players are TensorFlow, PyTorch, and Keras (now part of TensorFlow). As usual, I ended up in a state of paralysis as to which to pick, and which one was the best - this is what inspired me to adopt the mentality of just starting.

0 reactions

In the end, I went with PyTorch. I would recommend this as you get control of everything and it makes it easy to visualise what's going on, but in the grand scheme of things they all work well and it doesn't really matter which one you go with - so just pick one and get started!

0 reactions

First Project

Neural Networks function around arrays, or tensors if you're cool (yes, I know that there is a difference, but it's not that important at this stage).

0 reactions

Long story short, you give a trained model an input as an array, and it gives you an output, that is hopefully correct. The simplest way that I could think to demonstrate this was to count the number of occurrences in a list.

0 reactions

The end goal of this model is to give it a binary list, and it outputs the number of ones, for example;

0 reactions

input: [1, 0, 0, 1, 0, 0, 0, 0, 1, 1], output: 4

Neural networks require an input and an expected output in order to train. You can generate arrays of random 1's and 0's like so, which also includes labels (expected outputs) for easy training later down the line:

0 reactions

def genData(vol, length):
    
    values = []
    labels = []

		data = []

    for i in range(vol):

        rand = random.randint(0, length)
        data = rand * [1] + (length - rand) * [0]
        random.shuffle(data)
        
        labels.append(data.count(1))
        values.append(data)

    return values, labels

We then can assign the values and labels to variables, which we convert to tensors so that the neural network can easily read and process them:

0 reactions

x, y = genData(100, 10)

xTensor = torch.Tensor(np.array(x))
yTensor = torch.Tensor(np.array(y))

Simple so far right? Now we structure our network! We will use PyTorches nn.Module to include all of the layers we could ever need, and then just piece them together. For this network, I only used 1 layer, it takes in 10 inputs (I used a random binary array of length 10), and 1 output (0-10, number of 1's in the given list).

0 reactions

We then have a forward method, which tells the network how to process the inputs as they pass through the layers, we're keeping it simple by just passing them through layer 1 and using a relu function:

0 reactions

class Net(torch.nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.l1 = nn.Linear(10, 1)

    def forward(self, x):
        out = F.relu(self.l1(x))
        return out


model = Net()

Now, we need a loss function, which will tell our network how far off the expected output it was so that it can correct itself and become more accurate, we will also add an optimiser to you know... optimise this process:

0 reactions

criterion = torch.nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

You can do a lot of reading for yourself on the different loss functions, there are hundreds of them, but I enjoyed the MSE or cross-entropy loss function for linear NN's.

0 reactions

Now, TRAIN, TRAIN, TRAIN! Think of an epoch as a training loop, so in this example, I'm cycling through the data 1000 times:

0 reactions

for epoch in (t := trange(1000)):
    for i in range(len(x)):
        values = xTensor
        labels = yTensor

        output = model(values)

        optimizer.zero_grad()

        loss = criterion(output, labels.unsqueeze(1))
        loss.backward()

        optimizer.step()

We can finally save the model like this:

0 reactions

torch.save(model, "cnn.pt")

That's it! you've successfully created and trained your first neural network, from the ground up!

0 reactions

Running the Model

Now that we've trained and saved our model, we can start to pass in real inputs, and see what it predicts!

0 reactions

Make a new file, and initialise a simple array, and turn it into a tensor like before:

0 reactions

in_data = [1, 1, 1, 0, 1, 1, 1, 0, 0, 0]
inTensor = torch.Tensor(np.array(in_data))

Next, we need to copy the model from our previous file, so that we can add the weights and biases from our saved neural network to it:

0 reactions

class Net(torch.nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.l1 = nn.Linear(10, 1)

    def forward(self, x):
        x = F.relu(self.l1(x))
        return x

Finally, we can load our saved model, evaluate it, and pass in our new input data!

0 reactions

model = torch.load('cnn.pt')
model.eval()

out = model(inTensor)

If you print the value of the "out" variable, you should see a 6 in this case!

0 reactions

If you've made it this far and have been following along, you should hopefully have a better understanding of how PyTorch and machine learning work as a whole. For me, this project was a good jumping-off point, and after it, I felt more comfortable pursuing image classification with MNIST!

0 reactions

Previously published at https://torbet.co/posts/Weekend-Machine-Learning

0 reactions

Share this story

Join Hacker Noon

Create your free account to unlock your custom reading experience.

How To Get Started With Machine Learning: A Tutorial For Beginners By A Beginner

How To Get Started With Machine Learning: A Tutorial For Beginners By A Beginner

@torbetGuy Torbet

What Framework to Use

First Project

Running the Model

Recommend

Learning YAML - YAML Ain't Markup Language

Building Blocks of DOM Manipulation - Vanilla JS Tutorial (Part One)

The Ultimate Guide To Design Patterns And Generic Composite In Python

The Seven Best Admin Templates And Themes in React

Let's Talk About Vanilla JavaScript: What Is Vanilla JS, and Why Should I Spend...

The Future Is Now: Robots Helping Us Combat Plastic Pollution And Climate Change

Online Privacy is Not an Option: It's a Necessity

How to Implement Glassmorphism via HTML and CSS

Apple M1 Chip: How To Install Homebrew Using Rosetta

AnRKey X NFT Sales Reach Top 10 in the World on Rarible

About Joyk