Shrijith Venkatramana

Posted on Feb 16, 2025 • Edited on Mar 7

Modeling a Neuron in micrograd (As Explained by Karpathy)

#ai #programming #python #machinelearning

Hello, I'm Shrijith. I'm building git-lrc, an AI code reviewer that runs on every commit. It is free, unlimited, and source-available on Github. Star Us to help devs discover the project. Do give it a try and share your feedback for improving the product.

## Modeling a Neuron

In serious neural network implementations, we model the neuron in the following way:

Input x0 (axon)
Weight w0 (synapse)
1 "Influence" x0*w0 (dendrite)
Sum of "influences" = x0*w0 + x1*w1 + ... (cell body)
Bias b

The above leads to the cell body expression:

$\sum (x_i \cdot w_i) + b$

We also have:

Activation function - squashing fuction (tanh, sigmoid)

The output axon is then:

f(\sum (x_i \cdot w_i) + b)

Representing the Model Neuron (defined above) in micrograd

# inputs x1, x2
x1 = Value(2.0, label='x1')
x2 = Value(0.0, label='x2')

# weights w1, w2
w1 = Value(-3.0, label='w1')
w2 = Value(1.0, label='w2')

# bias of the neuron
b = Value(6.7, label='b')

x1w1 = x1 * w1; x1w1.label = 'x1*w1'
x2w2 = x2 * w2; x2w2.label = 'x2*w2'
x1w1x2w2 = x1w1 + x2w2; x1w1x2w2.label = 'x1*w1 + x2*w2'

n = x1w1x2w2 + b; n.label = 'n'

draw_dot(n)

Result:

Implementing `tanh` into Value (for the Activation Function)

We have the following tanh formula:

We can implement the function as follows:

class Value:
    ...

    def tanh(self):
        x = self.data
        t = (math.exp(2*x) - 1) / (math.exp(2*x) + 1)
        out = Value(t, (self, ), 'tanh')
        return out

We'll add a new node o which is the tanh(n):

# inputs x1, x2
x1 = Value(2.0, label='x1')
x2 = Value(0.0, label='x2')

# weights w1, w2
w1 = Value(-3.0, label='w1')
w2 = Value(1.0, label='w2')

# bias of the neuron
b = Value(6.8813735870195432, label='b')

x1w1 = x1 * w1; x1w1.label = 'x1*w1'
x2w2 = x2 * w2; x2w2.label = 'x2*w2'
x1w1x2w2 = x1w1 + x2w2; x1w1x2w2.label = 'x1*w1 + x2*w2'

n = x1w1x2w2 + b; n.label = 'n'

o = n.tanh(); o.label = 'o'

draw_dot(o)

And we get:

Derivative of o - Derivative of `tanh`

The formula for derivative of tanh is the following:

So, we want to find out do/dn:

do/dn = 1 - tanh(n)**2 = 1 - o**2

We know that do/do = 1

So, o.grad = 1

To find do/dn, we do:

Therefore:

n.grad = 0.5

Getting all the backprop values calculated (manually)

We leverage some patterns we've learned previously about how backprop works with addition/multiplication, to quickly fill in the values for grad in each node:

o.grad = 1
n.grad = 1 - o.data**2

## addition x1w1x2w2.grad = n.grad
b.grad = n.grad
x2w2.grad = x1w1x2w2.grad
x1w1.grad = x1w1x2w2.grad

## multiplication - element.grad = sibling.data * next.grad
x2.grad = w2.data * x2w2.grad
w2.grad = x2.data * x2w2.grad
x1.grad = w1.data * x1w1.grad
w1.grad = x1.data * x1w1.grad
draw_dot(o)

Reference

*AI agents write code fast. They also silently remove logic, change behavior, and introduce bugs -- without telling you. You often find out in production.

git-lrc fixes this. It hooks into git commit and reviews every diff before it lands. 60-second setup. Completely free.*

Any feedback or contributors are welcome! It's online, source-available, and ready for anyone to use.

⭐ Star it on GitHub:

HexmosTech / git-lrc

Free, Unlimited AI Code Reviews That Run on Commit

git-lrc

Free, Unlimited AI Code Reviews That Run on Commit

AI agents write code fast. They also silently remove logic, change behavior, and introduce bugs -- without telling you. You often find out in production.

git-lrc fixes this. It hooks into git commit and reviews every diff before it lands. 60-second setup. Completely free.

See It In Action

See git-lrc catch serious security issues such as leaked credentials, expensive cloud operations, and sensitive material in log statements

git-lrc-intro-60s.mp4

Why

🤖 AI agents silently break things. Code removed. Logic changed. Edge cases gone. You won't notice until production.
🔍 Catch it before it ships. AI-powered inline comments show you exactly what changed and what looks wrong.
🔁 Build a habit, ship better code. Regular review → fewer bugs → more robust code → better results in your team.
🔗 Why git? Git is universal. Every editor, every IDE, every AI…

View on GitHub

DEV Community

Modeling a Neuron in micrograd (As Explained by Karpathy)

Representing the Model Neuron (defined above) in micrograd

Implementing `tanh` into Value (for the Activation Function)

Derivative of o - Derivative of `tanh`

Getting all the backprop values calculated (manually)

Reference

HexmosTech / git-lrc

Free, Unlimited AI Code Reviews That Run on Commit

See It In Action

Why

Top comments (0)

Representing the Model Neuron (defined above) in micrograd

Implementing tanh into Value (for the Activation Function)

Derivative of o - Derivative of tanh

Getting all the backprop values calculated (manually)

Reference

HexmosTech / git-lrc

Free, Unlimited AI Code Reviews That Run on Commit

See It In Action

Why

Implementing `tanh` into Value (for the Activation Function)

Derivative of o - Derivative of `tanh`