Lab 23: Neural Networks as Matrix Machines

Central formula

A neural network layer computes

h = σ(Wx + b)

The matrix W mixes the input features into weighted sums, the bias b shifts each neuron's threshold, and the activation σ, applied elementwise, bends the otherwise affine computation.

Move the inputs and weights. The neuron computes z = w·x + b and h = max(0,z).

Controls: x₁, x₂, w₁, w₂, b
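The single-neuron computation z = w·x + b, h = max(0,z) can be sketched in a few lines of Python. The function name `relu_neuron` and the sample values are illustrative assumptions, not part of the lab.

```python
def relu_neuron(x, w, b):
    """Return (z, h) for one neuron: z = w·x + b, h = max(0, z)."""
    # Pre-activation: dot product of weights and inputs, plus the bias.
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    # ReLU passes positive values through and clips negatives to zero.
    h = max(0.0, z)
    return z, h


# Hypothetical settings for (x1, x2), (w1, w2), and b.
z, h = relu_neuron(x=[1.0, 2.0], w=[0.5, -1.0], b=0.25)
print(z, h)  # -1.25 0.0
```

With these values the weighted sum is negative, so the neuron is inactive; raising b above 1.5 would switch it on.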

Compare ReLU, sigmoid, and tanh.
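For a side-by-side comparison outside the lab, the three activations could be written as below. This is a minimal sketch using only the standard library; the sample inputs are arbitrary, and the sigmoid is split into two branches purely to avoid overflow for large negative z.

```python
import math


def relu(z):
    # Zero for negative inputs, identity for positive ones; range [0, inf).
    return max(0.0, z)


def sigmoid(z):
    # Squashes any real number into (0, 1).
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # safe: z < 0, so exp(z) cannot overflow
    return e / (1.0 + e)


def tanh(z):
    # Squashes any real number into (-1, 1) and is centered at 0.
    return math.tanh(z)


for z in (-2.0, 0.0, 2.0):
    print(f"z={z:+.1f}  relu={relu(z):.3f}  "
          f"sigmoid={sigmoid(z):.3f}  tanh={tanh(z):.3f}")
```

The printout makes the differences concrete: ReLU is unbounded above, sigmoid gives 0.5 at z = 0, and tanh gives 0 at z = 0 and is odd-symmetric.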

This layer maps R² to R³.

Controls: input x₁, input x₂

W = [[1,0],[0,1],[1,-1]], b = [0,0,1]
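To see the R² to R³ map numerically, the sketch below applies the W and b given above to one input. The lab does not say which activation this layer uses, so ReLU is assumed here; `dense_relu` and the input vector are hypothetical.

```python
def dense_relu(W, b, x):
    """Compute h = ReLU(Wx + b), with W given as a list of rows."""
    h = []
    for row, bias in zip(W, b):
        # Each output neuron is one row of W dotted with x, plus its bias.
        z = sum(w * xi for w, xi in zip(row, x)) + bias
        h.append(max(0.0, z))
    return h


W = [[1, 0], [0, 1], [1, -1]]  # 3 rows (outputs) by 2 columns (inputs)
b = [0, 0, 1]

x = [2.0, -1.0]  # hypothetical input
print(dense_relu(W, b, x))  # [2.0, 0.0, 4.0]
```

The first two outputs copy x₁ and x₂ (then clip negatives), and the third computes x₁ − x₂ + 1, so the three rows of W are three different "views" of the same two inputs.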

A ReLU neuron changes behavior across the line w·x+b=0: on one side (w·x+b ≤ 0) its output is 0, and on the other its output equals w·x+b.

Adjust raw scores and see probabilities.

Controls: score A, score B, score C
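The lab does not name the function that turns scores into probabilities; the sketch below assumes the usual choice, softmax. The maximum score is subtracted before exponentiating, which leaves the result unchanged but avoids overflow. The three sample scores are hypothetical.

```python
import math


def softmax(scores):
    """Map raw scores to probabilities that are positive and sum to 1."""
    m = max(scores)
    # Shifting by the max does not change the output but keeps exp() small.
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


probs = softmax([2.0, 1.0, 0.0])  # hypothetical scores for A, B, C
print([round(p, 3) for p in probs])
```

For these scores the probabilities come out near 0.665, 0.245, and 0.090. Only the differences between scores matter: adding the same constant to all three leaves the probabilities unchanged.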

The XOR pattern cannot be separated by one line in input space. A hidden layer can re-map the inputs into a space where a single line does separate the classes.
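One way to make this concrete is a two-unit ReLU hidden layer with hand-picked weights. The weights below are an illustrative assumption (they are not taken from the lab and are not the result of training); they are one of many settings that reproduce XOR.

```python
def relu(z):
    return max(0, z)


def xor_net(x1, x2):
    # Hidden layer: two ReLU units looking at the same sum x1 + x2.
    h1 = relu(x1 + x2)      # 0, 1, 1, 2 on the four XOR inputs
    h2 = relu(x1 + x2 - 1)  # 0, 0, 0, 1 on the four XOR inputs
    # Output layer: a single linear unit on the hidden features.
    return h1 - 2 * h2


for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print((x1, x2), "->", xor_net(x1, x2))
```

In the hidden space the four inputs land at (0,0), (1,0), (1,0), and (2,1), and the line h₁ − 2h₂ = 0.5 puts the two "1" cases on one side and the two "0" cases on the other.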

Train a line y = wx+b on synthetic data. Click several times.
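The lab does not state its training rule; a common choice, assumed here, is gradient descent on mean squared error, with each click corresponding to one update step. The data generator, true slope and intercept, learning rate, and step count below are all hypothetical.

```python
import random


def make_data(n=50, true_w=2.0, true_b=-1.0, noise=0.1, seed=0):
    """Synthetic points near the line y = true_w * x + true_b."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    ys = [true_w * x + true_b + rng.gauss(0.0, noise) for x in xs]
    return xs, ys


def mse(w, b, xs, ys):
    """Mean squared error of the line y = w*x + b on the data."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gd_step(w, b, xs, ys, lr=0.1):
    """One gradient-descent update of (w, b) on the MSE loss."""
    n = len(xs)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
    return w - lr * grad_w, b - lr * grad_b


xs, ys = make_data()
w, b = 0.0, 0.0  # start from a flat line through the origin
for step in range(200):  # each iteration plays the role of one click
    w, b = gd_step(w, b, xs, ys)
print(f"w={w:.2f}, b={b:.2f}, mse={mse(w, b, xs, ys):.4f}")
```

Each step moves w and b a small distance against the gradient of the loss, so the fitted line drifts toward the data and the error shrinks toward the noise level.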

For a dense layer from n inputs to m outputs, parameters = mn + m: mn weights in W plus m biases in b.

Controls: input dimension n, output neurons m
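The count is easy to check in code. The helper name `dense_param_count` is hypothetical, and the second example size is chosen only for illustration.

```python
def dense_param_count(n, m):
    """Parameters in a dense layer with n inputs and m outputs."""
    weights = m * n  # one weight per (output, input) pair: W is m x n
    biases = m       # one bias per output neuron
    return weights + biases


print(dense_param_count(2, 3))      # the R² to R³ layer: 6 + 3 = 9
print(dense_param_count(784, 128))  # 100352 + 128 = 100480
```

The weight term mn dominates as soon as n is large, which is why widening the input of a dense layer is much more expensive than adding biases.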

Write a short answer: In what sense is a neural network a matrix machine, and in what sense is it more than a matrix machine?