Lab 25: The Grammar of AI

1. Embedding search

Move the query vector. The words are toy embeddings. Ranking is by cosine similarity.

Query x

Query y

A layer computes h = ReLU(Wx+b). Change the input and watch the hidden activations.

x₁

x₂

Scores become probabilities. Increase the temperature to make probabilities flatter.

Math score

AI score

Food score

Temperature

Softmax converts raw dot-product scores into a probability distribution.

Each row is a token choosing a weighted average of value vectors. Adjust similarity strength.

Attention sharpness

A large table can be approximated by a few hidden factors. Choose rank .

Approximation rank

Random vectors become nearly orthogonal as dimension grows.

Write a short explanation: Which operation seems most central to AI in this lab: dot product, matrix multiplication, softmax, SVD, or optimization?

Big idea: AI systems often look magical because they are large, but their core grammar is built from linear algebra operations you can inspect.