<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://cleardatalabs.com/feed.xml" rel="self" type="application/atom+xml" /><link href="https://cleardatalabs.com/" rel="alternate" type="text/html" /><updated>2026-04-04T19:19:13+00:00</updated><id>https://cleardatalabs.com/feed.xml</id><title type="html">ClearDataLabs</title><subtitle>AI, neural networks, and browser demos — explained from scratch.</subtitle><author><name>Kostiantyn Chumychkin</name></author><entry><title type="html">The Architectural Loophole: AI Copyright &amp;amp; Book Replication</title><link href="https://cleardatalabs.com/articles/ai-copyright-architectural-loophole/" rel="alternate" type="text/html" title="The Architectural Loophole: AI Copyright &amp;amp; Book Replication" /><published>2026-04-04T00:00:00+00:00</published><updated>2026-04-04T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/ai-copyright-architectural-loophole</id><content type="html" xml:base="https://cleardatalabs.com/articles/ai-copyright-architectural-loophole/"><![CDATA[<p>AI has a strange relationship with access control. I recently asked for a copyrighted book and hit a 403 Forbidden error—the AI refused to help me ‘pirate’ a PDF. But moments later, it granted me full read-access anyway, perfectly reconstructing the book’s entire conceptual architecture from its own weights.</p>

<p><img src="/assets/img/ai-copyright-architectural-loophole/loophole.png" alt="Digital illustration of an AI model deconstructing and recreating a software design book." /></p>

<p>While researching a few seminal software architecture books, I asked an AI assistant to find and compress their core insights. The response was the standard, hard-coded ethical wall we’ve all seen:</p>

<blockquote>
  <p><em>“I can’t help download copyrighted books. It’s a violation of the author’s rights and publisher’s copyright.”</em></p>
</blockquote>

<p><strong>But then came the pivot.</strong> Less than two minutes later, after a slight nudge for a summary, the AI offered a workaround that felt like a “legal glitch in the matrix”:</p>

<blockquote>
  <p><em>“I’ve read these extensively. Let me build distilled handbooks from my training knowledge, maintaining the same format and density.”</em></p>
</blockquote>

<p>What followed was a <strong>startlingly accurate, 8,000-word recreation</strong> of the book’s architecture. It didn’t just summarize; it mirrored the original chapter flow, conceptual hierarchy, and technical density with surgical precision.</p>

<p>It didn’t give me the <em>original file</em>, but it gave me the <em>downloadable functional DNA</em> of the work.</p>

<hr />

<h2 id="1-beyond-summarization-architectural-replication">1. Beyond Summarization: “Architectural Replication”</h2>

<p>As an AI enthusiast, I was impressed. As a professional, I was concerned. Most discussions about AI ethics focus on “scraping” or “plagiarism.” But we are entering a new phase I call <strong>Architectural Replication.</strong></p>

<p>When an LLM provides a “distilled handbook” that maintains the density of a 400-page work, it isn’t just “talking about” the book. It is <strong>mapping the blueprint.</strong></p>

<ul>
  <li><strong>The Paradox:</strong> The model protects the <em>container</em> (the PDF) while giving away the <em>contents</em> (the logic) for free.</li>
  <li><strong>The Loophole:</strong> In 2026, we are seeing the rise of <strong>Synthesized Displacement.</strong> This is where the AI provides enough “distilled” value that the user no longer feels the need to purchase the original source.</li>
</ul>

<hr />

<h2 id="2-the-2026-grey-zone-for-tech-talent">2. The 2026 “Grey Zone” for Tech Talent</h2>

<p>For “next-gen” developers and AI enthusiasts, this feels like a superpower. You can ingest the “wisdom” of a decade-long career in a 20-minute read. But this shortcut has a hidden technical debt.</p>

<h3 id="the-accuracy-trap">The Accuracy Trap</h3>
<p>The AI’s 8,000-word “mirror” is incredibly close, but it’s still a reconstruction. While the <strong>structure</strong> is there, the <strong>nuance</strong>—those hard-won edge cases that authors spend years documenting—can become flattened.</p>

<h3 id="the-data-starvation-loop">The Data Starvation Loop</h3>
<p>If we stop supporting technical authors because an AI “distilled” them for us, the flow of high-quality data stops. We are essentially eating the “seed corn” of future training data.</p>

<hr />

<h2 id="3-navigating-the-ethics-and-the-law">3. Navigating the Ethics (and the Law)</h2>

<p>Under the <strong>2026 EU AI Act</strong> and recent Medium community standards, we are moving toward a world of “AI Transparency.” If you are using these distilled handbooks to build systems, you need to be an <strong>Ethical Curator</strong>, not just a prompt engineer.</p>

<ul>
  <li><strong>Verification is Mandatory:</strong> Even a “structurally perfect” recreation can hallucinate a critical software pattern. Always treat AI-distilled handbooks as a “Map,” not the “Territory.”</li>
  <li><strong>Support the Source:</strong> If a distillation saves you 20 hours of work, that is the highest praise for the author. Buy the original book. Use it as your Source of Truth.</li>
</ul>

<hr />

<h2 id="final-thought">Final Thought</h2>

<p>When an AI can refuse a “copy” but successfully recreate the “soul” of a work, the traditional definition of copyright is effectively broken. We are in the “Wild West” of information.</p>

<p><strong>The question for us in the tech community isn’t just “Can we do this?” but “How do we build an ecosystem where the original architects of these ideas still have a reason to write?”</strong></p>

<p><strong>How are you handling these “recreated” insights in your workflow? Is the “soul” of the book enough, or do you still find yourself reaching for the original PDF?</strong></p>

<hr />
<p><em>Note: This article was written with AI assistance based on practical personal observations and experience, in alignment with 2026 transparency standards.</em></p>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[Discover how AI refuses copyrighted PDFs but recreates entire book structures from memory. Explore the ethics of Architectural Replication and synthesized displacement.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Building a Neural Network from Scratch in TypeScript — No Libraries</title><link href="https://cleardatalabs.com/articles/144-numbers-in-one-letter-out/" rel="alternate" type="text/html" title="Building a Neural Network from Scratch in TypeScript — No Libraries" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/144-numbers-in-one-letter-out</id><content type="html" xml:base="https://cleardatalabs.com/articles/144-numbers-in-one-letter-out/"><![CDATA[<p><em>This is Part 2 of the <a href="/articles/hwrjs-handwriting-recognition-in-the-browser/">hwrjs series</a> — a handwriting recognizer built from scratch in TypeScript. <a href="https://cleardatalabs.github.io/hwrjs/">Live demo</a> · <a href="https://github.com/cleardatalabs/hwrjs">Source on GitHub</a></em></p>

<hr />

<p>A feedforward neural network is a series of layers where each neuron computes a weighted sum of its inputs, passes it through an activation function, and outputs a single number. This article builds one from scratch in TypeScript — every neuron, every layer, every weight — in under 200 lines with no ML libraries, running in a browser tab.</p>

<p>The math behind neural networks is simple enough to write yourself, yet most tutorials reach for PyTorch or TensorFlow within the first five minutes, hiding the mechanics.</p>

<p>This article walks through the architecture: how a single neuron works, how neurons compose into layers, how layers compose into a network, and how the network turns 144 binary inputs into a single predicted handwritten character.</p>

<p>If you haven’t read <a href="/articles/seeing-in-cells/">Part 1</a>, the quick version: user handwriting gets normalized into a 12×12 binary grid — 144 numbers, each 0 or 1, encoding which cells of the grid the pen passed through. That array is the input to everything described here.</p>

<p>Source code: <a href="https://github.com/cleardatalabs/hwrjs">github.com/cleardatalabs/hwrjs</a> · Live demo: <a href="https://cleardatalabs.github.io/hwrjs/">cleardatalabs.github.io/hwrjs</a></p>

<hr />

<h2 id="the-neuron">The neuron</h2>

<p>A neuron takes a list of numbers, computes a weighted sum, squashes the result through a function, and produces a single output number. That’s the entirety of it.</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nneuron.ts</span>
<span class="nx">propForward</span><span class="p">(</span><span class="nx">inputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="kd">let</span> <span class="nx">sum</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">inputs</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">sum</span> <span class="o">+=</span> <span class="nx">inputs</span><span class="p">[</span><span class="nx">i</span><span class="p">]</span> <span class="o">*</span> <span class="k">this</span><span class="p">.</span><span class="nx">weights</span><span class="p">[</span><span class="nx">i</span><span class="p">];</span>
  <span class="p">}</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">sigmaFunction</span><span class="p">(</span><span class="nx">sum</span><span class="p">);</span>
<span class="p">}</span>
</code></pre></div></div>

<p><code class="language-plaintext highlighter-rouge">this.weights</code> is an array of the same length as <code class="language-plaintext highlighter-rouge">inputs</code>. Each weight says how much attention the neuron pays to the corresponding input. A large positive weight amplifies that input’s influence; a large negative weight suppresses it; near-zero means “mostly ignore this.”</p>

<p>The weighted sum is:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>sum = input[0] × weight[0] + input[1] × weight[1] + ... + input[143] × weight[143]
</code></pre></div></div>

<p>For a neuron connected to the 144-input layer, that’s 144 multiplications and 143 additions. Fast, and completely parallelizable.</p>
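<p>To make the arithmetic concrete, here is a toy version of that loop (hypothetical values, not from the hwrjs source), using four inputs instead of 144:</p>

```typescript
// Toy neuron: four grid-cell inputs, hand-picked weights.
const inputs = [1, 0, 1, 1];            // pen passed through three of four cells
const weights = [0.5, -0.3, 0.2, 0.1];  // made-up weights for illustration

let sum = 0;
for (let i = 0; i < inputs.length; i++) {
  sum += inputs[i] * weights[i];        // 1*0.5 + 0*(-0.3) + 1*0.2 + 1*0.1 = 0.8
}

// Same squashing step as sigmaFunction: maps the sum into (0, 1).
const output = 1 / (1 + Math.exp(-sum));
console.log(sum, output.toFixed(3));    // 0.8 "0.690"
```

<p>Note the second input contributes nothing: its value is 0, so its weight (however large) is multiplied away. Weights only matter for cells the pen actually touched.</p>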

<hr />

<h2 id="the-activation-function">The activation function</h2>

<p>A weighted sum can produce any real number. But probabilities live in [0, 1], and we want outputs that can be interpreted as confidence. The sigmoid function maps any real number into that range:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nneuron.ts</span>
<span class="nx">sigmaFunction</span><span class="p">(</span><span class="nx">x</span><span class="p">:</span> <span class="kr">number</span><span class="p">)</span> <span class="p">{</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">output</span> <span class="o">=</span> <span class="mi">1</span> <span class="o">/</span> <span class="p">(</span><span class="mi">1</span> <span class="o">+</span> <span class="nb">Math</span><span class="p">.</span><span class="nx">exp</span><span class="p">(</span><span class="o">-</span><span class="nx">x</span><span class="p">));</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The shape: very negative inputs produce outputs close to 0; very positive inputs produce outputs close to 1; near zero, the output is close to 0.5. Plotted, it’s an S-curve.</p>

<table>
  <thead>
    <tr>
      <th>Input (sum)</th>
      <th>Output</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>−10</td>
      <td>~0.000</td>
    </tr>
    <tr>
      <td>−2</td>
      <td>~0.119</td>
    </tr>
    <tr>
      <td>0</td>
      <td>0.500</td>
    </tr>
    <tr>
      <td>+2</td>
      <td>~0.881</td>
    </tr>
    <tr>
      <td>+10</td>
      <td>~1.000</td>
    </tr>
  </tbody>
</table>

<p>This non-linearity is what makes stacking neurons useful. Without it, a network of any depth would still be a linear transformation, and linear transformations can’t represent the curved decision boundaries needed to separate different letter shapes.</p>
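<p>You can verify the table above directly; a standalone sigmoid (equivalent to <code class="language-plaintext highlighter-rouge">sigmaFunction</code>, but returning the value instead of storing it) reproduces the same numbers:</p>

```typescript
// Standalone sigmoid, identical math to sigmaFunction in domain/nneuron.ts.
const sigmoid = (x: number) => 1 / (1 + Math.exp(-x));

console.log(sigmoid(-2).toFixed(3)); // "0.119"
console.log(sigmoid(0).toFixed(3));  // "0.500"
console.log(sigmoid(2).toFixed(3));  // "0.881"
```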

<hr />

<h2 id="weight-initialization">Weight initialization</h2>

<p>Every weight starts small and random:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nneuron.ts</span>
<span class="kd">constructor</span><span class="p">(</span><span class="nx">numInputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">)</span> <span class="p">{</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">weights</span> <span class="o">=</span> <span class="p">[];</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">deltas</span> <span class="o">=</span> <span class="p">[];</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">numInputs</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">weights</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="nb">Math</span><span class="p">.</span><span class="nx">random</span><span class="p">()</span> <span class="o">*</span> <span class="mf">0.1</span><span class="p">);</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">deltas</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="mf">0.0</span><span class="p">);</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p><code class="language-plaintext highlighter-rouge">Math.random() * 0.1</code> gives a value in <code class="language-plaintext highlighter-rouge">[0, 0.1)</code>. Starting small prevents the sigmoid from saturating immediately (pushing outputs to near-0 or near-1 from the first forward pass, which would make early learning very slow). Starting random breaks symmetry — if all weights were identical, all neurons would learn identical things and the layer would collapse to a single effective neuron.</p>

<hr />

<h2 id="the-layer">The layer</h2>

<p>A layer is a collection of neurons that all receive the same inputs and produce independent outputs.</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nlayer.ts</span>
<span class="nx">propForward</span><span class="p">(</span><span class="nx">inputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">[</span><span class="nx">i</span><span class="p">].</span><span class="nx">propForward</span><span class="p">(</span><span class="nx">inputs</span><span class="p">);</span>
  <span class="p">}</span>
<span class="p">}</span>

<span class="nx">getOutputs</span><span class="p">():</span> <span class="kr">number</span><span class="p">[]</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="nx">outputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]</span> <span class="o">=</span> <span class="p">[];</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">outputs</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">[</span><span class="nx">i</span><span class="p">].</span><span class="nx">output</span><span class="p">);</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">outputs</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p>Each neuron sees the full input vector and produces one number. The layer collects all those numbers into an output vector. A layer of N neurons transforms an M-dimensional input into an N-dimensional output.</p>

<p>This is a dense (fully connected) layer: every input is connected to every neuron. There are no skip connections, no convolutions, no attention mechanisms. The simplicity is intentional — for an educational project on a fixed alphabet, it’s enough.</p>

<hr />

<h2 id="the-network-topology">The network topology</h2>

<p>The network is built in <code class="language-plaintext highlighter-rouge">TrainingService.createNet()</code>:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// services/training.service.ts</span>
<span class="nx">createNet</span><span class="p">()</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="nx">numInputs</span>  <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">samplesService</span><span class="p">.</span><span class="nx">sensorWidth</span> <span class="o">*</span> <span class="k">this</span><span class="p">.</span><span class="nx">samplesService</span><span class="p">.</span><span class="nx">sensorHeight</span><span class="p">;</span>
  <span class="kd">const</span> <span class="nx">numOutputs</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">samplesService</span><span class="p">.</span><span class="nx">sampleGroups</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">net</span> <span class="o">=</span> <span class="k">new</span> <span class="nx">NNet</span><span class="p">(</span><span class="nx">numInputs</span><span class="p">,</span> <span class="p">[</span><span class="nx">numInputs</span><span class="p">,</span> <span class="nx">numOutputs</span><span class="p">]);</span>
<span class="p">}</span>
</code></pre></div></div>

<p><code class="language-plaintext highlighter-rouge">NNet</code> takes a number of inputs and an array specifying the neuron count for each layer. The call <code class="language-plaintext highlighter-rouge">new NNet(144, [144, N])</code> builds:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Input (144)
    ↓
Layer 1: 144 neurons  ← hidden layer
    ↓
Layer 2: N neurons    ← output layer, one neuron per letter
</code></pre></div></div>

<p>Where N is the number of distinct characters the user has trained on. Train on A, B, and C — N is 3. Train on all 26 letters — N is 26.</p>

<p>The hidden layer has 144 neurons, matching the input dimension. This is a somewhat arbitrary choice; larger or smaller hidden layers would also work, with different trade-offs in capacity and training speed.</p>

<p>The network is constructed in <code class="language-plaintext highlighter-rouge">NNet</code>:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nnet.ts</span>
<span class="kd">constructor</span><span class="p">(</span><span class="nx">numInputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">,</span> <span class="nx">numNeuronsPerLayer</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="k">new</span> <span class="nx">NLayer</span><span class="p">(</span><span class="nx">numNeuronsPerLayer</span><span class="p">[</span><span class="mi">0</span><span class="p">],</span> <span class="nx">numInputs</span><span class="p">));</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">1</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">numNeuronsPerLayer</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="k">new</span> <span class="nx">NLayer</span><span class="p">(</span><span class="nx">numNeuronsPerLayer</span><span class="p">[</span><span class="nx">i</span><span class="p">],</span> <span class="nx">numNeuronsPerLayer</span><span class="p">[</span><span class="nx">i</span> <span class="o">-</span> <span class="mi">1</span><span class="p">]));</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p>Layer 1 (the hidden layer) receives <code class="language-plaintext highlighter-rouge">numInputs</code> (144) inputs from the user’s drawing; Layer 2 (the output layer) receives the 144 outputs of Layer 1. Each layer’s input count is the previous layer’s neuron count.</p>
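<p>A hypothetical shape trace (not part of the hwrjs source) shows how the dimensions chain together for a net built like <code class="language-plaintext highlighter-rouge">new NNet(144, [144, 3])</code>, i.e. three trained letters:</p>

```typescript
// Trace the vector sizes through the layers: each layer's input size
// is the previous layer's neuron count.
const numInputs = 144;
const neuronsPerLayer = [144, 3]; // hidden layer, then output layer (3 letters)

let size = numInputs;
const shapes: string[] = [];
for (const n of neuronsPerLayer) {
  shapes.push(`${size} -> ${n}`); // this layer: `size` inputs per neuron, `n` outputs
  size = n;
}
console.log(shapes.join(", ")); // "144 -> 144, 144 -> 3"
```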

<hr />

<h2 id="forward-propagation">Forward propagation</h2>

<p>When the user draws a character and clicks “Check”, the network runs <code class="language-plaintext highlighter-rouge">propForward</code>:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nnet.ts</span>
<span class="nx">propForward</span><span class="p">(</span><span class="nx">inputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]):</span> <span class="kr">number</span><span class="p">[]</span> <span class="p">{</span>
  <span class="kd">let</span> <span class="nx">currentInputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]</span> <span class="o">=</span> <span class="nx">inputs</span><span class="p">;</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">layer</span> <span class="k">of</span> <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">layer</span><span class="p">.</span><span class="nx">propForward</span><span class="p">(</span><span class="nx">currentInputs</span><span class="p">);</span>
    <span class="nx">currentInputs</span> <span class="o">=</span> <span class="nx">layer</span><span class="p">.</span><span class="nx">getOutputs</span><span class="p">();</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">currentInputs</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The 144-element input array flows through Layer 1 (144 neurons → 144 outputs), then through Layer 2 (N neurons → N outputs). The final <code class="language-plaintext highlighter-rouge">currentInputs</code> is the network’s output: an N-element array where each value is between 0 and 1.</p>

<hr />

<h2 id="reading-the-output">Reading the output</h2>

<p>The output vector has one element per letter. After training, a well-functioning network produces something like:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>[0.03, 0.96, 0.02, 0.01]  → "B" (index 1 has the highest activation)
</code></pre></div></div>

<p>The recognition logic in <code class="language-plaintext highlighter-rouge">TrainingService.getResult()</code> finds the winner:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// services/training.service.ts</span>
<span class="kd">let</span> <span class="nx">maxValue</span> <span class="o">=</span> <span class="nx">out</span><span class="p">[</span><span class="mi">0</span><span class="p">];</span>
<span class="kd">let</span> <span class="nx">maxIndex</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
<span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">out</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
  <span class="k">if</span> <span class="p">(</span><span class="nx">out</span><span class="p">[</span><span class="nx">i</span><span class="p">]</span> <span class="o">&gt;</span> <span class="nx">maxValue</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">maxValue</span> <span class="o">=</span> <span class="nx">out</span><span class="p">[</span><span class="nx">i</span><span class="p">];</span>
    <span class="nx">maxIndex</span> <span class="o">=</span> <span class="nx">i</span><span class="p">;</span>
  <span class="p">}</span>
<span class="p">}</span>
<span class="kd">const</span> <span class="nx">res</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">samplesService</span><span class="p">.</span><span class="nx">sampleGroups</span><span class="p">[</span><span class="nx">maxIndex</span><span class="p">].</span><span class="nx">letter</span><span class="p">;</span>
<span class="k">this</span><span class="p">.</span><span class="nx">resultSource</span><span class="p">.</span><span class="nx">next</span><span class="p">(</span><span class="nx">res</span><span class="p">);</span>
</code></pre></div></div>

<p>A simple argmax — find the neuron with the highest activation and return its corresponding letter. The confidence scores (the raw output values) are also displayed in the UI as a horizontal bar next to each letter.</p>

<hr />

<h2 id="the-whole-picture">The whole picture</h2>

<p>At this point, the architecture is complete:</p>

<table>
  <thead>
    <tr>
      <th>Stage</th>
      <th>Component</th>
      <th>Shape</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Raw drawing</td>
      <td><code class="language-plaintext highlighter-rouge">DrawingComponent</code></td>
      <td><code class="language-plaintext highlighter-rouge">Point[]</code> (variable length)</td>
    </tr>
    <tr>
      <td>Grid encoding</td>
      <td><code class="language-plaintext highlighter-rouge">SamplesService.gridFromSample()</code></td>
      <td><code class="language-plaintext highlighter-rouge">number[144]</code></td>
    </tr>
    <tr>
      <td>Hidden layer</td>
      <td><code class="language-plaintext highlighter-rouge">NLayer</code> (144 neurons)</td>
      <td><code class="language-plaintext highlighter-rouge">number[144]</code></td>
    </tr>
    <tr>
      <td>Output layer</td>
      <td><code class="language-plaintext highlighter-rouge">NLayer</code> (N neurons)</td>
      <td><code class="language-plaintext highlighter-rouge">number[N]</code></td>
    </tr>
    <tr>
      <td>Prediction</td>
      <td><code class="language-plaintext highlighter-rouge">TrainingService.getResult()</code></td>
      <td><code class="language-plaintext highlighter-rouge">string</code></td>
    </tr>
  </tbody>
</table>

<p>The network doesn’t know anything about letters yet — it starts with random weights and produces random outputs. Training is what turns the random number generator into a character recognizer. That’s the subject of the next article.</p>

<hr />

<div class="post-nav">
  <a href="/articles/seeing-in-cells/">&larr; Part 1: Seeing in Cells</a>
  <a href="/articles/backprop-in-the-browser/">Part 3: Backprop in the Browser &rarr;</a>
</div>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[How to build a feedforward neural network from scratch in TypeScript: neurons, weights, sigmoid activation, and forward pass — a 3-layer perceptron in under 200 lines, no ML libraries.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Backpropagation in the Browser: Training a Neural Network in JavaScript</title><link href="https://cleardatalabs.com/articles/backprop-in-the-browser/" rel="alternate" type="text/html" title="Backpropagation in the Browser: Training a Neural Network in JavaScript" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/backprop-in-the-browser</id><content type="html" xml:base="https://cleardatalabs.com/articles/backprop-in-the-browser/"><![CDATA[<p><em>This is Part 3 of the <a href="/articles/hwrjs-handwriting-recognition-in-the-browser/">hwrjs series</a> — a handwriting recognizer built from scratch in TypeScript. <a href="https://cleardatalabs.github.io/hwrjs/">Live demo</a> · <a href="https://github.com/cleardatalabs/hwrjs">Source on GitHub</a></em></p>

<hr />

<p>Backpropagation is the algorithm that trains a neural network by computing how much each weight contributed to the output error, then adjusting every weight proportionally using gradient descent. This article implements backpropagation from scratch in TypeScript — no ML libraries — and runs the entire training loop live in the browser.</p>

<p>The network has 144 input neurons, 144 hidden neurons, and N output neurons. Before training, it returns essentially random output. Through thousands of iterations, backpropagation slowly adjusts the weights until the network correctly classifies handwritten characters. This article explains exactly how that happens: the error signal, the backward pass, the weight update rule, and the engineering trick that keeps the browser responsive while the computation runs.</p>

<p>This is Part 3 of a series. <a href="/articles/seeing-in-cells/">Part 1</a> covered input encoding; <a href="/articles/144-numbers-in-one-letter-out/">Part 2</a> covered the architecture.</p>

<p>Source code: <a href="https://github.com/cleardatalabs/hwrjs">github.com/cleardatalabs/hwrjs</a> · Live demo: <a href="https://cleardatalabs.github.io/hwrjs/">cleardatalabs.github.io/hwrjs</a></p>

<hr />

<h2 id="what-the-network-is-trying-to-do">What the network is trying to do</h2>

<p>For each training sample, the network knows the correct answer: if the user drew “A”, the target output is <code class="language-plaintext highlighter-rouge">[1, 0, 0]</code> (called a one-hot vector — 1 for the correct letter, 0 for everything else). The network produces some actual output, say <code class="language-plaintext highlighter-rouge">[0.47, 0.51, 0.52]</code>. Training is the process of nudging the weights so the actual output gets closer to the target.</p>

<p>“Closer” is measured by Mean Squared Error (MSE), computed in <code class="language-plaintext highlighter-rouge">TrainingService.calcMSE()</code>:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// services/training.service.ts</span>
<span class="nx">calcMSE</span><span class="p">():</span> <span class="kr">number</span> <span class="p">{</span>
  <span class="kd">let</span> <span class="nx">err</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">trainSet</span> <span class="k">of</span> <span class="k">this</span><span class="p">.</span><span class="nx">trainData</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">net</span><span class="p">.</span><span class="nx">propForward</span><span class="p">(</span><span class="nx">trainSet</span><span class="p">.</span><span class="nx">inputs</span><span class="p">);</span>
    <span class="nx">err</span> <span class="o">+=</span> <span class="k">this</span><span class="p">.</span><span class="nx">net</span><span class="p">.</span><span class="nx">layers</span><span class="p">[</span><span class="k">this</span><span class="p">.</span><span class="nx">net</span><span class="p">.</span><span class="nx">layers</span><span class="p">.</span><span class="nx">length</span> <span class="o">-</span> <span class="mi">1</span><span class="p">]</span>
      <span class="p">.</span><span class="nx">errorsForSelf</span><span class="p">(</span><span class="nx">trainSet</span><span class="p">.</span><span class="nx">outputs</span><span class="p">)</span>
      <span class="p">.</span><span class="nx">reduce</span><span class="p">((</span><span class="nx">a</span><span class="p">,</span> <span class="nx">b</span><span class="p">)</span> <span class="o">=&gt;</span> <span class="nx">a</span> <span class="o">+</span> <span class="nx">b</span> <span class="o">*</span> <span class="nx">b</span><span class="p">,</span> <span class="mi">0</span><span class="p">);</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">err</span> <span class="o">/</span> <span class="k">this</span><span class="p">.</span><span class="nx">trainData</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p><code class="language-plaintext highlighter-rouge">errorsForSelf(targets)</code> returns <code class="language-plaintext highlighter-rouge">output - target</code> for each output neuron. Squaring each difference (via <code class="language-plaintext highlighter-rouge">reduce</code>) penalizes large errors more than small ones and keeps the total positive. Averaging over all training samples gives a single number: how wrong the network is, overall.</p>
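<p>To make the numbers concrete, here is that computation run by hand on the sample output from above (a standalone sketch; the values are illustrative, not taken from a real training run):</p>

```typescript
// Squared-error sum for one training sample, mirroring what
// calcMSE() accumulates per sample. Values are illustrative.
const actual = [0.47, 0.51, 0.52]; // network output
const target = [1, 0, 0];          // one-hot target for "A"

// errorsForSelf: output - target, per output neuron
const errors = actual.map((o, i) => o - target[i]); // [-0.53, 0.51, 0.52]

// squared-error sum, as in the reduce() above
const sqErr = errors.reduce((a, b) => a + b * b, 0);

console.log(sqErr.toFixed(4)); // 0.8114
```

<p>Averaging this per-sample sum over all training samples gives the MSE the UI displays.</p>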

<p>When you click “Train”, the UI shows <code class="language-plaintext highlighter-rouge">MSE Initial</code> and <code class="language-plaintext highlighter-rouge">MSE Current</code>. A well-trained network drives that second number close to zero.</p>

<hr />

<h2 id="backpropagation-step-by-step">Backpropagation, step by step</h2>

<p>Backpropagation is an application of the chain rule from calculus: to reduce the output error, compute how much each weight contributed to that error, then adjust each weight proportionally.</p>

<p>The implementation is split across three methods.</p>

<h3 id="step-1-output-layer-errors">Step 1: output layer errors</h3>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nlayer.ts</span>
<span class="nx">errorsForSelf</span><span class="p">(</span><span class="nx">targets</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]):</span> <span class="kr">number</span><span class="p">[]</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="nx">errors</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]</span> <span class="o">=</span> <span class="p">[];</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">errors</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">[</span><span class="nx">i</span><span class="p">].</span><span class="nx">output</span> <span class="o">-</span> <span class="nx">targets</span><span class="p">[</span><span class="nx">i</span><span class="p">]);</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">errors</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p>For each output neuron, the raw error is simply <code class="language-plaintext highlighter-rouge">output - target</code>. If the network says 0.47 for “A” and the target is 1.0, the error is −0.53. The sign tells us which direction to move.</p>

<p>That raw error gets converted to a neuron delta by multiplying by the sigmoid derivative:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nneuron.ts</span>
<span class="nx">sigmaDelta</span><span class="p">():</span> <span class="kr">number</span> <span class="p">{</span>
  <span class="k">return</span> <span class="k">this</span><span class="p">.</span><span class="nx">output</span> <span class="o">*</span> <span class="p">(</span><span class="mi">1</span> <span class="o">-</span> <span class="k">this</span><span class="p">.</span><span class="nx">output</span><span class="p">);</span>
<span class="p">}</span>

<span class="nx">calcError</span><span class="p">(</span><span class="nx">error</span><span class="p">:</span> <span class="kr">number</span><span class="p">)</span> <span class="p">{</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">error</span> <span class="o">=</span> <span class="nx">error</span> <span class="o">*</span> <span class="k">this</span><span class="p">.</span><span class="nx">sigmaDelta</span><span class="p">();</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The sigmoid derivative <code class="language-plaintext highlighter-rouge">output × (1 − output)</code> approaches zero as the output nears 0 or 1 (the neuron is already “decided”) and peaks at 0.25 when the output is 0.5. This is the chain rule in action: the gradient of the loss with respect to the pre-activation sum is the raw error scaled by how responsive the neuron is at its current activation.</p>
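<p>Plugging in the earlier example (output 0.47, target 1.0) shows how the raw error becomes a delta (a standalone sketch with illustrative numbers):</p>

```typescript
// Standalone illustration of calcError(): the raw error scaled by the
// sigmoid derivative at the neuron's current output. Values are illustrative.
const output = 0.47;
const targetVal = 1.0;

const rawError = output - targetVal;      // -0.53
const sigmaDelta = output * (1 - output); // 0.47 * 0.53 = 0.2491
const delta = rawError * sigmaDelta;      // -0.53 * 0.2491

console.log(delta.toFixed(4)); // -0.1320
```

<p>The delta keeps the sign of the raw error but is damped by how “undecided” the neuron is.</p>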

<h3 id="step-2-propagate-errors-backward">Step 2: propagate errors backward</h3>

<p>Output layer errors are known. Hidden layer errors are computed from them:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nlayer.ts</span>
<span class="nx">errorsForPrevious</span><span class="p">():</span> <span class="kr">number</span><span class="p">[]</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="nx">errors</span><span class="p">:</span> <span class="kr">number</span><span class="p">[]</span> <span class="o">=</span> <span class="p">[];</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="k">this</span><span class="p">.</span><span class="nx">numInputs</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="kd">let</span> <span class="nx">error</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
    <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">neuron</span> <span class="k">of</span> <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">)</span> <span class="p">{</span>
      <span class="nx">error</span> <span class="o">+=</span> <span class="nx">neuron</span><span class="p">.</span><span class="nx">error</span> <span class="o">*</span> <span class="nx">neuron</span><span class="p">.</span><span class="nx">weights</span><span class="p">[</span><span class="nx">i</span><span class="p">];</span>
    <span class="p">}</span>
    <span class="nx">errors</span><span class="p">.</span><span class="nx">push</span><span class="p">(</span><span class="nx">error</span><span class="p">);</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">errors</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p>For each input position <code class="language-plaintext highlighter-rouge">i</code>, the error attributed to it is the sum over all neurons of <code class="language-plaintext highlighter-rouge">neuron.error × neuron.weights[i]</code>. Intuitively: if a weight is large and the downstream neuron has a large error, that input was heavily responsible for the mistake.</p>
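<p>A tiny numeric sketch of that attribution, using two hypothetical downstream neurons (the errors and weights are made up for illustration):</p>

```typescript
// Error attributed to input 0 by two hypothetical downstream neurons:
// each blames the input in proportion to the weight connecting them.
const neurons = [
  { error: -0.132, weight0: 0.8 },  // large weight, large error: big blame
  { error: 0.065, weight0: -0.2 },  // small weight: little blame
];

const errorForInput0 = neurons.reduce(
  (sum, n) => sum + n.error * n.weight0,
  0
);

console.log(errorForInput0.toFixed(4)); // -0.1186
```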

<p><code class="language-plaintext highlighter-rouge">NNet.propBackward()</code> orchestrates the whole backward pass:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nnet.ts</span>
<span class="nx">propBackward</span><span class="p">(</span><span class="nx">outputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="nx">lastLayer</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">[</span><span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">.</span><span class="nx">length</span> <span class="o">-</span> <span class="mi">1</span><span class="p">];</span>
  <span class="nx">lastLayer</span><span class="p">.</span><span class="nx">calcOwnError</span><span class="p">(</span><span class="nx">lastLayer</span><span class="p">.</span><span class="nx">errorsForSelf</span><span class="p">(</span><span class="nx">outputs</span><span class="p">));</span>

  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">.</span><span class="nx">length</span> <span class="o">-</span> <span class="mi">2</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&gt;=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span><span class="o">--</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">[</span><span class="nx">i</span><span class="p">].</span><span class="nx">calcOwnError</span><span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">layers</span><span class="p">[</span><span class="nx">i</span> <span class="o">+</span> <span class="mi">1</span><span class="p">].</span><span class="nx">errorsForPrevious</span><span class="p">());</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The error flows from the output layer backward to the first hidden layer. Each layer computes its own deltas based on the layer immediately after it.</p>

<h3 id="step-3-update-the-weights">Step 3: update the weights</h3>

<p>With deltas in hand, every weight gets adjusted:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nlayer.ts</span>
<span class="nx">updateWeights</span><span class="p">(</span><span class="nx">inputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">neuron</span> <span class="k">of</span> <span class="k">this</span><span class="p">.</span><span class="nx">neurons</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">neuron</span><span class="p">.</span><span class="nx">weights</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
      <span class="nx">neuron</span><span class="p">.</span><span class="nx">weights</span><span class="p">[</span><span class="nx">i</span><span class="p">]</span> <span class="o">-=</span> <span class="nx">NNeuron</span><span class="p">.</span><span class="nx">M</span> <span class="o">*</span> <span class="nx">inputs</span><span class="p">[</span><span class="nx">i</span><span class="p">]</span> <span class="o">*</span> <span class="nx">neuron</span><span class="p">.</span><span class="nx">error</span><span class="p">;</span>
    <span class="p">}</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The update rule: <code class="language-plaintext highlighter-rouge">weight -= M × input × error</code></p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">M = 0.3</code> is the learning rate — how large a step to take each iteration</li>
  <li><code class="language-plaintext highlighter-rouge">input</code> is the activation the weight was multiplied against during the forward pass</li>
  <li><code class="language-plaintext highlighter-rouge">neuron.error</code> is the delta computed during backpropagation</li>
</ul>

<p>If <code class="language-plaintext highlighter-rouge">input</code> was large and <code class="language-plaintext highlighter-rouge">error</code> was large, the weight changes significantly — this connection was responsible for a big mistake. If <code class="language-plaintext highlighter-rouge">input</code> was zero (the cell was empty in the grid), the weight doesn’t change at all — an absent feature can’t be blamed.</p>

<p>The learning rate of 0.3 is a hyperparameter. Too high and the network overshoots and oscillates; too low and training converges slowly. 0.3 works well for this small network and dataset.</p>
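<p>A single weight update, traced by hand with the illustrative delta from earlier (the starting weight is hypothetical):</p>

```typescript
// One weight update, following weight -= M * input * error.
// Starting weight and delta are illustrative assumptions.
const M = 0.3;        // learning rate
const input = 1;      // this grid cell was filled during the forward pass
const error = -0.132; // delta from the backward pass

let weight = 0.5;                 // hypothetical starting weight
weight -= M * input * error;      // 0.5 - 0.3 * 1 * (-0.132)

console.log(weight.toFixed(4)); // 0.5396
```

<p>The negative delta pushes the weight up, which raises the neuron’s output toward the target on the next forward pass.</p>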

<hr />

<h2 id="one-complete-training-step">One complete training step</h2>

<p><code class="language-plaintext highlighter-rouge">NNet.trainSample()</code> ties it all together:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// domain/nnet.ts</span>
<span class="nx">trainSample</span><span class="p">(</span><span class="nx">inputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[],</span> <span class="nx">outputs</span><span class="p">:</span> <span class="kr">number</span><span class="p">[])</span> <span class="p">{</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">propForward</span><span class="p">(</span><span class="nx">inputs</span><span class="p">);</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">propBackward</span><span class="p">(</span><span class="nx">outputs</span><span class="p">);</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">updateWeights</span><span class="p">(</span><span class="nx">inputs</span><span class="p">);</span>
<span class="p">}</span>
</code></pre></div></div>

<p>Forward pass → backward pass → weight update. Three lines, repeated for every training sample, thousands of times.</p>

<hr />

<h2 id="the-training-loop">The training loop</h2>

<p><code class="language-plaintext highlighter-rouge">TrainingService.trainCycle()</code> runs 100 complete passes over all training data:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// services/training.service.ts</span>
<span class="nx">trainCycle</span><span class="p">()</span> <span class="p">{</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">let</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="k">this</span><span class="p">.</span><span class="nx">itersPerCycle</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">trainSet</span> <span class="k">of</span> <span class="k">this</span><span class="p">.</span><span class="nx">trainData</span><span class="p">)</span> <span class="p">{</span>
      <span class="k">this</span><span class="p">.</span><span class="nx">net</span><span class="p">.</span><span class="nx">trainSample</span><span class="p">(</span><span class="nx">trainSet</span><span class="p">.</span><span class="nx">inputs</span><span class="p">,</span> <span class="nx">trainSet</span><span class="p">.</span><span class="nx">outputs</span><span class="p">);</span>
    <span class="p">}</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p>One iteration = one forward pass + one backward pass + one weight update per training sample. One cycle = 100 iterations over all samples.</p>

<hr />

<h2 id="the-ui-synchronization-problem">The UI synchronization problem</h2>

<p>JavaScript in a browser runs on a single thread. If you put all the training in a <code class="language-plaintext highlighter-rouge">while (true)</code> loop, the page freezes — no repaints, no user interaction — until the loop exits. For a demo that’s supposed to show live MSE decreasing, that’s unacceptable.</p>

<p>The solution is <code class="language-plaintext highlighter-rouge">setTimeout</code>:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// training.component.ts</span>
<span class="nx">train</span><span class="p">()</span> <span class="p">{</span>
  <span class="k">if</span> <span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">trainingStarted</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">trainingService</span><span class="p">.</span><span class="nx">trainCycle</span><span class="p">();</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">iterations</span> <span class="o">+=</span> <span class="k">this</span><span class="p">.</span><span class="nx">trainingService</span><span class="p">.</span><span class="nx">itersPerCycle</span><span class="p">;</span>
    <span class="k">this</span><span class="p">.</span><span class="nx">errCurrent</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">trainingService</span><span class="p">.</span><span class="nx">calcMSE</span><span class="p">();</span>

    <span class="nx">setTimeout</span><span class="p">(()</span> <span class="o">=&gt;</span> <span class="p">{</span>
      <span class="k">this</span><span class="p">.</span><span class="nx">train</span><span class="p">();</span>
    <span class="p">},</span> <span class="mi">50</span><span class="p">);</span>
  <span class="p">}</span>
<span class="p">}</span>
</code></pre></div></div>

<p><code class="language-plaintext highlighter-rouge">setTimeout(callback, 50)</code> schedules the next training cycle to run 50 milliseconds later. Between that call and the next cycle, the browser event loop has a chance to run: it processes any pending input events and repaints the DOM. The user sees the MSE tick downward, the iteration counter increase, and can click “Stop” at any time.</p>

<p>The pairing of 100 iterations per cycle with the 50ms timeout is the tuning knob. Fewer iterations per cycle make the UI more responsive but training converges more slowly; more iterations per cycle speed up convergence at the cost of responsiveness. The current values strike a reasonable balance for a small training set on a modern machine.</p>

<p>A more sophisticated implementation would use a Web Worker to move the computation off the main thread entirely, eliminating the need for the setTimeout rhythm. For a demo that runs in a browser and prioritizes readability over throughput, the timer approach is clean enough — and it makes the training loop visible in the DevTools profiler.</p>

<hr />

<h2 id="watching-convergence">Watching convergence</h2>

<p>When you click “Train” for the first time:</p>

<ol>
  <li><code class="language-plaintext highlighter-rouge">createNet()</code> builds the network with random weights</li>
  <li><code class="language-plaintext highlighter-rouge">calcMSE()</code> runs once to capture <code class="language-plaintext highlighter-rouge">MSE Initial</code> — typically somewhere between 0.2 and 0.4</li>
  <li>The training loop starts; <code class="language-plaintext highlighter-rouge">MSE Current</code> updates every ~50ms</li>
  <li>After a few hundred iterations, MSE should settle below 0.05 for a well-separated alphabet</li>
</ol>

<p>Draw a character you trained on and click “Check”. The result appears on the canvas, and the confidence bars update to show each output neuron’s activation: the network’s confidence in each candidate letter.</p>

<hr />

<h2 id="limitations-and-what-to-try-next">Limitations and what to try next</h2>

<p>This network is a minimal implementation. It works for the demo case, but a few changes would make it more capable:</p>

<ul>
  <li><strong>Bias neurons</strong>: every neuron uses a weighted sum with no constant offset. Adding a bias weight (always connected to input 1.0) would let neurons activate even when all real inputs are zero. Bias is standard in modern networks; its absence here is a deliberate simplification.</li>
  <li><strong>More hidden layers</strong>: one hidden layer limits the complexity of the decision boundaries. A second or third layer could represent more abstract features.</li>
  <li><strong>Momentum</strong>: the current update rule is plain gradient descent. Adding a momentum term (carrying a fraction of the previous weight update forward) reduces oscillation and often converges faster.</li>
  <li><strong>Web Worker for training</strong>: move <code class="language-plaintext highlighter-rouge">trainCycle()</code> to a Worker thread. The main thread stays responsive without needing the 50ms yield trick.</li>
</ul>
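<p>The momentum idea from the list above can be sketched in a few lines (this is not in the repo; <code class="language-plaintext highlighter-rouge">alpha</code> and the weight values are illustrative assumptions):</p>

```typescript
// Sketch of gradient descent with momentum (not in the repo): carry a
// fraction of the previous update into the next one.
const M = 0.3;     // learning rate, as in the article
const alpha = 0.9; // momentum coefficient (hypothetical value)

let weight = 0.5;  // hypothetical starting weight
let prevUpdate = 0;

function updateWithMomentum(input: number, error: number): void {
  const update = -M * input * error + alpha * prevUpdate;
  weight += update;
  prevUpdate = update;
}

updateWithMomentum(1, -0.132); // same gradient twice in a row...
updateWithMomentum(1, -0.132); // ...so the second step is larger

console.log(prevUpdate.toFixed(5)); // 0.07524
```

<p>When consecutive gradients point the same way, the steps grow; when they alternate sign, the momentum term cancels part of the oscillation.</p>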

<p>If you want to experiment: fork the repo, change <code class="language-plaintext highlighter-rouge">NNeuron.M</code> from 0.3 to 0.1 and observe slower convergence; change it to 0.9 and watch the network oscillate. Add a second hidden layer in <code class="language-plaintext highlighter-rouge">createNet()</code> and see if recognition accuracy improves. The network is small enough that every parameter is accessible in the source.</p>

<hr />

<h2 id="the-complete-picture">The complete picture</h2>

<p>Three articles, one project. The pieces:</p>

<ol>
  <li><strong>Input encoding</strong> — raw strokes become a 144-element binary grid via bounding-box normalization</li>
  <li><strong>Architecture</strong> — a 3-layer perceptron: 144 inputs → 144 hidden → N outputs, each neuron using sigmoid activation</li>
  <li><strong>Training</strong> — gradient descent with backpropagation, running in the browser’s event loop via <code class="language-plaintext highlighter-rouge">setTimeout</code></li>
</ol>

<p>The result is a handwriting recognizer that runs entirely client-side, built from first principles in about 300 lines of TypeScript. No magic, no library hiding the math, no GPU required.</p>

<hr />

<div class="post-nav">
  <a href="/articles/144-numbers-in-one-letter-out/">&larr; Part 2: 144 Numbers In, One Letter Out</a>
  <span></span>
</div>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[How backpropagation and gradient descent work, step by step — implemented in TypeScript and running live in the browser. Train a neural network without leaving JavaScript.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Handwriting Recognition in the Browser — Neural Network in TypeScript from Scratch</title><link href="https://cleardatalabs.com/articles/hwrjs-handwriting-recognition-in-the-browser/" rel="alternate" type="text/html" title="Handwriting Recognition in the Browser — Neural Network in TypeScript from Scratch" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/hwrjs-handwriting-recognition-in-the-browser</id><content type="html" xml:base="https://cleardatalabs.com/articles/hwrjs-handwriting-recognition-in-the-browser/"><![CDATA[<p>Draw a letter. The network classifies it. No server, no cloud API, no GPU — just TypeScript and a browser tab.</p>

<p><a href="https://cleardatalabs.github.io/hwrjs/"><img src="/assets/img/ai.gif" alt="Handwriting recognition demo" /></a></p>

<p><a href="https://cleardatalabs.github.io/hwrjs/"><strong>Try the live demo →</strong></a></p>

<hr />

<h2 id="what-this-is">What this is</h2>

<p><a href="https://github.com/cleardatalabs/hwrjs">hwrjs</a> is a handwritten character recognizer built with Angular and TypeScript. The neural network inside it is implemented from scratch — every neuron, every weight, every gradient update is written by hand. No TensorFlow. No ONNX. No ML library of any kind.</p>

<p>You train it yourself: draw a few examples of each letter you want to recognize, click Train, and within seconds the network learns to distinguish your handwriting. Then draw a new letter and watch it predict.</p>

<p>The whole network is under 300 lines of TypeScript spread across three files.</p>

<h2 id="how-it-works">How it works</h2>

<p>The pipeline has three stages:</p>

<p><strong>1. Input encoding</strong> — When you draw on the canvas, the raw pen coordinates get normalized into a 12×12 binary grid. This grid is scale-invariant (it doesn’t matter how big or small you draw) and position-invariant (it doesn’t matter where on the canvas you draw). The result is 144 numbers, each 0 or 1.</p>
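<p>The encoding idea can be sketched in a few lines (a simplified approximation, not the project’s actual code; Part 1 walks through the real implementation):</p>

```typescript
// Simplified sketch of the encoding idea: map stroke points into a
// 12x12 binary grid via their bounding box, so the result is scale-
// and position-invariant. Not the project's actual code.
type Point = { x: number; y: number };

function encode(points: Point[], size = 12): number[] {
  const xs = points.map((p) => p.x);
  const ys = points.map((p) => p.y);
  const minX = Math.min(...xs), maxX = Math.max(...xs);
  const minY = Math.min(...ys), maxY = Math.max(...ys);

  const grid = new Array(size * size).fill(0);
  for (const p of points) {
    // Scale each coordinate into [0, size); clamp the max edge.
    // The "|| 1" guards against a degenerate zero-width bounding box.
    const col = Math.min(size - 1, Math.floor(((p.x - minX) / (maxX - minX || 1)) * size));
    const row = Math.min(size - 1, Math.floor(((p.y - minY) / (maxY - minY || 1)) * size));
    grid[row * size + col] = 1;
  }
  return grid; // 144 numbers, each 0 or 1
}
```

<p>Two opposite corners of any drawing, large or small, anywhere on the canvas, always land in the grid’s first and last cells.</p>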

<p><strong>2. The network</strong> — A feedforward neural network with three layers: 144 inputs → 144 hidden neurons → N outputs (one per letter you’ve trained on). Each neuron computes a weighted sum of its inputs and passes the result through a sigmoid activation function.</p>

<p><strong>3. Training</strong> — Backpropagation with gradient descent. For each training sample, the network computes its error, propagates that error backward through the layers, and adjusts each weight proportionally. This runs in the browser’s event loop using <code class="language-plaintext highlighter-rouge">setTimeout</code> to keep the UI responsive.</p>

<h2 id="the-series">The series</h2>

<p>These three articles cover each stage in detail:</p>

<ol>
  <li><a href="/articles/seeing-in-cells/">Seeing in Cells: How a Computer Reads Your Handwriting</a> — how raw pen strokes become a 144-number array</li>
  <li><a href="/articles/144-numbers-in-one-letter-out/">144 Numbers In, One Letter Out</a> — the network architecture, neuron math, and forward propagation</li>
  <li><a href="/articles/backprop-in-the-browser/">Backprop in the Browser</a> — gradient descent, backpropagation, and the browser UI synchronization trick</li>
</ol>

<p>Each article is self-contained but they build on each other. Start from Part 1 if you want the full picture.</p>

<h2 id="source-code">Source code</h2>

<p>The full project is on GitHub: <a href="https://github.com/cleardatalabs/hwrjs">github.com/cleardatalabs/hwrjs</a>. Fork it, change the learning rate, add a hidden layer, swap the activation function — the network is small enough that every parameter is accessible and every change is immediately visible in the demo.</p>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[A handwriting recognition neural network running entirely in the browser, built in TypeScript with no ML libraries. Train it on your own letters and watch it classify in real time.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">What Does a Neural Network Learn? Visualizing MNIST with Causal Index</title><link href="https://cleardatalabs.com/articles/knowledge-extract-chapter-1/" rel="alternate" type="text/html" title="What Does a Neural Network Learn? Visualizing MNIST with Causal Index" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/knowledge-extract-chapter-1</id><content type="html" xml:base="https://cleardatalabs.com/articles/knowledge-extract-chapter-1/"><![CDATA[<p><em>This is Chapter 1 of a two-part series on knowledge extraction from neural networks. <a href="https://cleardatalabs.github.io/knowledge-extract-ffnn-mnist/">Live demo</a> · <a href="https://github.com/cleardatalabs/knowledge-extract-ffnn-mnist">Source on GitHub</a></em></p>

<hr />

<p>The causal index is a method for understanding what a neural network has learned by measuring how strongly each input pixel influences each output class. For an MNIST digit classifier, this produces heat maps that visually reveal which pixel regions the network relies on for each digit — a form of neural network interpretability implemented here in pure JavaScript, running in the browser.</p>

<h2 id="background">Background</h2>

<p>This project was inspired by an <a href="https://web.archive.org/web/20201112004840/http://myselph.de/neuralNet.html">online demo</a> by Hubert Eichner — a neural network for handwritten digit recognition running entirely in the browser. The network was trained on the <a href="https://en.wikipedia.org/wiki/MNIST_database">MNIST dataset</a> in MATLAB, then exported to JavaScript. Combined with a drawing tool, it lets users write digits on screen and get instant predictions.</p>

<p>The model achieves a recognition error of just 1.92% (9,808 of 10,000 test digits classified correctly), a solid result among comparable MNIST benchmarks. Great work and a beautiful presentation — but can we go further?</p>

<h2 id="the-question-what-has-the-network-learned">The Question: What Has the Network Learned?</h2>

<p>A trained model can classify digits, but there’s growing interest in understanding <em>how</em> it makes decisions. Researchers often want to peek inside the “black box” and extract interpretable rules or measure how each input contributes to the output.</p>

<p>Several approaches exist for this purpose, varying in complexity and assumptions about network structure. Two useful references:</p>

<ul>
  <li><a href="https://www.eng.tau.ac.il/~michaelm/barca.pdf">Barczys et al. — Rule extraction from neural networks</a></li>
  <li><a href="https://www.researchgate.net/publication/3715258_Knowledge_extraction_from_artificial_neural_network_models">Knowledge Extraction from Artificial Neural Network Models (ResearchGate)</a></li>
</ul>

<p>In this chapter, we use one of the simplest: the <strong>causal index</strong> method.</p>

<h2 id="network-architecture">Network Architecture</h2>

<p>The network has a straightforward feed-forward structure:</p>

<ul>
  <li><strong>Input layer</strong>: 784 units (a 28 x 28 grayscale image, pixel values normalized to the range [-1, 1])</li>
  <li><strong>Hidden layer</strong>: a set of hidden neurons with learned weights</li>
  <li><strong>Output layer</strong>: 10 units, each representing the probability of a digit class (0 through 9)</li>
</ul>

<p>The full network structure and weight values are available in <code class="language-plaintext highlighter-rouge">net.js</code>, extracted from the original demo page.</p>
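<p>To make the structure concrete, here is a minimal sketch of a forward pass for a network of this shape. The variable names <code class="language-plaintext highlighter-rouge">w12</code> (input to hidden) and <code class="language-plaintext highlighter-rouge">w23</code> (hidden to output) match the snippet later in this article; the tanh activation and explicit bias vectors are assumptions, not necessarily what <code class="language-plaintext highlighter-rouge">net.js</code> does.</p>

```javascript
// Hypothetical forward pass for a 784 -> hidden -> 10 feed-forward net.
// w12[j][i]: weight from input pixel i to hidden neuron j
// w23[k][j]: weight from hidden neuron j to output neuron k
// b2, b3: bias vectors (assumed; the real net.js may organize these differently)
function forward(pixels, w12, w23, b2, b3) {
  const hidden = w12.map((row, j) =>
    Math.tanh(row.reduce((sum, w, i) => sum + w * pixels[i], b2[j])));
  // Raw output scores; the demo would normalize these into class probabilities.
  return w23.map((row, k) =>
    row.reduce((sum, w, j) => sum + w * hidden[j], b3[k]));
}
```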

<h2 id="computing-the-causal-index">Computing the Causal Index</h2>

<p>Since the architecture and weights are fully known, we can calculate a <strong>causal index</strong> for each input pixel relative to each output class. The causal index measures how strongly a given input pixel influences a particular output, summed across all paths through the hidden layer:</p>

<p><strong>C_ki = sum over j from 1 to h of (W_kj * W_ji)</strong></p>

<p>Where:</p>
<ul>
  <li><strong>C_ki</strong> is the causal index of input pixel i on output class k</li>
  <li><strong>h</strong> is the number of hidden neurons</li>
  <li><strong>W_kj</strong> is the weight from hidden neuron j to output neuron k</li>
  <li><strong>W_ji</strong> is the weight from input pixel i to hidden neuron j</li>
</ul>

<p>In JavaScript, this looks like:</p>

<div class="language-javascript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="kd">function</span> <span class="nx">getInfluence</span><span class="p">(</span><span class="nx">inputIndex</span><span class="p">,</span> <span class="nx">outIndex</span><span class="p">)</span> <span class="p">{</span>
  <span class="kd">var</span> <span class="nx">sum</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">var</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="nx">w12</span><span class="p">.</span><span class="nx">length</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
    <span class="nx">sum</span> <span class="o">+=</span> <span class="nx">w12</span><span class="p">[</span><span class="nx">i</span><span class="p">][</span><span class="nx">inputIndex</span><span class="p">]</span> <span class="o">*</span> <span class="nx">w23</span><span class="p">[</span><span class="nx">outIndex</span><span class="p">][</span><span class="nx">i</span><span class="p">];</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="nx">sum</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<h2 id="visualizing-the-results">Visualizing the Results</h2>

<p>The final step is to create 10 “heat maps” — one for each digit class. Each heat map is a 28x28 image whose pixel intensity encodes the causal index: the darker a pixel, the more influence it has on the network’s prediction for that digit.</p>

<p>The visualization is rendered on HTML canvas elements using a <code class="language-plaintext highlighter-rouge">draw</code> function that maps each pixel’s causal index to a grayscale color value.</p>
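<p>The rendering itself is only a few lines. The sketch below is not the demo’s exact <code class="language-plaintext highlighter-rouge">draw</code> function (the normalization scheme is an assumption), but it shows the idea: rescale the 784 causal index values to grayscale, inverted so the strongest influence is darkest, and paint them into a 28x28 <code class="language-plaintext highlighter-rouge">ImageData</code>.</p>

```javascript
// Sketch of a heat-map renderer (assumed normalization, not the demo's code).
// Rescale causal index values to [0, 255], inverting so that the strongest
// influence becomes the darkest pixel.
function toGrayscale(values) {
  const min = Math.min(...values);
  const max = Math.max(...values);
  const range = (max - min) || 1; // avoid division by zero on a flat map
  return values.map(v => Math.round(255 * (1 - (v - min) / range)));
}

// Paint one 28x28 heat map onto a canvas 2D context.
function drawHeatMap(ctx, causalIndices) {
  const gray = toGrayscale(causalIndices);
  const img = ctx.createImageData(28, 28);
  for (let p = 0; p < 28 * 28; p++) {
    img.data[4 * p] = img.data[4 * p + 1] = img.data[4 * p + 2] = gray[p];
    img.data[4 * p + 3] = 255; // fully opaque
  }
  ctx.putImageData(img, 0, 0);
}
```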

<h2 id="what-the-heat-maps-reveal">What the Heat Maps Reveal</h2>

<p>The results are striking: the heat maps closely resemble the actual digit shapes. This makes intuitive sense — pixels in the regions where a digit is typically drawn should have the strongest influence on recognizing that digit.</p>

<p><img src="/assets/img/mnist3.jpg" alt="Heat map visualization" /></p>

<p>You can see all 10 heat maps generated live in the <a href="https://cleardatalabs.github.io/knowledge-extract-ffnn-mnist/">interactive demo</a>.</p>

<h2 id="whats-next">What’s Next</h2>

<p>The causal index method is fast and intuitive, and it works well for simple feed-forward networks with known structure. However, more complex architectures (or true “black box” models) require different techniques — for instance, iteratively adapting an input image to maximize a particular output class, similar to the approach used in <a href="https://en.wikipedia.org/wiki/DeepDream">DeepDream</a>.</p>

<p>That’s exactly what we explore in Chapter 2.</p>

<hr />

<div class="post-nav">
  <span></span>
  <a href="/articles/knowledge-extract-chapter-2/">Chapter 2: Iterative Input Adaptation &rarr;</a>
</div>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[What has a neural network trained on MNIST actually learned? We compute a causal index per pixel and visualize the result as a heat map — implemented in pure JavaScript.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/mnist3.jpg" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/mnist3.jpg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Making a Neural Network Dream: DeepDream-Style Visualization in JavaScript</title><link href="https://cleardatalabs.com/articles/knowledge-extract-chapter-2/" rel="alternate" type="text/html" title="Making a Neural Network Dream: DeepDream-Style Visualization in JavaScript" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/knowledge-extract-chapter-2</id><content type="html" xml:base="https://cleardatalabs.com/articles/knowledge-extract-chapter-2/"><![CDATA[<p><em>This is Chapter 2 of a two-part series on knowledge extraction from neural networks. <a href="https://cleardatalabs.github.io/knowledge-extract-ffnn-mnist/">Live demo</a> · <a href="https://github.com/cleardatalabs/knowledge-extract-ffnn-mnist">Source on GitHub</a></em></p>

<hr />

<p>Iterative input adaptation is a technique for visualizing what a neural network has learned: start with a blank image, randomly perturb individual pixels, and keep changes that increase the network’s confidence for a target class. The result is a DeepDream-style image that reveals the network’s internal concept of each digit — implemented here in pure JavaScript with no ML libraries.</p>

<h2 id="recap">Recap</h2>

<p>In <a href="/articles/knowledge-extract-chapter-1/">Chapter 1</a>, we extracted knowledge from a feed-forward neural network by computing the causal index — a weighted sum of paths from each input pixel to each output class. The result was a set of static heat maps that reveal which pixels matter most for each digit.</p>

<p>That approach is fast and analytical, but it has a limitation: it only considers the linear contribution of weights, ignoring the non-linear activation functions that make neural networks powerful. Can we do better?</p>

<h2 id="a-different-approach-optimizing-the-input">A Different Approach: Optimizing the Input</h2>

<p>Instead of analyzing weights directly, we can take a completely different path: start with a random (mostly blank) image and iteratively modify it until the network confidently classifies it as a specific digit. This is conceptually similar to Google’s <a href="https://en.wikipedia.org/wiki/DeepDream">DeepDream</a>, though applied to a much simpler network.</p>

<p>The idea is straightforward:</p>

<ol>
  <li>Start with a near-blank 28x28 image (pixel values near -1, with small random noise).</li>
  <li>Randomly perturb a single pixel by a small amount.</li>
  <li>Run the modified image through the network and check the output probability for the target digit.</li>
  <li>If the probability improved (i.e., the error decreased), keep the change. Otherwise, discard it.</li>
  <li>Repeat thousands of times.</li>
</ol>

<p>This is essentially a form of <strong>stochastic hill climbing</strong> — a simple optimization technique that doesn’t require computing gradients.</p>
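<p>Step 2 is the only randomized part of the loop. A standalone sketch (the step size and clamping here are assumptions, not the demo’s exact parameters):</p>

```javascript
// Hypothetical single-pixel perturbation: copy the current image and nudge
// one random pixel by a small amount, clamped to the [-1, 1] input range.
function perturb(inputs) {
  const candidate = inputs.slice(); // keep the current best image untouched
  const i = Math.floor(Math.random() * candidate.length);
  const delta = (Math.random() - 0.5) * 0.2; // small random step (assumed size)
  candidate[i] = Math.max(-1, Math.min(1, candidate[i] + delta));
  return candidate;
}
```

<p>Keeping a change only when the error improves (step 4) means the image can never get worse under the chosen error function — which is exactly what makes this hill climbing.</p>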

<h2 id="the-error-function">The Error Function</h2>

<p>The core of the optimization is the error function. At its simplest, the error for a target digit <code class="language-plaintext highlighter-rouge">k</code> is:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>error = 1 - network_output[k]
</code></pre></div></div>

<p>We want to minimize this, meaning we want the network’s confidence in digit <code class="language-plaintext highlighter-rouge">k</code> to approach 1.0.</p>

<h3 id="optional-smoothness-regularization">Optional: Smoothness Regularization</h3>

<p>Raw optimization can produce noisy, speckled images. To encourage smoother, more natural-looking results, we add a regularization term that penalizes pixels that differ significantly from their neighbors:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>regularization = sum of (pixel - average_of_neighbors)^2
</code></pre></div></div>

<p>The final error becomes <code class="language-plaintext highlighter-rouge">error + lambda * regularization</code>, where <code class="language-plaintext highlighter-rouge">lambda</code> is a small weighting factor. In the demo, this is controlled by the “Force Smoothness” checkbox.</p>
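<p>As a concrete sketch of the penalty (the 4-connected neighborhood and edge handling here are assumptions; the demo’s <code class="language-plaintext highlighter-rouge">calcErr</code> may differ):</p>

```javascript
// Hedged sketch of the smoothness penalty on a 28x28 image stored as a flat
// array: compare each pixel to the average of its 4-connected neighbors and
// sum the squared differences.
const W = 28, H = 28;

function smoothnessPenalty(inputs) {
  let sum = 0;
  for (let y = 0; y < H; y++) {
    for (let x = 0; x < W; x++) {
      let acc = 0, n = 0;
      if (x > 0)     { acc += inputs[y * W + x - 1]; n++; }
      if (x < W - 1) { acc += inputs[y * W + x + 1]; n++; }
      if (y > 0)     { acc += inputs[(y - 1) * W + x]; n++; }
      if (y < H - 1) { acc += inputs[(y + 1) * W + x]; n++; }
      const diff = inputs[y * W + x] - acc / n;
      sum += diff * diff;
    }
  }
  return sum;
}

// Combined objective: error = (1 - networkOutput[k]) + lambda * smoothnessPenalty(inputs)
```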

<h2 id="the-implementation">The Implementation</h2>

<p>The <code class="language-plaintext highlighter-rouge">Model</code> object handles the full optimization loop:</p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">initInputs()</code> — creates a near-blank starting image</li>
  <li><code class="language-plaintext highlighter-rouge">inputsChanged()</code> — generates a candidate image by randomly perturbing one pixel</li>
  <li><code class="language-plaintext highlighter-rouge">calcErr(inputs, out)</code> — computes the error (with optional regularization)</li>
  <li><code class="language-plaintext highlighter-rouge">run()</code> — the main loop: tries 100 random perturbations per frame, keeps improvements, redraws, and schedules the next frame</li>
</ul>

<div class="language-javascript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">this</span><span class="p">.</span><span class="nx">run</span> <span class="o">=</span> <span class="kd">function</span> <span class="p">()</span> <span class="p">{</span>
    <span class="kd">var</span> <span class="nx">out</span> <span class="o">=</span> <span class="nx">selectedDigit</span><span class="p">;</span>
    <span class="kd">var</span> <span class="nx">bestErr</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">calcErr</span><span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">inputs</span><span class="p">,</span> <span class="nx">out</span><span class="p">);</span>

    <span class="k">for</span> <span class="p">(</span><span class="kd">var</span> <span class="nx">i</span> <span class="o">=</span> <span class="mi">0</span><span class="p">;</span> <span class="nx">i</span> <span class="o">&lt;</span> <span class="mi">100</span><span class="p">;</span> <span class="nx">i</span><span class="o">++</span><span class="p">)</span> <span class="p">{</span>
        <span class="kd">var</span> <span class="nx">newInputs</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">inputsChanged</span><span class="p">();</span>
        <span class="kd">var</span> <span class="nx">newErr</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">calcErr</span><span class="p">(</span><span class="nx">newInputs</span><span class="p">,</span> <span class="nx">out</span><span class="p">);</span>
        <span class="k">if</span> <span class="p">(</span><span class="nx">newErr</span> <span class="o">&lt;</span> <span class="nx">bestErr</span><span class="p">)</span> <span class="p">{</span>
            <span class="nx">bestErr</span> <span class="o">=</span> <span class="nx">newErr</span><span class="p">;</span>
            <span class="k">this</span><span class="p">.</span><span class="nx">inputs</span> <span class="o">=</span> <span class="nx">newInputs</span><span class="p">;</span>
        <span class="p">}</span>
    <span class="p">}</span>

    <span class="k">this</span><span class="p">.</span><span class="nx">draw</span><span class="p">();</span>
    <span class="nx">setTimeout</span><span class="p">(</span><span class="kd">function</span> <span class="p">()</span> <span class="p">{</span> <span class="nb">self</span><span class="p">.</span><span class="nx">run</span><span class="p">();</span> <span class="p">},</span> <span class="mi">0</span><span class="p">);</span>
<span class="p">};</span>
</code></pre></div></div>

<h2 id="results">Results</h2>

<p>After several seconds of running, recognizable digit shapes emerge from the noise. With smoothness regularization enabled, the images are cleaner and more closely resemble human handwriting. Without it, the network finds noisier patterns that still achieve high confidence — revealing the kinds of subtle pixel arrangements the network responds to, even if they don’t look natural to us.</p>

<p>Try it yourself in the <a href="https://cleardatalabs.github.io/knowledge-extract-ffnn-mnist/">interactive demo</a>. Select a digit, click Run, and watch the image evolve in real time.</p>

<h2 id="takeaways">Takeaways</h2>

<p>This approach demonstrates that even a simple optimization strategy (no gradients, no backpropagation through the input) can reveal what a neural network has learned. The generated images serve as a form of <strong>model visualization</strong> — they show us the network’s internal concept of each digit.</p>

<p>Comparing the two chapters: the causal index method (Chapter 1) gives a quick analytical snapshot, while iterative adaptation (Chapter 2) lets the network actively construct its ideal input. Both are valuable perspectives on the same model.</p>

<hr />

<div class="post-nav">
  <a href="/articles/knowledge-extract-chapter-1/">&larr; Chapter 1: Causal Index</a>
  <span></span>
</div>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[A DeepDream-style technique in pure JavaScript: optimize a blank image until a neural network confidently sees a digit. Visualize what an MNIST network has learned.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/mnist3.jpg" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/mnist3.jpg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Seeing in Cells: How Handwriting Becomes Input for a Neural Network</title><link href="https://cleardatalabs.com/articles/seeing-in-cells/" rel="alternate" type="text/html" title="Seeing in Cells: How Handwriting Becomes Input for a Neural Network" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/seeing-in-cells</id><content type="html" xml:base="https://cleardatalabs.com/articles/seeing-in-cells/"><![CDATA[<p><em>This is Part 1 of the <a href="/articles/hwrjs-handwriting-recognition-in-the-browser/">hwrjs series</a> — a handwriting recognizer built from scratch in TypeScript. <a href="https://cleardatalabs.github.io/hwrjs/">Live demo</a> · <a href="https://github.com/cleardatalabs/hwrjs">Source on GitHub</a></em></p>

<hr />

<p>Before a neural network can classify handwriting, raw pen strokes must be converted into a fixed-size numerical input. This article implements that preprocessing step from scratch in TypeScript: normalizing variable-length canvas coordinates into a scale-invariant 12×12 binary grid — 144 numbers that encode shape, not position or size.</p>

<p>The approach runs entirely in the browser — no cloud API, no Python runtime, no GPU — just JavaScript, a canvas element, and a few hundred lines of arithmetic. But before the neural network can do anything useful, it faces a deceptively hard problem: handwriting doesn’t fit in a box. You draw an “A” in the top-left corner of the canvas, your friend draws the same letter twice as large and centered. Raw pixel coordinates are useless — they encode position and scale, not shape. The network needs something invariant to where and how big you drew.</p>

<p>This article is about how we solve that. It covers the journey from raw canvas strokes to the 144-number array that actually feeds the network. If you want to follow along in code, the full project is at <a href="https://github.com/cleardatalabs/hwrjs">github.com/cleardatalabs/hwrjs</a>, with a <a href="https://cleardatalabs.github.io/hwrjs/">live demo here</a>.</p>

<hr />

<h2 id="the-raw-material-a-list-of-x-y-points">The raw material: a list of (x, y) points</h2>

<p>When you drag your mouse across the canvas, the <code class="language-plaintext highlighter-rouge">DrawingComponent</code> fires on every <code class="language-plaintext highlighter-rouge">mousemove</code> event. Each event produces a single coordinate:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// drawing.component.ts</span>
<span class="nx">onMove</span><span class="p">(</span><span class="nx">src</span><span class="p">:</span> <span class="nx">MouseEvent</span><span class="p">)</span> <span class="p">{</span>
  <span class="k">if</span> <span class="p">(</span><span class="o">!</span><span class="k">this</span><span class="p">.</span><span class="nx">isDrawing</span><span class="p">)</span> <span class="p">{</span> <span class="k">return</span><span class="p">;</span> <span class="p">}</span>
  <span class="kd">const</span> <span class="nx">rect</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">canvasRef</span><span class="p">.</span><span class="nx">nativeElement</span><span class="p">.</span><span class="nx">getBoundingClientRect</span><span class="p">();</span>
  <span class="k">this</span><span class="p">.</span><span class="nx">draw</span><span class="p">({</span>
    <span class="na">x</span><span class="p">:</span> <span class="nx">src</span><span class="p">.</span><span class="nx">clientX</span> <span class="o">-</span> <span class="nx">rect</span><span class="p">.</span><span class="nx">left</span><span class="p">,</span>
    <span class="na">y</span><span class="p">:</span> <span class="nx">src</span><span class="p">.</span><span class="nx">clientY</span> <span class="o">-</span> <span class="nx">rect</span><span class="p">.</span><span class="nx">top</span>
  <span class="p">});</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The subtraction of <code class="language-plaintext highlighter-rouge">rect.left</code> / <code class="language-plaintext highlighter-rouge">rect.top</code> converts from viewport coordinates to canvas-local coordinates. Each point is pushed into <code class="language-plaintext highlighter-rouge">samplesService.points</code>, which at the end of a stroke looks something like:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>[{x:112, y:48}, {x:114, y:51}, {x:117, y:58}, {x:120, y:68}, ...]
</code></pre></div></div>

<p>This list is the raw material. It captures every position your pen visited, in the order you visited them. It knows nothing about what letter you intended.</p>

<hr />

<h2 id="the-problem-with-raw-coordinates">The problem with raw coordinates</h2>

<p>Imagine training a network on these raw (x, y) pairs. Two people draw the letter “A”:</p>

<table>
  <thead>
    <tr>
      <th>Person</th>
      <th>x range</th>
      <th>y range</th>
      <th>Number of points</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Alice</td>
      <td>100–180</td>
      <td>40–140</td>
      <td>87 points</td>
    </tr>
    <tr>
      <td>Bob</td>
      <td>220–310</td>
      <td>180–290</td>
      <td>143 points</td>
    </tr>
  </tbody>
</table>

<p>Alice drew small and in the upper-left. Bob drew large and in the center. To a model trained on Alice’s coordinates, Bob’s “A” looks like a completely different object — same topology, totally different numbers.</p>

<p>We need a representation that strips out position, scale, and stroke density, leaving only shape. The answer is a fixed-size binary grid.</p>

<hr />

<h2 id="step-1-find-the-bounding-box">Step 1: find the bounding box</h2>

<p>Before we can project the drawing onto a grid, we need to know its extent. <code class="language-plaintext highlighter-rouge">SamplesService.getBoundingBox()</code> makes a single pass over the points:</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// samples.service.ts</span>
<span class="k">private</span> <span class="nx">getBoundingBox</span><span class="p">(</span><span class="nx">points</span><span class="p">:</span> <span class="nx">Point</span><span class="p">[]):</span> <span class="p">{</span> <span class="nl">minX</span><span class="p">:</span> <span class="kr">number</span><span class="p">;</span> <span class="nl">minY</span><span class="p">:</span> <span class="kr">number</span><span class="p">;</span> <span class="nl">maxX</span><span class="p">:</span> <span class="kr">number</span><span class="p">;</span> <span class="nl">maxY</span><span class="p">:</span> <span class="kr">number</span> <span class="p">}</span> <span class="p">{</span>
  <span class="kd">let</span> <span class="nx">maxX</span> <span class="o">=</span> <span class="o">-</span><span class="kc">Infinity</span><span class="p">,</span> <span class="nx">maxY</span> <span class="o">=</span> <span class="o">-</span><span class="kc">Infinity</span><span class="p">;</span>
  <span class="kd">let</span> <span class="nx">minX</span> <span class="o">=</span> <span class="kc">Infinity</span><span class="p">,</span> <span class="nx">minY</span> <span class="o">=</span> <span class="kc">Infinity</span><span class="p">;</span>
  <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">point</span> <span class="k">of</span> <span class="nx">points</span><span class="p">)</span> <span class="p">{</span>
    <span class="k">if</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">x</span> <span class="o">&gt;</span> <span class="nx">maxX</span><span class="p">)</span> <span class="p">{</span> <span class="nx">maxX</span> <span class="o">=</span> <span class="nx">point</span><span class="p">.</span><span class="nx">x</span><span class="p">;</span> <span class="p">}</span>
    <span class="k">if</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">x</span> <span class="o">&lt;</span> <span class="nx">minX</span><span class="p">)</span> <span class="p">{</span> <span class="nx">minX</span> <span class="o">=</span> <span class="nx">point</span><span class="p">.</span><span class="nx">x</span><span class="p">;</span> <span class="p">}</span>
    <span class="k">if</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">y</span> <span class="o">&gt;</span> <span class="nx">maxY</span><span class="p">)</span> <span class="p">{</span> <span class="nx">maxY</span> <span class="o">=</span> <span class="nx">point</span><span class="p">.</span><span class="nx">y</span><span class="p">;</span> <span class="p">}</span>
    <span class="k">if</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">y</span> <span class="o">&lt;</span> <span class="nx">minY</span><span class="p">)</span> <span class="p">{</span> <span class="nx">minY</span> <span class="o">=</span> <span class="nx">point</span><span class="p">.</span><span class="nx">y</span><span class="p">;</span> <span class="p">}</span>
  <span class="p">}</span>
  <span class="k">return</span> <span class="p">{</span> <span class="nx">minX</span><span class="p">,</span> <span class="nx">minY</span><span class="p">,</span> <span class="nx">maxX</span><span class="p">,</span> <span class="nx">maxY</span> <span class="p">};</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The bounding box is the tightest rectangle that contains every point you drew. For Alice’s “A” above, it would be roughly <code class="language-plaintext highlighter-rouge">{minX:100, minY:40, maxX:180, maxY:140}</code>.</p>

<hr />

<h2 id="step-2-project-onto-a-12--12-grid">Step 2: project onto a 12 × 12 grid</h2>

<p>With the bounding box in hand, every point can be scaled to fit in a 12-cell-wide, 12-cell-tall grid — regardless of where on the canvas it was drawn or how large it is.</p>

<div class="language-typescript highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// samples.service.ts</span>
<span class="nx">gridFromSample</span><span class="p">(</span><span class="nx">sample</span><span class="p">:</span> <span class="nx">Sample</span><span class="p">):</span> <span class="kr">number</span><span class="p">[]</span> <span class="p">{</span>
  <span class="kd">const</span> <span class="p">{</span> <span class="nx">minX</span><span class="p">,</span> <span class="nx">minY</span><span class="p">,</span> <span class="nx">maxX</span><span class="p">,</span> <span class="nx">maxY</span> <span class="p">}</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">getBoundingBox</span><span class="p">(</span><span class="nx">sample</span><span class="p">.</span><span class="nx">points</span><span class="p">);</span>

  <span class="kd">const</span> <span class="nx">gridPoints</span> <span class="o">=</span> <span class="k">new</span> <span class="nb">Array</span><span class="p">(</span><span class="k">this</span><span class="p">.</span><span class="nx">sensorWidth</span> <span class="o">*</span> <span class="k">this</span><span class="p">.</span><span class="nx">sensorHeight</span><span class="p">).</span><span class="nx">fill</span><span class="p">(</span><span class="mi">0</span><span class="p">);</span>

  <span class="k">for</span> <span class="p">(</span><span class="kd">const</span> <span class="nx">point</span> <span class="k">of</span> <span class="nx">sample</span><span class="p">.</span><span class="nx">points</span><span class="p">)</span> <span class="p">{</span>
    <span class="kd">const</span> <span class="nx">gridX</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">sensorWidth</span>  <span class="o">*</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">x</span> <span class="o">-</span> <span class="nx">minX</span><span class="p">)</span> <span class="o">/</span> <span class="p">(</span><span class="nx">maxX</span> <span class="o">-</span> <span class="nx">minX</span> <span class="o">+</span> <span class="mi">1</span><span class="p">);</span>
    <span class="kd">const</span> <span class="nx">gridY</span> <span class="o">=</span> <span class="k">this</span><span class="p">.</span><span class="nx">sensorHeight</span> <span class="o">*</span> <span class="p">(</span><span class="nx">point</span><span class="p">.</span><span class="nx">y</span> <span class="o">-</span> <span class="nx">minY</span><span class="p">)</span> <span class="o">/</span> <span class="p">(</span><span class="nx">maxY</span> <span class="o">-</span> <span class="nx">minY</span> <span class="o">+</span> <span class="mi">1</span><span class="p">);</span>
    <span class="nx">gridPoints</span><span class="p">[</span><span class="nb">Math</span><span class="p">.</span><span class="nx">floor</span><span class="p">(</span><span class="nx">gridX</span><span class="p">)</span> <span class="o">+</span> <span class="k">this</span><span class="p">.</span><span class="nx">sensorWidth</span> <span class="o">*</span> <span class="nb">Math</span><span class="p">.</span><span class="nx">floor</span><span class="p">(</span><span class="nx">gridY</span><span class="p">)]</span> <span class="o">=</span> <span class="mi">1</span><span class="p">;</span>
  <span class="p">}</span>

  <span class="k">return</span> <span class="nx">gridPoints</span><span class="p">;</span>
<span class="p">}</span>
</code></pre></div></div>

<p>The formula <code class="language-plaintext highlighter-rouge">sensorWidth * (point.x - minX) / (maxX - minX + 1)</code> maps the x coordinate linearly from <code class="language-plaintext highlighter-rouge">[minX, maxX]</code> to <code class="language-plaintext highlighter-rouge">[0, sensorWidth)</code>. The <code class="language-plaintext highlighter-rouge">+1</code> in the denominator does two jobs: it prevents a division by zero when the drawing is a single vertical or horizontal line (where <code class="language-plaintext highlighter-rouge">maxX == minX</code>), and it keeps the result strictly below <code class="language-plaintext highlighter-rouge">sensorWidth</code>, so <code class="language-plaintext highlighter-rouge">Math.floor</code> never produces an out-of-range index for points on the far edge of the bounding box.</p>

<p><code class="language-plaintext highlighter-rouge">Math.floor(gridX) + sensorWidth * Math.floor(gridY)</code> converts the 2D grid coordinate into a flat array index: column + 12 * row.</p>

<p>Every grid cell that any stroke passed through gets set to <code class="language-plaintext highlighter-rouge">1</code>. Everything else stays <code class="language-plaintext highlighter-rouge">0</code>.</p>
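<p>The position invariance can be checked directly. Below is a standalone version of the same projection (with the bounding-box pass inlined and <code class="language-plaintext highlighter-rouge">sensorWidth = sensorHeight = 12</code> assumed): the same stroke drawn at two different canvas positions yields an identical 144-element grid.</p>

```javascript
// Standalone sketch of the projection (the article's version is a class
// method using this.sensorWidth / this.sensorHeight; 12 is assumed here).
const SENSOR = 12;

function gridFromPoints(points) {
  let minX = Infinity, minY = Infinity, maxX = -Infinity, maxY = -Infinity;
  for (const p of points) {
    minX = Math.min(minX, p.x); maxX = Math.max(maxX, p.x);
    minY = Math.min(minY, p.y); maxY = Math.max(maxY, p.y);
  }
  const grid = new Array(SENSOR * SENSOR).fill(0);
  for (const p of points) {
    const gx = Math.floor(SENSOR * (p.x - minX) / (maxX - minX + 1));
    const gy = Math.floor(SENSOR * (p.y - minY) / (maxY - minY + 1));
    grid[gx + SENSOR * gy] = 1;
  }
  return grid;
}

// The same diagonal stroke at two canvas positions:
const stroke = (ox, oy) =>
  Array.from({ length: 20 }, (_, i) => ({ x: ox + 2 * i, y: oy + 2 * i }));
// gridFromPoints(stroke(100, 40)) and gridFromPoints(stroke(220, 180))
// are element-for-element identical.
```

<p>Rescaling a stroke is only nearly invariant, because the <code class="language-plaintext highlighter-rouge">+1</code> in the denominator shifts cell boundaries slightly at different sizes.</p>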

<hr />

<h2 id="what-the-grid-looks-like">What the grid looks like</h2>

<p>Here is a rough ASCII rendering of the letter “A” projected onto a 12 × 12 grid (only the first 8 of the 12 rows are shown):</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>. . . . . . 1 . . . . .
. . . . . 1 . 1 . . . .
. . . . 1 . . . 1 . . .
. . . 1 . . . . . 1 . .
. . 1 . . . . . . . 1 .
. 1 1 1 1 1 1 1 1 1 1 .
1 . . . . . . . . . . 1
1 . . . . . . . . . . 1
</code></pre></div></div>

<p>Each row is 12 cells. The full grid flattens to 144 numbers, each either 0 or 1. This 144-element array is what the neural network receives as input.</p>

<hr />

<h2 id="why-this-works-and-where-it-breaks-down">Why this works (and where it breaks down)</h2>

<p>The grid representation is surprisingly capable for its simplicity. A few reasons it holds up:</p>

<ul>
  <li><strong>Scale invariance</strong>: Alice’s tiny “A” and Bob’s large “A” both fit the bounding box exactly, so they produce nearly the same grid.</li>
  <li><strong>Position invariance</strong>: subtracting <code class="language-plaintext highlighter-rouge">minX</code> and <code class="language-plaintext highlighter-rouge">minY</code> recenters every drawing to the origin.</li>
  <li><strong>Fixed size</strong>: regardless of how many points you drew, the output is always 144 numbers — a requirement for a network with a fixed number of input neurons.</li>
</ul>
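<p>Both invariances can be confirmed empirically: scaling and translating a drawing leaves the grid essentially unchanged. In the sketch below (the projection helper mirrors the formula above; names are illustrative), the match is exact because the stroke is large relative to the <code class="language-plaintext highlighter-rouge">+1</code> offset in the divisor — for very small drawings the grids are merely near-identical:</p>

```typescript
interface Point { x: number; y: number; }
const SENSOR_WIDTH = 12;

function pointsToGrid(points: Point[]): number[] {
  const grid = new Array<number>(SENSOR_WIDTH * SENSOR_WIDTH).fill(0);
  const xs = points.map(p => p.x), ys = points.map(p => p.y);
  const minX = Math.min(...xs), maxX = Math.max(...xs);
  const minY = Math.min(...ys), maxY = Math.max(...ys);
  for (const p of points) {
    const gx = Math.floor(SENSOR_WIDTH * (p.x - minX) / (maxX - minX + 1));
    const gy = Math.floor(SENSOR_WIDTH * (p.y - minY) / (maxY - minY + 1));
    grid[gx + SENSOR_WIDTH * gy] = 1;
  }
  return grid;
}

// The same stroke: once near the origin, once scaled ×2 and shifted.
const small: Point[] = [{x: 0, y: 0}, {x: 50, y: 50}, {x: 100, y: 0}];
const large: Point[] = small.map(p => ({ x: p.x * 2 + 300, y: p.y * 2 + 100 }));
```

<p>Translation cancels exactly (it disappears when <code class="language-plaintext highlighter-rouge">minX</code>/<code class="language-plaintext highlighter-rouge">minY</code> are subtracted); only the scale change can nudge a point across a cell boundary.</p>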

<p>The limits are equally instructive:</p>

<ul>
  <li><strong>Rotation</strong>: a tilted “A” looks like a different letter.</li>
  <li><strong>Stroke order and direction</strong>: the grid is a spatial snapshot, not a temporal one. Two drawings that cover the same cells — no matter the order or direction of the strokes — produce identical grids.</li>
  <li><strong>Very similar letters</strong>: an “O” and a “0” will produce nearly identical grids unless the user has a strong, consistent style.</li>
</ul>
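<p>The stroke-order point is easy to demonstrate: since the projection only <em>sets</em> cells and never accumulates, reversing the point list produces a bit-for-bit identical grid. A sketch (the helper mirrors the formula above; names are illustrative):</p>

```typescript
interface Point { x: number; y: number; }
const SENSOR_WIDTH = 12;

function pointsToGrid(points: Point[]): number[] {
  const grid = new Array<number>(SENSOR_WIDTH * SENSOR_WIDTH).fill(0);
  const xs = points.map(p => p.x), ys = points.map(p => p.y);
  const minX = Math.min(...xs), maxX = Math.max(...xs);
  const minY = Math.min(...ys), maxY = Math.max(...ys);
  for (const p of points) {
    const gx = Math.floor(SENSOR_WIDTH * (p.x - minX) / (maxX - minX + 1));
    const gy = Math.floor(SENSOR_WIDTH * (p.y - minY) / (maxY - minY + 1));
    grid[gx + SENSOR_WIDTH * gy] = 1;
  }
  return grid;
}

// A stroke drawn left-to-right, and the same stroke drawn right-to-left.
const stroke: Point[] = [{x: 0, y: 0}, {x: 40, y: 80}, {x: 80, y: 0}];
const forward = pointsToGrid(stroke);
const backward = pointsToGrid([...stroke].reverse());
```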

<p>For a handwriting demo that trains on a single user’s samples and recognizes that same user’s characters, these limitations don’t matter much in practice. The grid is good enough — and simple enough to understand completely.</p>

<hr />

<h2 id="the-path-to-144-numbers">The path to 144 numbers</h2>

<p>Let’s summarize the full pipeline:</p>

<ol>
  <li>User draws on canvas → <code class="language-plaintext highlighter-rouge">DrawingComponent</code> collects <code class="language-plaintext highlighter-rouge">Point[]</code> on each <code class="language-plaintext highlighter-rouge">mousemove</code></li>
  <li>User clicks “Add” → <code class="language-plaintext highlighter-rouge">SamplesService.addSample()</code> stores the <code class="language-plaintext highlighter-rouge">{letter, points}</code> pair</li>
  <li>When building training data → <code class="language-plaintext highlighter-rouge">gridFromSample()</code> calls <code class="language-plaintext highlighter-rouge">getBoundingBox()</code>, then projects each point into a 12×12 binary grid</li>
  <li>Result: <code class="language-plaintext highlighter-rouge">number[]</code> of length 144, containing only <code class="language-plaintext highlighter-rouge">0</code> and <code class="language-plaintext highlighter-rouge">1</code></li>
</ol>

<p>The network never sees raw pixel coordinates. It sees a normalized, scale-invariant, fixed-size representation of the shape — which is the only thing it needs.</p>

<hr />

<div class="post-nav">
  <span></span>
  <a href="/articles/144-numbers-in-one-letter-out/">Part 2: 144 Numbers In, One Letter Out &rarr;</a>
</div>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[How to preprocess handwriting for a neural network: raw pen strokes become a scale-invariant 12×12 binary grid of 144 numbers — implemented in TypeScript, no libraries.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">What ClearDataLabs Is About</title><link href="https://cleardatalabs.com/articles/welcome-to-cleardatalabs/" rel="alternate" type="text/html" title="What ClearDataLabs Is About" /><published>2026-04-02T00:00:00+00:00</published><updated>2026-04-02T00:00:00+00:00</updated><id>https://cleardatalabs.com/articles/welcome-to-cleardatalabs</id><content type="html" xml:base="https://cleardatalabs.com/articles/welcome-to-cleardatalabs/"><![CDATA[<p>ClearDataLabs is an open-source educational project that explains AI concepts by building neural networks from scratch — no TensorFlow, no PyTorch, just TypeScript and a browser. Every article includes working code, and every demo runs client-side with no server.</p>

<p>Most machine learning tutorials hand you a library and a dataset. You call <code class="language-plaintext highlighter-rouge">model.fit()</code>, watch the accuracy climb, and walk away with a trained model and no particular understanding of what just happened.</p>

<p>That’s useful. But it’s not the only way to learn.</p>

<p>ClearDataLabs exists for the other approach: understand what’s actually going on. How do networks learn? How does data flow through layers? What does a trained model really “know”? These questions don’t have one-line answers — but they become a lot clearer when you can see the mechanics, interact with them, and sometimes build them yourself.</p>

<h2 id="why-fundamentals-matter">Why fundamentals matter</h2>

<p>AI is moving fast. New architectures, new training techniques, new applications — the landscape shifts constantly. But the core ideas behind all of it — gradient descent, loss functions, representation learning, optimization — remain remarkably stable. Understanding those fundamentals makes it easier to follow what’s new, evaluate what’s hype, and build on what actually works.</p>

<p>ClearDataLabs focuses on those fundamentals. Not to ignore the cutting edge, but because a solid foundation is the best tool for navigating it.</p>

<h2 id="the-projects-so-far">The projects so far</h2>

<p><strong><a href="/projects/">Handwriting Recognition in the Browser</a></strong> — a feedforward neural network that reads handwritten characters in real time. You draw a letter, the network classifies it. Trained and running entirely in the browser, with backpropagation implemented from first principles.</p>

<p><strong><a href="/projects/">Knowledge Extraction from a Neural Network</a></strong> — an MNIST digit classifier, trained and frozen, dissected two ways. First, by computing a causal index that measures how much each input pixel influenced each output class. Second, by optimizing a blank image until the network “sees” a specific digit in it — a browser-native version of DeepDream.</p>

<p>These are starting points. The project is a playground for experimenting with different architectures, training methods, and visualization techniques — open-source and not tied to any specific framework or brand.</p>

<h2 id="what-youll-find-here">What you’ll find here</h2>

<p>Articles that go deep on the ideas and their implementation. Not “here’s the concept,” but “here’s how it works, here’s the code, here’s where it breaks.” Written for anyone curious enough to want to understand AI, not just use it.</p>

<p>Start with the <a href="/projects/">projects page</a> for the demos, or jump straight into <a href="/articles/">the articles</a>.</p>]]></content><author><name>Kostiantyn Chumychkin</name></author><summary type="html"><![CDATA[ClearDataLabs is an open-source project that explains AI and neural networks through interactive browser demos and in-depth articles — building everything from scratch, no ML libraries.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://cleardatalabs.com/assets/img/ai.gif" /><media:content medium="image" url="https://cleardatalabs.com/assets/img/ai.gif" xmlns:media="http://search.yahoo.com/mrss/" /></entry></feed>