Fractal Neurons ● research

an AI that thinks in fractals

Fractal Neurons is the project everything else orbits. Instead of stacking transformer blocks, it aggregates information bottom-up through a parameter-shared f-ary tree — a fractal. The same small set of weights is reused at every level, so the model reaches 65k+ runtime nodes at roughly 70M parameters. Depth and fan-out become knobs instead of cost.

Around that core I built a whole organism: a Fractal Memory Matrix (FMM) that grows and prunes its own nodes, a quantum-inspired processing hook that treats time as a signal, and a local Mixture-of-Experts that only fires the experts it needs. The hard part was never the idea — it was making it fit. So I wrote Unified Fractal Memory, which spans GPU VRAM, pinned RAM, and NVMe as one pool, prefetching and evicting subgraphs so a single card behaves like a mini-cluster.

It's byte-level, so it has no vocabulary to be trapped by — it learns from raw bytes of anything. It ships with a 40-mode control menu, a capacity planner, an autopilot for the 4090, and an agent swarm that generates its own training data. Written alone, at night, in Saguenay. It is the proof that frontier-shaped AI research doesn't require a frontier-sized lab.

// highlights

Parameter-shared fractal backbone — 65k+ runtime nodes at ~70M params
Fractal Memory Matrix (FMM): self-organizing, growing/pruning memory
Unified Fractal Memory (UFM): VRAM + RAM + NVMe as one elastic pool
Byte-level modeling — no fixed vocabulary
Local MoE with top-k routing + load-balance telemetry
Capacity planner, Pareto auto-profiler, and a 4090 autopilot

// stack

PyTorch
CUDA
byte-level
MoE
Ollama

// highlights

// stack

Follow this work.