The Crow · Repertoire Atlas
The vocal map.
795 vocalizations, 9 clusters, one space. Each dot is one clip, embedded by NatureLM-audio and projected to two dimensions with UMAP. Hover over a legend entry to isolate its cluster; click any point to read its context.
Inline glossary
What you're looking at, in a sentence or two apiece.
- Embedding: A learned vector representation of a clip. Here, 1,024 numbers per call.
- Latent space: The high-dimensional space in which embeddings live; geometry ≈ acoustic similarity.
- UMAP: A non-linear dimensionality reducer that flattens 1,024 dims to 2 for inspection.
- Cluster: A dense region in the embedding space. Here, found by HDBSCAN on the full 1,024-dim vectors.
- Bridge point: A point between two clusters indicating graded acoustic variation, not noise.
- Context: The behavior co-occurring with the call, joined from synchronized observation logs.
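To make the geometry behind "cluster" and "bridge point" concrete, here is a minimal Python sketch. It is not the atlas's actual method. It assumes toy 3-dimensional vectors in place of the 1,024-dimensional embeddings, summarizes each cluster by its centroid instead of running HDBSCAN, and scores a call by how evenly it sits between its two nearest centroids. The cluster names, the `bridge_margin` helper, and the `BRIDGE_THRESHOLD` cutoff are all hypothetical.

```python
import math
from statistics import fmean
from typing import Dict, List, Sequence, Tuple

# Hypothetical cutoff: margins below this count as "between two clusters".
BRIDGE_THRESHOLD = 0.1


def centroid(points: Sequence[Sequence[float]]) -> List[float]:
    """Mean position of a cluster's member embeddings, one value per dimension."""
    return [fmean(column) for column in zip(*points)]


def bridge_margin(
    point: Sequence[float], centroids: Dict[str, Sequence[float]]
) -> Tuple[str, str, float]:
    """Return (nearest cluster, second-nearest cluster, margin).

    margin = (d2 - d1) / (d2 + d1), where d1 <= d2 are the Euclidean
    distances to the two closest centroids. It is 0 when the point is
    equidistant from both and approaches 1 deep inside one cluster.
    """
    ranked = sorted((math.dist(point, c), name) for name, c in centroids.items())
    (d1, nearest), (d2, runner_up) = ranked[0], ranked[1]
    total = d1 + d2
    margin = (d2 - d1) / total if total > 0 else 0.0
    return nearest, runner_up, margin


def is_bridge(point: Sequence[float], centroids: Dict[str, Sequence[float]]) -> bool:
    """Flag a point whose two nearest clusters are almost equally close."""
    return bridge_margin(point, centroids)[2] < BRIDGE_THRESHOLD


# Toy data: two made-up clusters in a 3-dim stand-in for the latent space.
clusters = {
    "cluster_a": [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [-1.0, 0.0, 0.0]],
    "cluster_b": [[4.0, 0.0, 0.0], [5.0, 0.0, 0.0], [3.0, 0.0, 0.0]],
}
centroids = {name: centroid(members) for name, members in clusters.items()}

core_call = [0.5, 0.0, 0.0]    # sits well inside cluster_a
graded_call = [2.1, 0.0, 0.0]  # sits roughly halfway between the two

for label, call in (("core", core_call), ("graded", graded_call)):
    nearest, runner_up, margin = bridge_margin(call, centroids)
    print(f"{label}: nearest={nearest}, runner-up={runner_up}, "
          f"margin={margin:.2f}, bridge={is_bridge(call, centroids)}")
```

Centroids are a simplification: HDBSCAN clusters are density-based and need not be round, so on real embeddings a centroid distance is only a rough proxy for how close a call is to a cluster.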
The full methodology lives in Latent Space 101 and NatureLM-audio.