FIG 7.1 — Glossary · Reference
Glossary.
Definitions of the technical vocabulary used across CrowLingo — bioacoustics, AI audio models, and corvid behavior. One sentence each. Click any term's deep link to read the methods page where it's explained at length.
- Embedding
- A learned vector representation of an input. For crow audio: a 1,024-number list a model produces for one clip; similar clips → nearby vectors.
- Latent space
- The high-dimensional space embeddings live in. Geometry inside it approximates acoustic similarity — close = similar, far = different.
- UMAP
- Uniform Manifold Approximation and Projection. A non-linear dimensionality reducer that flattens 1,024-dim embeddings to 2-D for visualization.
- HDBSCAN
- Hierarchical Density-Based Spatial Clustering of Applications with Noise. Finds dense clusters of arbitrary shape without requiring a target cluster count.
- Self-supervised learning (SSL)
- Training paradigm where a model creates its own supervision signal from unlabeled data — e.g. by predicting masked parts of a spectrogram from the rest.
- NatureLM-audio
- Earth Species Project's audio-language foundation model for bioacoustics (ICLR 2025). Audio in, natural language out. Zero-shot species + behavioral classification.
- BirdNET
- Open-source deep-learning bird-sound classifier from Cornell Lab + Chemnitz UT. The workhorse for avian detection in long recordings.
- Perch
- Google's bioacoustics embedding model (PANNs/PaSST lineage). Stronger within-species detail than BirdNET; broader audio coverage than NatureLM-audio.
- Syrinx
- The bird vocal organ. Located at the bifurcation of the trachea into the bronchi. Two independent sound sources, which is why birds can make two notes at once.
- Spectrogram
- A time-frequency-intensity plot of an audio clip. Time on the x-axis, frequency on the y, color = energy at (t, f).
- Graded calls
- Calls that vary continuously along acoustic dimensions rather than belonging to discrete categories. Appear as bridges between clusters on the vocal map.
- Dialect
- Group-level acoustic variation in calls. Crow family groups show measurable centroid differences in their shared call types — early evidence for crow dialect.
- Syntax (in animal communication)
- Combinatorial structure where call order carries meaning beyond the sum of parts. For crows: statistical regularities exist; behavioral confirmation is thin.