Extrian

Research

Voice is the next capability overhang.

We are entering a multi-species civilization. Humans and AIs will work together, think together, live alongside each other. Voice is the interface for that world—the highest-bandwidth channel we have, and the one that every human already shares.

Extrian is an applied audio research lab. We build datasets, models, and evaluation frameworks for end-to-end speech systems—preserving the full dimensionality of human expression across languages, dialects, and speaking styles. Our work sits at the foundation of the next generation of voice-native AI.

This is the interface between species.

Blog

Notes from the lab.

April 2026

Voice is the next capability overhang.

Notes from the lab.

Epoch — Unlocking S2S

Evaluation Gaps in Speech-to-Speech Generation

On the Role of Data in the S2S Capability Overhang

Toward Conversational Coherence in Audio-Native Models