Research Foundations

VoxKit acts on (and interprets) findings from key research studies in speech pathology to support research teams and reduce technical barriers.

MFA Approaches Human-Level Reliability

Mahr et al., 2021

Key Finding

MFA-SAT reached 86% accuracy on child speech (ages 3-6), the only system approaching human interrater agreement.

VoxKit Implementation

VoxKit defaults to MFA while supporting alternative engines for comparative research.

Critical Consideration

MFA-SAT was trained on adult speech; researchers should validate performance for their own use case.

Phoneme Class Reliability Varies

Mahr et al., 2021

Key Finding

Vowels showed 83% accuracy across systems. Fricative accuracy improved significantly with child age (OR = 1.29/year).

VoxKit Implementation

VoxKit tracks alignment metadata and speaker age, enabling age-stratified accuracy analysis.

Critical Consideration

These patterns emerged from elicited single-word productions and may not generalize to spontaneous speech.

Clinical AI Often Overfits Small Datasets

Berisha & Liss, 2024

Key Finding

Most clinical speech datasets contain only minutes to hours of audio with uncertain labels, leading to poor generalization.

VoxKit Implementation

VoxKit tracks metadata and versioned provenance for transparency.

Critical Consideration

VoxKit attempts rigorous documentation but cannot solve fundamental overfitting, only ensuring accidents happen less.

Interpretable Measures Outperform Black Boxes

Berisha & Liss, 2024

Key Finding

Clinically grounded measures outperform opaque embeddings.

VoxKit Implementation

VoxKit allows you to test and review this, as AI becomes more advanced, methodologies may advance as well.

Critical Consideration

Alignment errors can propagate downstream. Researchers must validate that phonetic boundaries are reliable.

VoxKit's Research-Driven Approach

Guided workflows: Guidance and layout can be customized to fit the direction for specific studies/research

Flexible architecture: Explorative tools and pipeline steps can be reused and adapted with minimal wiring, this also contributes to a more collaborative mindset around research

Metadata-rich outputs: Automated metadata tracking enables reportable results and and reduces uncertainty around what was done

The WE mindset: We hope researchers find leverage in this tool, and moreover, contribute to this ecosystem so that research can become more collaborative, rather than everyone reinventing the wheel and working in silos

VoxKit prioritizes usability, flexibility, and transparency to empower researchers to push the cutting edge.