HylloAI

Powerful Features

Advanced capabilities that set Hyllo apart from traditional speech recognition

On-Device Privacy

Your voice never leaves the device. Transcription runs entirely offline.

Multilingual Support

Highly accurate transcription for ~100 languages and dialects.

Smart Speaker Separation

Automatically distinguish 8+ speakers, ideal for meetings & interviews.

Word-Level Timestamps

Millisecond-accurate timestamps for every word, enabling frame-perfect audio editing.

LLM Orchestration

Seamlessly integrate LLMs for summarization and Q&A.

NPU-Optimized Engine

10x faster than CPU processing with Apple Neural Engine support.

Core Capabilities

Detailed explanation of Hyllo's core features

Speech Recognition

Hyllo uses advanced open-source models to convert speech to text with industry-leading accuracy.

  • Support for ~100 languages and dialects
  • State-of-the-art speech recognition models, including Whisper, SenseVoice, and more
  • Highly accurate for domain-specific vocabulary in medical, legal, and technical fields

Speaker Diarization

Automatically identify and label different speakers in a conversation.

  • Distinguish 8 or more unique speakers
  • 95% accuracy under ideal conditions
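
To illustrate how speaker labels and word timings can fit together, here is a minimal Python sketch. It is not Hyllo's implementation: the `Word` and `SpeakerTurn` types and the `label_words` function are hypothetical names, and the sketch assumes the diarizer returns speaker turns as time ranges in seconds. Each word is assigned to the turn it overlaps the most.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class SpeakerTurn:
    speaker: str
    start: float  # seconds
    end: float    # seconds


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time ranges (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_words(words: list[Word], turns: list[SpeakerTurn]) -> list[tuple[str, str]]:
    """Pair each word with the speaker whose turn overlaps it the most."""
    labeled = []
    for w in words:
        best_speaker, best_overlap = "unknown", 0.0
        for t in turns:
            o = overlap(w.start, w.end, t.start, t.end)
            if o > best_overlap:
                best_speaker, best_overlap = t.speaker, o
        labeled.append((best_speaker, w.text))
    return labeled
```

Words that fall outside every turn are labeled "unknown" rather than guessed, which keeps gaps in the diarization visible.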

Word-Level Timestamps

Precise timing information for each word in the transcription.

  • Millisecond-accurate word timing, perfect for video editing and captioning
  • Efficient, accurate audio navigation and playback driven by the transcript
  • Export to various caption formats
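
As an illustration of caption export, the sketch below turns word-level timestamps into SubRip (SRT) cues. It is an assumption-laden example, not Hyllo's exporter: the input shape (a list of `(text, start, end)` tuples in seconds) and the names `srt_time` and `words_to_srt` are hypothetical, and words are simply grouped into cues of a fixed size.

```python
def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[tuple[str, float, float]], max_words: int = 7) -> str:
    """Group words into cues of at most max_words and render them as SRT text."""
    cues = []
    for i in range(0, len(words), max_words):
        group = words[i:i + max_words]
        start = group[0][1]   # start time of the first word in the cue
        end = group[-1][2]    # end time of the last word in the cue
        text = " ".join(word for word, _, _ in group)
        cues.append(f"{len(cues) + 1}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(cues)
```

A real exporter would likely also break cues at pauses or sentence boundaries; fixed-size grouping is used here only to keep the example short.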

LLM Integration

Connect with leading LLMs to enhance functionality.

  • Translation to other languages
  • Automatic summarization of content
  • Q&A generation from transcriptions
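
Long transcripts usually need to be split before they are handed to an LLM. The following sketch shows one simple way to do that, assuming the transcript is a list of text lines and the limit is expressed in characters. The names `chunk_transcript` and `summary_prompt` and the prompt wording are hypothetical; no particular LLM API is implied.

```python
def chunk_transcript(lines: list[str], max_chars: int = 2000) -> list[str]:
    """Pack transcript lines into chunks of roughly max_chars characters.

    Lines are never split, so a single line longer than max_chars
    becomes its own oversized chunk.
    """
    chunks, current, size = [], [], 0
    for line in lines:
        # +1 accounts for the newline that joins lines within a chunk
        if current and size + len(line) + 1 > max_chars:
            chunks.append("\n".join(current))
            current, size = [], 0
        current.append(line)
        size += len(line) + 1
    if current:
        chunks.append("\n".join(current))
    return chunks


def summary_prompt(chunk: str) -> str:
    """Build a summarization prompt for one transcript chunk."""
    return "Summarize the following transcript excerpt in three bullet points:\n\n" + chunk
```

Each chunk's prompt can then be sent to whichever model is configured, and the per-chunk summaries combined in a final pass.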

Model Comparison

See how our different speech recognition models compare

Feature              Whisper-Base   Whisper-Large-V3-Turbo   SenseVoice-Small
Accuracy
Processing Speed
Languages Supported  50+            50+                      5

Whisper Models

  • Open-sourced by OpenAI. Please refer to the GitHub repository for more information.
  • Highly accurate for English, Dutch, Spanish, Italian, German, Russian, Portuguese, and other languages.

SenseVoice Models

  • Open-sourced by Alibaba. Please refer to the GitHub repository for more information.
  • Supports Mandarin, Cantonese, English, Japanese, and Korean.

Ready to Experience Hyllo?

Download now and transform how you work with audio