Introduction

Overview

The KeenASR Software Development Kit provides on-device automatic speech recognition functionality for mobile devices running iOS, Android, ChromeOS, and Linux.

Speech recognition is performed locally on the device; no internet connectivity or cloud support is required. The SDK is based on a state-of-the-art Deep Neural Network decoder and includes acoustic models.

The SDK provides an API for creating language models on the device for small to medium-size recognition tasks (up to several thousand words). For large-vocabulary, dictation-type tasks, language models and decoding graphs can be created ahead of time and used by the SDK. For domain-specific large-vocabulary use cases, large amounts of domain-specific text data might be required to build appropriate language models.

The SDK also supports trigger phrase functionality, which is viable for sessions up to two hours. The trigger phrase implementation does not currently support always-on (24x7) listening use cases, especially on battery-powered devices.

Platforms and Development Tools Support

In addition to the Objective-C iOS framework and the Android Java AAR SDK, we provide instructions on how to use the Objective-C framework in Swift. We also provide a Unity plugin, which offers a C# interface to both the iOS and the Android versions of the SDK. The SDK can also be integrated into a Linux backend, either via a C++ library or a Python module.

Keen Research customers can also leverage Dashboard, a cloud-based tool for on-device ASR development support. KeenASR, in conjunction with Dashboard, provides seamless synchronization of on-device data, cloud-based access to audio recordings and speech recognition metadata for further analysis and debugging, the ability to transcribe and score responses, and more.

Trial Version of the SDK

Keen Research provides a trial version of the SDK for both iOS and Android, as well as an English ASR Bundle that can be used with the trial SDKs.

Note: The trial version of the SDK is fully functional, but it will run for only 15 minutes at a time. After 15 minutes it will ‘crash’ the app. There is no limit on how many times you can run the app. For commercial licensing inquiries, contact us.

Tip: You can evaluate the trial version of the SDK in your own app, or you can review and test the iOS (Objective-C, Swift) or Android proof-of-concept apps that we provide on GitHub. If you already have a test set of transcribed recordings, we can provide OS X or Linux command-line utilities for batch evaluation.

Language Support

Currently, the SDK supports English, Spanish, German, and French out of the box. Additional ASR Bundles for most major spoken languages can be provided upon request within 6-8 weeks.

Keen Research can also provide customized ASR Bundles that work better in specific acoustic environments or with specific populations, or that have a smaller memory footprint and lower CPU utilization.

To get started, we suggest you review the Getting Started and Glossary pages, and then move on to the documentation for your platform of interest.