Introduction

Overview

The KeenASR Software Development Kit provides on-device automatic speech recognition functionality for mobile devices running iOS, Android, ChromeOS, and Linux operating systems. The JavaScript version of the SDK (keenasr-web) runs locally in the web browser, with no need to stream speech data to a backend.

Speech recognition is performed locally on the device; no internet connectivity or cloud support is required. The SDK is based on a state-of-the-art Deep Neural Network decoder and includes acoustic models.

The SDK provides an API for creating language models on the device for small to medium-size recognition tasks (up to several thousand words). For large-vocabulary dictation-type tasks, language models and decoding graphs can be created ahead of time and used by the SDK. For domain-specific large-vocabulary use cases, large amounts of domain-specific text data might be required to build appropriate language models.

The SDK also supports trigger phrase functionality, which is viable for sessions up to two hours. The trigger phrase implementation does not currently support always-on (24x7) listening use cases, especially on battery-powered devices.

Platforms and Development Tools Support

In addition to the Objective-C iOS framework, the Android Java AAR SDK, and the JavaScript library, we also provide instructions on how to use the Objective-C framework in Swift, as well as a Unity plugin that provides a C# interface to both the iOS and Android versions of the SDK. The SDK can also be integrated on a backend running Linux, either via a C++ library or a Python module.

Keen Research customers can also leverage Dashboard, a cloud-based tool that supports on-device ASR development. KeenASR, in conjunction with Dashboard, provides seamless synchronization of on-device data, cloud-based access to audio recordings and speech recognition metadata for further analysis and debugging, the ability to transcribe and score responses, etc.

Trial Version of the SDK

Keen Research provides a trial version of the SDK for both iOS and Android, as well as an English ASR Bundle that can be used with the trial SDKs.

Note: The trial version of the SDK is fully functional, but it will run for only 15 minutes at a time. After 15 minutes it will ‘crash’ the app. There is no limit on how many times you can run the app. For commercial licensing inquiries, contact us.

Tip: You can evaluate the trial version of the SDK in your own app, or you can review and test the iOS (Objective-C, Swift) or Android proof-of-concept apps that we provide on GitHub. If you already have a test set of transcribed recordings, we can provide OS X or Linux command-line utilities for batch evaluation.

Demos

Keen Research provides a set of interactive demos that showcase the capabilities of the KeenASR Web SDK across multiple use cases. Several demos highlight scenarios common in EdTech and frontline worker applications, demonstrating how on-device speech recognition can be integrated into specific use cases.

In addition, we offer a Developer Demo, which exposes a wide range of KeenASR SDK API methods through an intuitive graphical interface. Although implemented using the Web SDK, this demo is relevant to developers working on any supported platform – iOS, Android, Linux, or Unity – because it mirrors the structure and behavior of the core API. This makes it easy to explore SDK functionality, experiment with different setups, and gain familiarity before integrating the SDK into your own application.

Language Support

Currently, the SDK supports English, Spanish, German, and French out of the box. For English, we also offer models optimized for children’s voices. Additional ASR Bundles for most major spoken languages can be provided upon request, within 6-8 weeks.

Keen Research can also provide customized ASR Bundles that work better in specific acoustic environments or with specific populations, or that have a smaller memory footprint and lower CPU utilization.

To get started, we suggest you review the Getting Started and Glossary pages, and then move on to the documentation for your platform of interest.