Latest News:
less ▲
|
-
Tech. Report RTI-20240917-01: "Spontaneous Informal Speech Dataset for Punctuation Restoration"
-
Tech. Report RTI-20240822-01: "MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX and NDX European Call Option Pricing"
-
Recognition Technologies, Inc. June 13, 2024 Westchester County’s Tech Accelerator Demo Day
-
Tech. Report RTI-20240524-01: "Carnatic Raga Identification System using Rigorous Time-Delay Neural Network"
-
IEEE IMCOM (2024): Efficient Ensemble for Multimodal Punctuation Restoration using Time-Delay Neural Network
-
Tech. Report RTI-20230828-01: "Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset"
-
A Damage Assessment Methodology for Structural Systems using Transfer Learning from the Audio Domain, Mechanical Systems and Signal Processing Journal (MSSP), Vol. 195, Jul. 15, 23, 2023 (online: Mar. 23, 2023)
-
Technical Report RTI-20230224-01: "Efficient Ensemble Architecture for Multimodal Acoustic and Textual Embeddings in Punctuation Restoration using Time-Delay Neural Networks"
-
Technical Report RTI-20230131-01: "A Transaction Represented with Weighted Finite-State Transducers"
-
Technical Report RTI-20220519-02: "Bi-LSTM Scoring Based Similarity Measurement with Agglomerative Hierarchical Clustering (AHC) for Speaker Diarization"
-
IMAC Best Paper Award: Dynamics of Civil Structures Technical Division, 2021, for
"Transfer Learning from Audio Domains a Valuable Tool for Structural Health Monitoring"
-
Technical Report RTI-20220520-01: "Modernizing Open-Set Speech Language Identification"
-
Technical Report RTI-20220519-01: "Automatic Spoken Language Identification using a Time-Delay Neural Network"
-
Homayoon Beigi, "Emotion Detection using Transfer Learning from Speech and Speaker Recognition Deep Neural Net Models," IEEE/ASME/SME Joint Webinar, December 2, 2020
-
Homayoon Beigi, "Structural and Machine Health Monitoring through the Application of Speaker Recognition Techniques," IEEE/ASME/SME Joint Meeting, March 26, 2019
-
Bosch shows off the Recognition Technologies, Inc. RecoMadeEasy®
Embedded Speech Recognition, Speaker Recognition, and Face Recognition engines in their intelligent in-vehicle infotainment (IVI) implementation on a GM Cadillac SUV
-
What's next for voice in the car? Q&A with ARM & Recognition Technologies
-
Reimagining Voice in the Driving Seat (Joint White Paper with ARM)
|
|
See Demonstration Videos ▼
|
RecoMadeEasy®Products
A single comprehensive software engine for enabling Speech, Speaker, Face, Object, Emotion Recognition, Translation, Access Controls, and much more, using a unified set of APIs designed for Integrators and Software Developers -- works standalone (Android and Linux) and in client/server mode
|
|
|
-
AudioVisual Recognition
(Embedded)
(Server Based)
(Combination of Speaker, Speech, Face Recognition, and Object Detection and Recognition with a single interface)
-
Large-Vocabulary Speech Recognition
(Embedded)
(Server Based)
Initially available for English, Spanish, Mandarin, Arabic, and German, is now available for 100+ languages
Also includes multilinguagl support and code-switching
(Customizable domain full transcription ~ 300,000+ word vocabulary)
-
Speaker Recognition
(Embedded)
(Server Based)
(Language- and Text-Independent, aka: Speaker Biometrics, Voice Biometrics, or SIV)
Recipient: Frost & Sullivan Award 2011
-
Face Recognition
(Embedded)
(Server Based)
(Face detection and recognition)
-
Object Recognition
(Embedded)
(Server Based)
(Object detection and recognition)
|
|