Skip to main content
Carnegie Mellon University

Sphinx

Speech Processing

By Alexander Rudnickey

The Sphinx project is an umbrella for research in basic speech technologies. Current activities include systems for real-time recognition and multi-modal interaction. Research projects include multi-party conversation management, learning through spoken language, long-term user adaptation and the development of human-robot interfaces (such as avatars). Research in speech recognition includes out-of-vocabulary word detection and representation, and the use of conversational structure to enhance spoken term detection. The Sphinx recognition code-base and the Olympus dialog code-base are open-source and used by a number of projects in LTI, elsewhere in the university and by a large number of other sites.