Skip to content
Pankaj edited this page Jun 9, 2017 · 5 revisions

Pocketsphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices. It is one of Carnegie Mellon University's open source large vocabulary, speaker-independent continuous speech recognition engine.
This repository is the ROS package for pocketsphinx which aims to provide ROS users with the same capabilities as offered by the original pocketsphinx.

Resources

Installation

  1. Install pyaudio
    sudo pip install pyaudio
    
    If this does not work, follow instructions below:
    sudo apt-get install libasound-dev
    sudo apt-get install python-pyaudio
    
  2. Install pocketsphinx: You will need to have pip pre-installed for this to work
    sudo pip install pocketsphinx
    
    There are many dependencies which need to be met before installation of pocketsphinx through pip works. Use Synaptics package manager to install the unmet dependencies which would be mentioned as error messages on the terminal window in case installation fails. Some of them include:
    libpulse-dev
    swig
  3. Clone this repository into the src folder of your catkin workspace using:
cd ~/catkin_ws/src
git clone https://github.com/Pankaj-Baranwal/pocketsphinx

To know more about catkin workspace and ROS, follow instructions at: http://wiki.ros.org/ROS/Tutorials/InstallingandConfiguringROSEnvironment
After everything is setup, open a terminal from your catkin workspace and type the following command:

catkin_make
Clone this wiki locally