GitHub - nvbogu/Personal-Voice-and-Chat-Assistant-within-bigbluebutton: Complete open source web conferencing system with a personal assistant (chat & voice)

About this project

This project integrates an extension into the bigbluebutton (open source web conferencing) application which makes it possible to either chat or speak with a personal assistant. The assistant communicates with a Natural Language Understanding API in English.

Installation

In this section I will explain what you need to do to bring your personal assistant to life within your bigbluebutton application.

Prerequisites

Before you start you need to have a full bigbluebutton server running. You can install one by following the official docs.

You also need to follow this official guide for developers so that you can start the bigbluebutton application in developer mode. While following the guide, change the following:

don't fork their bigbluebutton repository, fork mine
continue following the guide, but clone this repository into your ~/dev folder instead of the one mentioned in the guide

Now you need to install the string similarity package. You can find more information about what it does in this section: String similarity test

Navigate to your bigbluebutton-html5 folder by running:

cd ~/dev/bigbluebutton/bigbluebutton-html5

Install the string similarity package by running:

npm install string-similarity --save

It is also necessary to roll back some code changes which I used during my development process: commit. Basically you just need to remove _niklas wherever it occurs; all locations are listed in the commit above.

Hint: Later, once you have installed the ASR-Server and the NLU-Server, you need to change their URLs in the client accordingly. The URL for the ASR-Server is set in the createPostRequest function in the bigbluebutton/bigbluebutton-html5/imports/ui/components/audio/audio-controls/component.jsx file.

The URL for the NLU-Server is set in the make_post_request function in the bigbluebutton-html5/imports/ui/components/voice-assistant/service.js file.

Natural Language Understanding API

To identify the intent of the user and the entities they mention, you need to install this Natural Language Understanding API. For example, "hey bigbluebutton mute Steffen" would result in a wake_up+mute intent and the entity Steffen.
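To make the example above more concrete, here is a minimal sketch of how a client could read such a result. The sample object is written by hand in the general shape of a Rasa parse response (intent name, confidence, entity list); the confidence value, the entity label "user" and the helper name summarizeParse are assumptions for illustration and are not taken from this repository.

```javascript
// Hand-written sample in the general shape of a Rasa /model/parse response.
// The confidence value and the entity label "user" are illustrative assumptions.
const sampleResponse = {
  text: 'hey bigbluebutton mute Steffen',
  intent: { name: 'wake_up+mute', confidence: 0.97 },
  entities: [{ entity: 'user', value: 'Steffen' }],
};

// Hypothetical helper: splits a combined intent such as "wake_up+mute"
// into its parts and collects the entity values.
function summarizeParse(response) {
  const intentName = (response.intent && response.intent.name) || '';
  return {
    intents: intentName ? intentName.split('+') : [],
    entities: (response.entities || []).map((e) => e.value),
  };
}

const summary = summarizeParse(sampleResponse);
console.log(summary.intents);  // [ 'wake_up', 'mute' ]
console.log(summary.entities); // [ 'Steffen' ]
```

Splitting on "+" keeps the wake word check (wake_up) separate from the actual command (mute), so each part can be handled on its own.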

Automatic Speech Recognition API

To be able to transcribe your voice commands into text you need to set up a server which runs the MAX-Speech-to-Text-Converter as an automatic speech recognition (ASR) API.

Setting up a Hybrid Server

In this section you will add the ASR-API to your already existing NLU-API in a matter of minutes.

First make sure you have Docker installed. If not, please follow this official guide: Docker Installation on Ubuntu.

After that create a folder (e.g. "asr"):

mkdir asr

Navigate to the folder:

cd asr

Clone the used MAX-Speech-to-Text-Converter:

git clone https://github.com/IBM/MAX-Speech-to-Text-Converter.git

Navigate to the folder:

cd MAX-Speech-to-Text-Converter

Hint: You will need to have sudo rights

Build the Docker image:

docker build -t max-speech-to-text-converter .

Start the server:

docker run -it -p 5000:5000 max-speech-to-text-converter

Now the only thing you have to do to make our NLU-API and our ASR-API accessible on one server through your NGINX webserver is to change your reverse-proxy.conf file. You just need to add another location and slightly change the first one from your Natural Language Understanding API.

You can do this by navigating to the sites-available folder by running:

cd /etc/nginx/sites-available

Edit the reverse-proxy.conf file by running:

vim reverse-proxy.conf

Replace your single location block with two location blocks and give each one a path, 'location /nlu/' and 'location /asr/':

    location /nlu/ {
             proxy_pass http://localhost:4000/;
    }
    location /asr/ {
             proxy_pass http://localhost:5000/;
    }

After that is done the NLU-API should be accessible through:

www.example.de/nlu/model/parse

and the ASR-API through:

www.example.de/asr/model/predict
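Because both proxy_pass values end with a slash, nginx replaces the matched location prefix (/nlu/ or /asr/) with that slash before forwarding the request. The following sketch imitates this mapping in JavaScript to show which local service each public path ends up at. The routes table and the resolveUpstream helper are hypothetical and only mirror the configuration above; they are not part of nginx or of this repository.

```javascript
// Mirrors the two location blocks from reverse-proxy.conf.
const routes = [
  { prefix: '/nlu/', upstream: 'http://localhost:4000/' },
  { prefix: '/asr/', upstream: 'http://localhost:5000/' },
];

// Hypothetical helper: returns the local URL a public path is forwarded to,
// or null if no location block matches.
function resolveUpstream(path) {
  const route = routes.find((r) => path.startsWith(r.prefix));
  if (!route) return null;
  // The matched prefix is replaced by the URI part of proxy_pass ("/").
  return route.upstream + path.slice(route.prefix.length);
}

console.log(resolveUpstream('/nlu/model/parse'));   // http://localhost:4000/model/parse
console.log(resolveUpstream('/asr/model/predict')); // http://localhost:5000/model/predict
```

This is why the APIs themselves do not need to know about the /nlu/ and /asr/ prefixes.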

Optional Meeting Summary Extension

You can also add the functionality to get a summary of the ongoing meeting if you follow this guide: Summary Feature Server.

You also need to change the URL of the summary server accordingly. It is set in the execute_intent function in the bigbluebutton-html5/imports/ui/components/voice-assistant/service.js file.

Start the Hybrid Server

In this section you will start the NGINX webserver and two local services. One is the NLU-API and the other is the ASR-API. The NGINX server will route your POST requests to the right API (localhost port).

Remember, you can start the NLU server by navigating to your project folder:

cd <your_project_name>

Activate your venv by running:

source ./<your_virtual_environment_name>/bin/activate

Now navigate to your repository folder by running:

cd Natural-Language-Understanding-API

With this setup you need to change the port of the NLU-API from 5000 to 4000, because the ASR-API uses port 5000. You can start it on port 4000 by running:

rasa run --enable-api -m models/bigbluebutton.tar.gz -p 4000

You can start the ASR-API by running:

docker run -it -p 5000:5000 max-speech-to-text-converter

You can do this from anywhere on your Ubuntu machine as the root user. You can switch to the root user by running:

sudo -i

Now your personal voice assistant within the bigbluebutton application should be ready to use!

How to continue this project

To give a better understanding of the project I have created the following images to illustrate the different functionalities and where they are located.

The current architecture looks like this:

The communication between the client and the NLU-Server looks like this:

The communication between the client and the ASR-Server looks like this:

The files which I have changed are located here:

Known Bug:

As of right now this extension does not run in the Firefox browser due to the Recorder class. This should be fixed for a production environment.

String similarity test

The string similarity package is used to identify users within a bigbluebutton meeting even when their names are misspelled or given as nicknames, so you don't have to type the 100% correct name. For example, Niklas_93 will be identified as Niklas_93 in the meeting even if you only said or typed Niklas.

A test file for this package, with some use cases and their confidences, is located at:

bigbluebutton-html5/tests/string_similarity_test/string_similarity_test.ipynb

It is also possible to take a look at the test with the jupyter nbviewer here: https://nbviewer.jupyter.org/github/nvbogu/Personal-Voice-and-Chat-Assistant-within-bigbluebutton/blob/develop/bigbluebutton-html5/tests/string_similarity_test/string_similarity_test.ipynb
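To illustrate the idea behind this kind of name matching, here is a small self-contained sketch based on the Sørensen–Dice coefficient over character pairs, which is, to my knowledge, the measure the string-similarity package is built on. It is not the package's code and not the code used in this project. The function names, the lowercasing step and the example user list are assumptions made for illustration.

```javascript
// Counts the adjacent character pairs (bigrams) of a string.
function bigrams(s) {
  const counts = new Map();
  for (let i = 0; i < s.length - 1; i += 1) {
    const pair = s.slice(i, i + 2);
    counts.set(pair, (counts.get(pair) || 0) + 1);
  }
  return counts;
}

// Dice coefficient: 2 * shared bigrams / total bigrams, between 0 and 1.
// Lowercasing is an assumption of this sketch.
function diceSimilarity(a, b) {
  const x = a.toLowerCase();
  const y = b.toLowerCase();
  if (x === y) return 1;
  if (x.length < 2 || y.length < 2) return 0;
  const remaining = bigrams(x);
  let overlap = 0;
  for (let i = 0; i < y.length - 1; i += 1) {
    const pair = y.slice(i, i + 2);
    const left = remaining.get(pair) || 0;
    if (left > 0) {
      remaining.set(pair, left - 1);
      overlap += 1;
    }
  }
  return (2 * overlap) / (x.length - 1 + (y.length - 1));
}

// Hypothetical helper: picks the meeting user whose name is closest to the input.
function bestMatch(spoken, userNames) {
  let best = { target: null, rating: -1 };
  for (const name of userNames) {
    const rating = diceSimilarity(spoken, name);
    if (rating > best.rating) best = { target: name, rating };
  }
  return best;
}

// Example user list (made up for this sketch).
const match = bestMatch('Niklas', ['Steffen', 'Niklas_93', 'Anna']);
console.log(match.target); // Niklas_93
```

In practice a minimum rating would be needed as well, so that a name which matches nobody in the meeting is rejected instead of being mapped to the least bad candidate.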

License

This project is open source for everyone.
