Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gaussian Mixture Model implementation #369

Open
wants to merge 15 commits into
base: main
Choose a base branch
from
Open

Conversation

Craigacp
Copy link
Member

@Craigacp Craigacp commented May 20, 2024

Description

Adds a Gaussian Mixture Model clustering implementation with spherical, diagonal and full covariance structure using Expectation Maximisation. Also adds a mixture distribution to the RNG library to allow sampling from a user constructed gaussian mixture model (as opposed to one fit to a data distribution).

It also contains new Math functions necessary to implement the GMM efficiently, some updates for K-Means to modernise it a little bit, and some cleanups to the main pom file.

There are a few important fixes to the Math package in here as well, determinants for matrix factorizations were incorrectly computed, and the subtract function on SparseVector was incorrect.

Motivation

GMMs are a useful clustering algorithm. Fixes #359.

Paper reference

Hastie, T., Tibshirani, R., Friedman, J. H., & Friedman, J. H. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition.

@Craigacp Craigacp added the Oracle employee This PR is from an Oracle employee label May 20, 2024
@oracle-contributor-agreement oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label May 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
OCA Verified All contributors have signed the Oracle Contributor Agreement. Oracle employee This PR is from an Oracle employee
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Gaussian Mixture Model capability
1 participant