Features

The AI Search Database.

Quickstart | Nuclia Docs | Community

NucliaDB is a robust database that allows storing and searching on unstructured data.

It is an out of the box hybrid search database, utilizing vector, full text and graph indexes.

NucliaDB is written in Rust and Python. We designed it to index large datasets and provide multi-teanant support.

When utilizing NucliaDB with Nuclia cloud, you are able to the power of an NLP database without the hassle of data extraction, enrichment and inference. We do all the hard work for you.

Features

Store text, files, vectors, labels and annotations
Perform text searches and given a word or set of words, return resources in our database that contain them.
Perform semantic searches with vectors. For example, given a set of vectors, return the closest matches in our database. With NLP, this allows us to look for similar sentences without being constrained by exact keywords.
Export your data in a format compatible with most NLP pipelines (HuggingFace datasets, pytorch, etc)
Store original data, extracting and data pulled from the Understanding API
Index fields, paragraphs, and semantic sentences on index storage
Cloud data and insight extraction with the Nuclia Understanding API™
Cloud connection to train ML models with Nuclia Learning API™
Role based security system with upstream proxy authentication validation
Resources with multiple fields and metadata
Text/HTML/Markdown plain fields support
Field types: text, file, link, conversation
Storage layer (PostgreSQL)
Blob support with S3-compatible API, GCS and Azure Blob Storage
Replication of index storage
Distributed search
Cloud-native

Architecture

Quickstart

Trying NucliaDB is super easy! You can extend your knowledge with the following readings:

Quick start!
Read about what Knowledge boxes are in our basic concepts section
Upload your data

💬 Community

Chat with us in Slack
📝 Blog Posts
Follow us on X
Do you want to work with us?

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

The core difference and advantage of NucliaDB is its architecture built from the ground up for unstructured data. Its vector index, keyword, graph and fuzzy search provide an API to use all extracted and extracted information from Nuclia, Understanding API and provides powerful NLP abilities to any application with low code and peace of mind.

What license does NucliaDB use?

NucliaDB is open-source under the GNU Affero General Public License Version 3 - AGPLv3. Fundamentally, this means that you are free to use NucliaDB for your project, as long as you don't modify NucliaDB. If you do, you have to make the modifications public.

What is Nuclia's business model?

Our business model relies on our normalization API, this one is based on Nuclia Learning API and Nuclia Understanding API. This two APIs offers transformation of unstructured data to NucliaDB compatible data with AI. We also offer NucliaDB as a service at our multi-cloud provider infrastructure: https://nuclia.cloud.

🤝 Contribute and spread the word

We are always happy to have contributions: code, documentation, issues, feedback, or even saying hello on Slack! Here is how you can get started:

Read our Contributor Covenant Code of Conduct
Create a fork of NucliaDB and submit your pull request!

✨ And to thank you for your contributions, claim your swag by emailing us at info at nuclia.com.

Name		Name	Last commit message	Last commit date
Latest commit History 2,777 Commits
.cargo		.cargo
.github		.github
charts		charts
config		config
docs		docs
e2e		e2e
mypy_stubs		mypy_stubs
nidx		nidx
nucliadb		nucliadb
nucliadb_core		nucliadb_core
nucliadb_dataset		nucliadb_dataset
nucliadb_models		nucliadb_models
nucliadb_node		nucliadb_node
nucliadb_node_binding		nucliadb_node_binding
nucliadb_paragraphs3		nucliadb_paragraphs3
nucliadb_performance		nucliadb_performance
nucliadb_procs		nucliadb_procs
nucliadb_protos		nucliadb_protos
nucliadb_relations2		nucliadb_relations2
nucliadb_sdk		nucliadb_sdk
nucliadb_sidecar		nucliadb_sidecar
nucliadb_telemetry		nucliadb_telemetry
nucliadb_texts3		nucliadb_texts3
nucliadb_utils		nucliadb_utils
nucliadb_vectors		nucliadb_vectors
proposals		proposals
scripts		scripts
vectors_benchmark		vectors_benchmark
.coveragerc		.coveragerc
.dockerignore		.dockerignore
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitignore		.gitignore
.license_header.txt		.license_header.txt
.licenserc.yaml		.licenserc.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CODE_STYLE_PYTHON.md		CODE_STYLE_PYTHON.md
CODE_STYLE_RUST.md		CODE_STYLE_RUST.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
Dockerfile.nidx		Dockerfile.nidx
Dockerfile.node		Dockerfile.node
Dockerfile.node_prebuilt		Dockerfile.node_prebuilt
Dockerfile.node_sidecar		Dockerfile.node_sidecar
Dockerfile.pipbinding		Dockerfile.pipbinding
Dockerfile.withbinding		Dockerfile.withbinding
LICENSE.txt		LICENSE.txt
LICENSE_AGPLv3.0.txt		LICENSE_AGPLv3.0.txt
Makefile		Makefile
NucliaDB_individual_CLA.md		NucliaDB_individual_CLA.md
README.md		README.md
VERSION		VERSION
bump.py		bump.py
deny.toml		deny.toml
docker-compose.yaml		docker-compose.yaml
mypy.ini		mypy.ini
openapi.yaml		openapi.yaml
pdm.lock		pdm.lock
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
rust-toolchain.toml		rust-toolchain.toml
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The AI Search Database.

Quickstart | Nuclia Docs | Community

Features

Architecture

Quickstart

💬 Community

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

What license does NucliaDB use?

What is Nuclia's business model?

🤝 Contribute and spread the word

Reference

Meta

About

Releases 70

Contributors 30

Languages

License

nuclia/nucliadb

Folders and files

Latest commit

History

Repository files navigation

The AI Search Database.

Quickstart | Nuclia Docs | Community

Features

Architecture

Quickstart

💬 Community

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

What license does NucliaDB use?

What is Nuclia's business model?

🤝 Contribute and spread the word

Reference

Meta

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 70

Contributors 30

Languages