John Snow Labs Spark-NLP 4.0.2: Over 620 new state-of-the-art models in 21 languages, full support for Apache Spark 3.3.0, new Databricks runtime 11.1, and bug fixes #10575
maziyarpanahi
announced in
Announcement
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Overview
We are pleased to release Spark NLP 🚀 4.0.2! This release comes with full compatibility with the newly-released Apache Spark 3.3.0 and official support for Databrick's new runtimes 11.1 Beta (includes Apache Spark 3.3.0, Scala 2.12).
As always, we would like to thank our community for their feedback, questions, and feature requests.
New Features
SentenceDetector
now comes with a new parametercustomBoundsStrategy
for returning custom bounds [SentenceDetector] Added Flag for returning custom bounds #10567Example
with
setCustomBounds([r"\.", ";"])
Without the flags will result in
With the new flag:
the result will be
Similarly with prepend:
the result will be
Bug Fixes
Models and Pipelines
Spark NLP 4.0.2 comes with 620+ state-of-the-art pre-trained transformer models in 21 languages including multi-lingual models.
Featured Models
en
zh
ru
id
ko
xx
tr
el
fa
ar
de
xx
ar
sv
zh
hu
id
ga
pt
es
The complete list of all 6900+ models & pipelines in 230+ languages is available on Models Hub
📖 Documentation & Articles
Installation
Python
#PyPI pip install spark-nlp==4.0.2
Spark Packages
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x (Scala 2.12):
GPU
M1
Maven
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x (Scala 2.12):
spark-nlp-gpu:
spark-nlp-m1:
FAT JARs
CPU on Apache Spark 3.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-assembly-4.0.2.jar
GPU on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-gpu-assembly-4.0.2.jar
M1 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-m1-assembly-4.0.2.jar
What's Changed
Contributors
@gadde5300 @danilojsl @hsaglamlar @Cabir40 @ahmedlone127 @muhammetsnts @KshitizGIT @maziyarpanahi @albertoandreottiATgmail @DevinTDHa @luca-martial @Damla-Gurbaz @jsl-models @Meryem1425
New Contributors
Full Changelog: 4.0.1...4.0.2
This discussion was created from the release John Snow Labs Spark-NLP 4.0.2: Over 620 new state-of-the-art models in 21 languages, full support for Apache Spark 3.3.0, new Databricks runtime 11.1, and bug fixes.
Beta Was this translation helpful? Give feedback.
All reactions