Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update rapids version for 24.10 release #1248

Draft
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

nvliyuan
Copy link
Contributor

This pr is to update the spark-rapids script version to 24.10.0 and update the readme doc

@nvliyuan
Copy link
Contributor Author

@viadea please help review. CC @jayadeep-jayaraman @cjac

@cjac
Copy link
Contributor

cjac commented Oct 24, 2024

Oh hey, thanks for the ping. I'll check it out.

@cjac
Copy link
Contributor

cjac commented Oct 24, 2024

/gcbrun

@cjac cjac marked this pull request as draft October 24, 2024 04:26
Copy link
Contributor

@cjac cjac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's get the docs updated to reflect the versions of Dataproc supported in Q4 2024

Our current supported versions follow:
2.2-debian12
2.1-debian11
2.0-debian10
2.2-ubuntu22
2.2-ubuntu20
2.0-ubuntu18
2.2-rocky9
2.1-rocky8
2.0-rocky8

And if all of these platforms are not supported by spark-rapids, then we'll need to merge in my changes to dask-rapids.

* NCCL 2.11.4+
* Ubuntu 18.04, Ubuntu 20.04 or Rocky Linux 7, Rocky Linux8, Debian 10, Debian 11
* Ubuntu 20.04, Ubuntu 22.04, CentOS 7, or Rocky Linux 8, Debian 10, Debian 11
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you have support for CentOS 7, stil, really? I commend you. Do the tests exercise that platform?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would also add Rocky Linux 9.

If you do not have support for rocky linux 9 yet, we can merge my work from spark-dask. I'll continue my review.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for pointing it out, we don't support centos7 now, update a link in the doc for the soft/hardware requirements.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

File related pr for the doc issue.

@@ -63,7 +63,7 @@ export CUDA_VER=11.5

gcloud dataproc clusters create $CLUSTER_NAME \
--region $REGION \
--image-version=2.0-ubuntu18 \
--image-version=2.1-ubuntu20 \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please recommend 2.2 where possible. If 2.1 doesn't have other representation, then this is fine, but we should be emphasizing that 2.2 is the better choice.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

update to 2.2 image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants