Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable ci/cd spark image build when transform does not implement spark #604

Merged
merged 69 commits into from
Sep 20, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
69 commits
Select commit Hold shift + click to select a range
3fe8d2c
disable test workflow when none code files change
daw3rd Sep 13, 2024
86951f8
one more path-ignore in test.yml
daw3rd Sep 16, 2024
d368dc7
one more fix for path-ignore in test.yml
daw3rd Sep 16, 2024
6d7c186
test universal transform separately
daw3rd Sep 17, 2024
35f9f31
rename test universal workflow
daw3rd Sep 17, 2024
37cd7ef
add comments to noop src to trigger new universal test workflow
daw3rd Sep 17, 2024
a7fa50b
fix paths in test universal workflow
daw3rd Sep 17, 2024
300d06a
addj back ignore paths in test universal workflow
daw3rd Sep 17, 2024
0154179
another noop comment
daw3rd Sep 17, 2024
5dd78e6
move ignored paths to paths in univesal test workflow
daw3rd Sep 17, 2024
d91da0d
test-universal workflow name changes
daw3rd Sep 17, 2024
1be3292
noop comments
daw3rd Sep 17, 2024
54d48b6
noop readme change'
daw3rd Sep 17, 2024
288ffdf
change test universal not paths
daw3rd Sep 17, 2024
668b70d
disable all but new noop and doc_id test workflows
daw3rd Sep 17, 2024
aa24ee7
code change in noop
daw3rd Sep 17, 2024
1d72cf5
remake test transforms
daw3rd Sep 17, 2024
142da80
Merge branch 'dev' into cicd-opt
daw3rd Sep 17, 2024
6bee51b
add individual test transform workflows
daw3rd Sep 17, 2024
db18c74
noop README change
daw3rd Sep 17, 2024
42e18fc
better ignore of .md on test transform workflows
daw3rd Sep 17, 2024
ca09d75
noop readme change
daw3rd Sep 17, 2024
f16780b
noop test transform worklow 1 ignore
daw3rd Sep 17, 2024
189bdd7
noop readme
daw3rd Sep 17, 2024
54b3c69
split out the tests into test-kfp/lib/misc and remove test.yml, add r…
daw3rd Sep 17, 2024
d530ac9
test-kfp only on kfp/**
daw3rd Sep 17, 2024
781113f
noop code change to trigger build
daw3rd Sep 17, 2024
f09fbee
comments in workflows
daw3rd Sep 17, 2024
193f439
updated workflow readme
daw3rd Sep 17, 2024
3ed0fbd
only run build-library workflow on data-processing-lib changes
daw3rd Sep 17, 2024
60343d3
try and ignore docs in build-library, test-kfp/lib
daw3rd Sep 17, 2024
dec3aa9
workflow title changes for consistency
daw3rd Sep 17, 2024
e6b1d62
test change on filter source
daw3rd Sep 17, 2024
c40cf1f
change to lib readme
daw3rd Sep 17, 2024
c4853d1
change to lib source
daw3rd Sep 17, 2024
6badd06
Merge branch 'dev' into cicd-opt
daw3rd Sep 17, 2024
c4417fc
minor job name changes in transform workflows
daw3rd Sep 17, 2024
8270c6c
noop readme
daw3rd Sep 17, 2024
dc06b94
test-lib workflow ignores
daw3rd Sep 17, 2024
428833e
top level readme
daw3rd Sep 17, 2024
0bd7992
noop test source
daw3rd Sep 17, 2024
31b7f53
filter source change'
daw3rd Sep 17, 2024
e78df7b
updated all transform tets workflows
daw3rd Sep 17, 2024
684ae7a
fix typo in test template on check_images
daw3rd Sep 18, 2024
f419059
noop src change
daw3rd Sep 18, 2024
a887bf9
check for makefile in test transform workflow
daw3rd Sep 18, 2024
f9399c1
automatically determine transforms in transforms directory for which …
daw3rd Sep 18, 2024
e064e00
worklow readme, transform existence verification, disable build-libra…
daw3rd Sep 18, 2024
3a81d21
workflow readme details on kfp and misc tests
daw3rd Sep 18, 2024
5d957ed
backing out change to dpk lib code
daw3rd Sep 19, 2024
b02b52c
restore filter code
daw3rd Sep 19, 2024
7cb4390
restore noop code
daw3rd Sep 19, 2024
b780cf5
workflow readme
daw3rd Sep 19, 2024
9a5ccdc
really restore noop code
daw3rd Sep 19, 2024
93c56c2
check for makefile in transform test-src testing
daw3rd Sep 19, 2024
8f8592d
don't include lib test dependencies in transform test workflows
daw3rd Sep 19, 2024
40ce888
noop code change
daw3rd Sep 19, 2024
cb0d1f9
disable noop, don't include lib test-data in transform dependencies
daw3rd Sep 19, 2024
d6c5784
use job.id.if on Makefile to enable transform test job
daw3rd Sep 19, 2024
4f6b2f2
use job.id.if on Makefile to enable transform test job
daw3rd Sep 19, 2024
c0f4935
restore noop Makefile
daw3rd Sep 19, 2024
0a09daa
exclude kfp_ray from transfor test workflow and change noop code
daw3rd Sep 19, 2024
1bfb33d
remove if: from test workflows
daw3rd Sep 19, 2024
a80c0fd
backout noop code change
daw3rd Sep 19, 2024
ee08033
backout noop code change
daw3rd Sep 19, 2024
30a1524
only build spark image for transform image tests that need it
daw3rd Sep 20, 2024
6d6a233
header_cleanser code change
daw3rd Sep 20, 2024
b062ae0
Merge branch 'dev' into cicd-opt
daw3rd Sep 20, 2024
f495f82
only build spark image for transform image tests that need it
daw3rd Sep 20, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 3 additions & 1 deletion .github/workflows/test-code-code2parquet.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/code2parquet
run: |
if [ -e "transforms/code/code2parquet/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/code2parquet/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/code2parquet DOCKER=docker test-image
else
echo "transforms/code/code2parquet/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-code-code_quality.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/code_quality
run: |
if [ -e "transforms/code/code_quality/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/code_quality/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/code_quality DOCKER=docker test-image
else
echo "transforms/code/code_quality/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-code-header_cleanser.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/header_cleanser
run: |
if [ -e "transforms/code/header_cleanser/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/header_cleanser/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/header_cleanser DOCKER=docker test-image
else
echo "transforms/code/header_cleanser/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-code-malware.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/malware
run: |
if [ -e "transforms/code/malware/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/malware/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/malware DOCKER=docker test-image
else
echo "transforms/code/malware/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-code-proglang_select.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/proglang_select
run: |
if [ -e "transforms/code/proglang_select/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/proglang_select/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/proglang_select DOCKER=docker test-image
else
echo "transforms/code/proglang_select/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-code-repo_level_ordering.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/code/repo_level_ordering
run: |
if [ -e "transforms/code/repo_level_ordering/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/code/repo_level_ordering/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/code/repo_level_ordering DOCKER=docker test-image
else
echo "transforms/code/repo_level_ordering/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-doc_chunk.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/doc_chunk
run: |
if [ -e "transforms/language/doc_chunk/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/doc_chunk/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/doc_chunk DOCKER=docker test-image
else
echo "transforms/language/doc_chunk/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-doc_quality.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/doc_quality
run: |
if [ -e "transforms/language/doc_quality/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/doc_quality/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/doc_quality DOCKER=docker test-image
else
echo "transforms/language/doc_quality/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-lang_id.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/lang_id
run: |
if [ -e "transforms/language/lang_id/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/lang_id/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/lang_id DOCKER=docker test-image
else
echo "transforms/language/lang_id/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-pdf2parquet.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/pdf2parquet
run: |
if [ -e "transforms/language/pdf2parquet/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/pdf2parquet/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/pdf2parquet DOCKER=docker test-image
else
echo "transforms/language/pdf2parquet/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-pii_redactor.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/pii_redactor
run: |
if [ -e "transforms/language/pii_redactor/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/pii_redactor/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/pii_redactor DOCKER=docker test-image
else
echo "transforms/language/pii_redactor/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-language-text_encoder.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/language/text_encoder
run: |
if [ -e "transforms/language/text_encoder/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/language/text_encoder/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/language/text_encoder DOCKER=docker test-image
else
echo "transforms/language/text_encoder/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-transform.template
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in @TARGET_TRANSFORM_DIR@
run: |
if [ -e "@TARGET_TRANSFORM_DIR@/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "@TARGET_TRANSFORM_DIR@/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C @TARGET_TRANSFORM_DIR@ DOCKER=docker test-image
else
echo "@TARGET_TRANSFORM_DIR@/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-doc_id.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/doc_id
run: |
if [ -e "transforms/universal/doc_id/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/doc_id/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/doc_id DOCKER=docker test-image
else
echo "transforms/universal/doc_id/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-ededup.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/ededup
run: |
if [ -e "transforms/universal/ededup/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/ededup/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/ededup DOCKER=docker test-image
else
echo "transforms/universal/ededup/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-fdedup.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/fdedup
run: |
if [ -e "transforms/universal/fdedup/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/fdedup/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/fdedup DOCKER=docker test-image
else
echo "transforms/universal/fdedup/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-filter.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/filter
run: |
if [ -e "transforms/universal/filter/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/filter/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/filter DOCKER=docker test-image
else
echo "transforms/universal/filter/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-html2parquet.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/html2parquet
run: |
if [ -e "transforms/universal/html2parquet/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/html2parquet/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/html2parquet DOCKER=docker test-image
else
echo "transforms/universal/html2parquet/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-noop.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/noop
run: |
if [ -e "transforms/universal/noop/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/noop/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/noop DOCKER=docker test-image
else
echo "transforms/universal/noop/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-profiler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/profiler
run: |
if [ -e "transforms/universal/profiler/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/profiler/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/profiler DOCKER=docker test-image
else
echo "transforms/universal/profiler/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-resize.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/resize
run: |
if [ -e "transforms/universal/resize/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/resize/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/resize DOCKER=docker test-image
else
echo "transforms/universal/resize/Makefile not found - testing disabled for this transform."
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/test-universal-tokenization.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,7 +102,9 @@ jobs:
- name: Test transform image in transforms/universal/tokenization
run: |
if [ -e "transforms/universal/tokenization/Makefile" ]; then
make -C data-processing-lib/spark DOCKER=docker image
if [ -d "transforms/universal/tokenization/spark" ]; then
make -C data-processing-lib/spark DOCKER=docker image
fi
make -C transforms/universal/tokenization DOCKER=docker test-image
else
echo "transforms/universal/tokenization/Makefile not found - testing disabled for this transform."
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -27,5 +27,5 @@ def __init__(self):

if __name__ == "__main__":
launcher = PythonTransformLauncher(HeaderCleanserPythonTransformConfiguration())
logger.info("Launching license copyright header removal")
logger.info("Launching license copyright header removal.")
launcher.launch()