/test jump-ci icelake skeleton#634

Open
MML-coder wants to merge 2 commits into openshift-psap:main from MML-coder:ci-test
@MML-coder
Collaborator

No description provided.

@openshift-ci

openshift-ci Bot commented Jan 14, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign ashishkamra for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@MML-coder
Collaborator Author

/test jump-ci icelake skeleton

openshift-ci Bot added the needs-ok-to-test label (indicates a PR that requires an org member to verify it is safe to test) on Jan 14, 2025
@openshift-ci

openshift-ci Bot commented Jan 14, 2025

Hi @MML-coder. Thanks for your PR.

I'm waiting for an openshift-psap member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-ci

openshift-ci Bot commented Jan 14, 2025

@MML-coder: Cannot trigger testing until a trusted user reviews the PR and leaves an /ok-to-test message.

In response to this:

/test jump-ci icelake skeleton

@kpouget
Contributor

kpouget commented Jan 15, 2025

/ok-to-test
/test jump-ci icelake skeleton

openshift-ci Bot added the ok-to-test label (indicates a non-member PR verified by an org member that is safe to test) and removed the needs-ok-to-test label (indicates a PR that requires an org member to verify it is safe to test) on Jan 15, 2025
@psap-forge-bot

🟢 Test of 'skeleton test test_ci' succeeded after 00 hours 09 minutes 14 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

@MML-coder
Collaborator Author

/test jump-ci icelake skeleton

@psap-forge-bot

🟢 Test of 'skeleton test test_ci' succeeded after 00 hours 09 minutes 28 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

@kpouget
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning

@psap-forge-bot

🔴 Test of 'fine_tuning test prepare_ci' failed after 00 hours 10 minutes 53 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

Failure indicator:

/logs/artifacts/000__prepare1/000__install_rhoai/000__cluster__deploy_operator_service-mesh/FAILURE | [000__cluster__deploy_operator_service-mesh] ./run_toolbox.py cluster deploy_operator --catalog=redhat-operators --manifest_name=servicemeshoperator --namespace=all --> 2
/logs/artifacts/000__prepare1/000__install_rhoai/FAILURE | CalledProcessError: Command 'set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;ARTIFACT_DIR="/logs/artifacts/000__prepare1/000__install_rhoai" ARTIFACT_TOOLBOX_NAME_SUFFIX="_service-mesh" ./run_toolbox.py cluster deploy_operator --catalog='redhat-operators' --manifest_name='servicemeshoperator' --namespace='all'' returned non-zero exit status 2.
Traceback (most recent call last):
  File "/opt/topsail/src/projects/rhods/library/prepare_rhoai.py", line 57, in install
    install_servicemesh()
  File "/opt/topsail/src/projects/rhods/library/prepare_rhoai.py", line 32, in install_servicemesh
    run.run_toolbox("cluster", "deploy_operator",
  File "/opt/topsail/src/projects/core/library/run.py", line 82, in run_toolbox
    return run(f'{cmd_env} ./run_toolbox.py {group} {command} {_dict_to_run_toolbox_args(kwargs)}', **run_kwargs)
  File "/opt/topsail/src/projects/core/library/run.py", line 105, in run

[...]

@MML-coder
Collaborator Author

/test rhoai-light skeleton

@psap-forge-bot

🟢 Test of 'rhoai test test_ci' succeeded after 00 hours 05 minutes 45 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

# RHOAI: run skeleton test test_ci
PR_POSITIONAL_ARGS: ''
PR_POSITIONAL_ARG_0: skeleton-rhoai-light

@kpouget
Contributor

kpouget commented Jan 16, 2025

/test rhoai-light fine_tuning fms gating use_smoke_os

@psap-forge-bot

🟢 Test of 'rhoai test test_ci' succeeded after 00 hours 37 minutes 41 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

# RHOAI: run fine_tuning test test_ci
PR_POSITIONAL_ARGS: fms gating use_smoke_os
PR_POSITIONAL_ARG_0: fine_tuning-rhoai-light
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
PR_POSITIONAL_ARG_3: use_smoke_os
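
For readers following the thread, the two rhoai-light runs above suggest how a `/test` line ends up in the reported test configuration. The sketch below only reproduces the pattern visible in those two bot replies; it is not TOPSAIL's actual parser, and the function name `positional_args` is hypothetical. The jump-ci runs in this thread report a different layout (`PR_POSITIONAL_ARGS: jump-ci`), which this sketch does not attempt to cover.

```python
def positional_args(test_line: str) -> dict:
    """Map a '/test <job> <project> [extra...]' comment line to PR_POSITIONAL_* keys.

    Assumption: this mirrors only the rhoai-light replies seen in this thread.
    """
    tokens = test_line.split()
    if len(tokens) < 3 or tokens[0] != "/test":
        raise ValueError(f"not a '/test <job> <project>' line: {test_line!r}")

    _, job, project, *extra = tokens

    config = {
        # Everything after the project name, joined back with spaces.
        "PR_POSITIONAL_ARGS": " ".join(extra),
        # Observed form: '<project>-<job>', e.g. 'fine_tuning-rhoai-light'.
        "PR_POSITIONAL_ARG_0": f"{project}-{job}",
    }
    # Each extra token gets its own numbered key, starting at 1.
    for index, value in enumerate(extra, start=1):
        config[f"PR_POSITIONAL_ARG_{index}"] = value
    return config


if __name__ == "__main__":
    for key, value in positional_args(
        "/test rhoai-light fine_tuning fms gating use_smoke_os"
    ).items():
        print(f"{key}: {value}")
```

With no extra tokens, as in `/test rhoai-light skeleton`, the sketch yields an empty `PR_POSITIONAL_ARGS` and only `PR_POSITIONAL_ARG_0`, matching the earlier skeleton reply.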

@kpouget
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: true

@psap-forge-bot

🔴 Test of 'fine_tuning test test_ci' failed after 00 hours 05 minutes 35 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
matbench.lts.opensearch.export.enabled: True

Failure indicator:

/logs/artifacts/004__fms_fine_tuning/000__fine_tuning__run_fine_tuning_job/FAILURE | [000__fine_tuning__run_fine_tuning_job] ./run_toolbox.py from_config fine_tuning run_fine_tuning_job --extra={'name': 'fine-tuning', 'pod_count': 1, 'model_name': 'bloom-560m', 'dataset_name': 'twitter_complaints_small.json', 'gpu': 1, 'dataset_replication': 1, 'dataset_response_template': '\n### Label:'} --> 2
/logs/artifacts/004__fms_fine_tuning/FAILURE | CalledProcessError: Command 'set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;ARTIFACT_DIR="/logs/artifacts/004__fms_fine_tuning" ./run_toolbox.py from_config fine_tuning run_fine_tuning_job --extra="{'name': 'fine-tuning', 'pod_count': 1, 'model_name': 'bloom-560m', 'dataset_name': 'twitter_complaints_small.json', 'gpu': 1, 'dataset_replication': 1, 'dataset_response_template': '\n### Label:'}"' returned non-zero exit status 2.
Traceback (most recent call last):
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 156, in _run_test
    run.run_toolbox_from_config("fine_tuning", "run_fine_tuning_job",
  File "/opt/topsail/src/projects/core/library/run.py", line 49, in run_toolbox_from_config
    return run(f'{cmd_env} ./run_toolbox.py from_config {group} {command} {_dict_to_run_toolbox_args(kwargs)}', **run_kwargs)
  File "/opt/topsail/src/projects/core/library/run.py", line 105, in run
    proc = subprocess.run(command, **args)
  File "/usr/lib64/python3.9/subprocess.py", line 528, in run

[...]
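
The traceback above ends in `subprocess.run(command, **args)`, and the failing command string starts with a `set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;` prefix. As a rough illustration of why a non-zero toolbox exit surfaces as `CalledProcessError`, here is a minimal sketch of such a wrapper. It is an assumption about the general shape, not the code in `run.py`; the `run` signature and the `STRICT_MODE` name are made up for this example, and it assumes `bash` is on the `PATH`.

```python
import shutil
import subprocess

# Same shell options as the prefix shown in the failure indicator.
STRICT_MODE = "set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;"


def run(command: str, check: bool = True) -> subprocess.CompletedProcess:
    """Run a shell command under strict bash options (hypothetical helper)."""
    bash = shutil.which("bash")
    if bash is None:
        raise RuntimeError("bash is required for 'set -o pipefail'")
    # With check=True, any non-zero exit status raises CalledProcessError,
    # which carries the status in its 'returncode' attribute.
    return subprocess.run(
        [bash, "-c", STRICT_MODE + command],
        check=check,
        capture_output=True,
        text=True,
    )


if __name__ == "__main__":
    try:
        run("exit 2")
    except subprocess.CalledProcessError as exc:
        print(f"command failed with exit status {exc.returncode}")
```

Because of `pipefail`, a failure anywhere in a pipeline (for example `false | true`) also counts as a failure of the whole command rather than being masked by the last stage.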

@kpouget
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: false
/var tests.fine_tuning.test_settings.gpu: 0
/only test_ci

@psap-forge-bot

🔴 Test of 'fine_tuning test test_ci' failed after 00 hours 05 minutes 31 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
matbench.lts.opensearch.export.enabled: False
tests.fine_tuning.test_settings.gpu: 0

Failure indicator:

/logs/artifacts/005__plots/FAILURE | An error happened during the visualization post-processing ... (opensearch secret cannot be generated)
/logs/artifacts/006__prom_plots/FAILURE | An error happened during the visualization post-processing ... (opensearch secret cannot be generated)
/logs/artifacts/FAILURE | Traceback (most recent call last):
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 410, in test
    failed = _run_test_and_visualize()
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 338, in _run_test_and_visualize
    generate_visualization(do_matbenchmarking, test_artifact_dir_p[0])
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 389, in generate_visualization
    raise exc
  File "/opt/topsail/src/projects/core/library/run.py", line 178, in run_and_catch

[...]
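
The `/var` and `/only` lines in the comment above show up in the bot's "Test configuration" as typed values (`false` becomes `False`, `0` stays an integer). The sketch below illustrates one way such directives could be read from a comment body. It is a simplified stand-in under stated assumptions: the real tooling most likely uses a YAML parser, while this example uses only a small hand-written coercion, and the names `parse_directives` and `coerce` are hypothetical.

```python
def coerce(raw: str):
    """Turn a raw '/var' value into a bool or int when it looks like one."""
    lowered = raw.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    try:
        return int(raw)
    except ValueError:
        return raw  # Leave anything else as a plain string.


def parse_directives(comment: str):
    """Collect '/var key: value' overrides and '/only' targets from a comment."""
    overrides = {}
    only = []
    for line in comment.splitlines():
        line = line.strip()
        if line.startswith("/var "):
            key, sep, raw = line[len("/var "):].partition(":")
            if not sep:
                raise ValueError(f"expected '/var key: value', got {line!r}")
            overrides[key.strip()] = coerce(raw.strip())
        elif line.startswith("/only "):
            only.extend(line.split()[1:])
    return overrides, only


if __name__ == "__main__":
    comment = """\
/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: false
/var tests.fine_tuning.test_settings.gpu: 0
/only test_ci
"""
    overrides, only = parse_directives(comment)
    print(overrides)
    print(only)
```

Lines that are neither `/var` nor `/only`, such as the `/test` line itself, are ignored here so the sketch stays focused on the overrides.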

@kpouget
Contributor

kpouget commented Jan 20, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: false
/var tests.fine_tuning.test_settings.gpu: 0
/only test_ci

@psap-forge-bot

🟢 Test of 'fine_tuning test test_ci' succeeded after 00 hours 05 minutes 47 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

{PR_POSITIONAL_ARGS: jump-ci, PR_POSITIONAL_ARG_1: fms, PR_POSITIONAL_ARG_2: gating,
  matbench.lts.opensearch.export.enabled: false, tests.fine_tuning.test_settings.gpu: 0}

@kpouget
Contributor

kpouget commented Dec 10, 2025

outdated, closing

Labels

ok-to-test: indicates a non-member PR verified by an org member that is safe to test.

2 participants