Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job Name length limit #5099

Open
EnyMan opened this issue Mar 24, 2025 · 2 comments
Open

Job Name length limit #5099

EnyMan opened this issue Mar 24, 2025 · 2 comments
Labels
component: pipelines Relates to the SageMaker Pipeline Platform component: processing Relates to the SageMaker Processing Platform type: bug

Comments

@EnyMan
Copy link

EnyMan commented Mar 24, 2025

Describe the bug
When using the PipelineDefinitionConfig i get the error

ClientError: An error occurred (ValidationException) when calling the CreatePipeline operation: Unable to
parse pipeline definition. Model Validation failed: string 
'my-custom-name-SHA256-hash' with 
length=91 cannot be greater than max=63.0 defined for ProcessingJobName.

The issue here is that the hash itself is 64 characters. Also the function that generates the job_name only limits the length of the generated name to 1024 characters.

To reproduce
Any basic pipeline with PipelineDefinitionConfig(use_custom_job_prefix=True)

from sagemaker import ScriptProcessor
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.pipeline_definition_config import PipelineDefinitionConfig
from sagemaker.workflow.steps import ProcessingStep

ROLE_ARN = "ADD_YOUR_ROLE_HERE"
IMAGE_ARN = "ADD_YOUR_IMAGE_HERE"

definition_config = PipelineDefinitionConfig(use_custom_job_prefix=True)
steps = [
    ProcessingStep(
        name=f"my-first-step",
        display_name="My First Step",
        processor=ScriptProcessor(
            command=["python3"],
            role=ROLE_ARN,
            instance_count=1,
            image_uri=IMAGE_ARN,
            instance_type="ml.m5.xlarge",
        ),
        code=f"example.py",
    ),
]
pipeline = Pipeline(
    name="test-pipeline",
    steps=steps,
    pipeline_definition_config=definition_config,
)

pipeline.upsert(
    role_arn=ROLE_ARN,
)

Expected behavior
The name should be truncated, so it fits in the 63 character limit

System information
A description of your system. Please provide:

  • SageMaker Python SDK version: 2.242.0
  • Framework name (eg. PyTorch) or algorithm (eg. KMeans): Any
  • Framework version: Any
  • Python version: 3.12.9
  • CPU or GPU: CPU
  • Custom Docker image (Y/N): Any
@lifo9
Copy link

lifo9 commented Mar 24, 2025

I can confirm the issue.

@benieric benieric added type: bug component: processing Relates to the SageMaker Processing Platform labels Mar 27, 2025
@rohangujarathi rohangujarathi added the component: pipelines Relates to the SageMaker Pipeline Platform label Mar 31, 2025
@deneriz-veridas
Copy link

Hi @EnyMan @lifo9. It seems ProcessinJobName has a limitation, check its requirements on the AWS API. However, it seems the hash is being computed by sagemaker-python-sdk, but I can't find where it is being appended to the job name.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component: pipelines Relates to the SageMaker Pipeline Platform component: processing Relates to the SageMaker Processing Platform type: bug
Projects
None yet
Development

No branches or pull requests

5 participants