[Enhancement]: optional model_name for endpoint configuration production variants

### Description

Currently, you cannot create SageMaker endpoints that can be used with SageMaker inference components. The TL;DR is that inference components must be connected to an endpoint deployed without a model attached to its production variant, and **this isn't possible because `model_name` is a required property of an endpoint configuration's production variant.** It shouldn't be.

Long-winded version:
Terraform cannot yet create inference components, although [there is a year-old request to add them](https://github.com/hashicorp/terraform-provider-aws/issues/35226). Inference components get attached to existing endpoints. If you try to attach one to an endpoint that already has a running model, you get `Invalid request provided: Inference Components are not supported in this Endpoint. Please make sure this endpoint can deploy inference components.`

I did some research and found that in order to attach an inference component to an endpoint, the endpoint must have been configured without any models. This is done by not specifying a model name when creating a production variant inside an endpoint configuration. You can do this via boto3 (although not the UI), as `ModelName` is not a required property ([see here](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker/client/create_endpoint_config.html)). However, Terraform's AWS provider [does specify `model_name` as required](https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/sagemaker_endpoint_configuration.html#model_name-1). This blocks you from attaching an inference component to a Terraform-created endpoint.
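For reference, here is a minimal sketch of the boto3 route described above: a `create_endpoint_config` request whose production variant has no `ModelName`. The helper name `build_endpoint_config_request`, the role ARN, and the resource names are hypothetical placeholders. Passing `ExecutionRoleArn` is an assumption based on how AWS examples configure endpoints for inference components, and the instance type is simply copied from the Terraform configuration in this issue.

```python
from typing import Any


def build_endpoint_config_request(
    name: str,
    execution_role_arn: str,
    variant_name: str = "variant-1",
    instance_type: str = "ml.t2.medium",
    initial_instance_count: int = 1,
) -> dict[str, Any]:
    """Build create_endpoint_config kwargs with a model-less production variant."""
    return {
        "EndpointConfigName": name,
        # Assumption: endpoints meant for inference components take the
        # execution role on the endpoint config rather than on a model.
        "ExecutionRoleArn": execution_role_arn,
        "ProductionVariants": [
            {
                "VariantName": variant_name,
                "InstanceType": instance_type,
                "InitialInstanceCount": initial_instance_count,
                # "ModelName" is deliberately omitted: this is the part the
                # Terraform resource currently does not allow.
            }
        ],
    }


if __name__ == "__main__":
    # boto3 is imported here so the helper above stays usable without it.
    import boto3

    request = build_endpoint_config_request(
        name="my-endpoint-config",
        # Hypothetical role ARN; replace with a real SageMaker execution role.
        execution_role_arn="arn:aws:iam::123456789012:role/example-sagemaker-role",
    )
    sagemaker = boto3.client("sagemaker")
    response = sagemaker.create_endpoint_config(**request)
    print(response["EndpointConfigArn"])
```

Running the script needs AWS credentials and creates a real endpoint configuration, so treat it as an illustration of the request shape rather than a tested recipe.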

### Affected Resource(s) and/or Data Source(s)

aws_sagemaker_endpoint_configuration

### Potential Terraform Configuration

```terraform
resource "aws_sagemaker_endpoint_configuration" "ec" {
  name = "my-endpoint-config"

  production_variants {
    variant_name           = "variant-1"
    initial_instance_count = 1
    instance_type          = "ml.t2.medium"
  }

  tags = {
    Name = "foo"
  }
}
```


### References

https://aws.amazon.com/blogs/machine-learning/easily-deploy-and-manage-hundreds-of-lora-adapters-with-sagemaker-efficient-multi-adapter-inference/

### Would you like to implement a fix?

Yes

[Enhancement]: optional model_name for endpoint configuration production variants #40644
