-
Notifications
You must be signed in to change notification settings - Fork 37
Description
Hello,
I had successfully deploy, tested this solution and it worked. Version 0.42
Now with the same stack Status is "Running" for a whole day.
And I found out that ECS DelStack tasks are failing with this error
STOPPED (ResourceInitializationError: unable to pull secrets or registry auth: pull command failed: : signal: killed)
I have no idea how to troubleshoot.
I have all VPC endpoints in place, with attached private subnets, the same one sent as a parameter to the CF.
Also Security Group is the same "default" one, attached to ECS Service, and also to all of these endpoints.

Here are the parameters I'm using, almast all default.
This is Terraform code, but you can see the parameters
resource "aws_cloudformation_stack" "s3_find_and_forget_ohio" {
name = format("%s-s3f2-ohio", terraform.workspace)
template_url = format("https://solution-builders-%s.s3.%s.amazonaws.com/amazon-s3-find-and-forget/%s/template.yaml", data.aws_region.ohio.name, data.aws_region.ohio.name, local.s3f2_version)
parameters = {
AdminEmail = local.s3f2_admin
DeployVpc = false
VpcSecurityGroups = module.vpc_ohio.default_security_group_id
VpcSubnets = join(",", module.vpc_ohio.private_subnets)
CreateCloudFrontDistribution = true
AccessControlAllowOriginOverride = false
AthenaConcurrencyLimit = 20
DeletionTasksMaxNumber = 3
DeletionTaskCPU = 4096
DeletionTaskMemory = 30720
QueryExecutionWaitSeconds = 3
QueryQueueWaitSeconds = 3
ForgetQueueWaitSeconds = 30
CognitoAdvancedSecurity = "OFF"
EnableAPIAccessLogging = false
EnableContainerInsights = false
JobDetailsRetentionDays = 0
EnableDynamoDBBackups = false
RetainDynamoDBTables = true
AthenaWorkGroup = "primary" #module.athena_s3f2_tool_ohio.athena_workgroups.name
PreBuiltArtefactsBucketOverride = false
}
capabilities = [
"CAPABILITY_AUTO_EXPAND",
"CAPABILITY_IAM",
"CAPABILITY_NAMED_IAM",
]
tags = local.tags
}
Also, different minor issue, I wanted to use my own AthenaWorkGroup, but I was not able to set bucket permissions.
Tried with both roles, Athena role and that one another deployed with CF.
And yes, one important thing.
I don't see how to stop Deletion Job. It runs 24h, I see this error, for sure it will fail, so it would be better to have some option to cancel complete job