data.archive_file does not generate archive file during apply #39

zoltan-toth-mw · 2019-01-30T13:44:51Z

Hi there,

looks like data.archive_file does not generate archive file during apply.

Terraform Version

Terraform version: 0.11.11

provider.archive v1.1.0

Affected Resource(s)

archive_file

Terraform Configuration Files

data "archive_file" "deployment_package" {
  type = "zip"
  source_dir = "../../example/"
  output_path = ".${replace(path.module, path.root, "")}/tmp/example.zip"
}

Expected Behavior

Archive file is generated during terraform apply.

Actual Behavior

Archive file is not generated. However if I run terraform plan before apply, the output is generated.

Steps to Reproduce

Please list the steps required to reproduce the issue, for example:

terraform apply

ccayg-sainsburys · 2019-02-07T18:22:47Z

Based on your like on #3 I assume this is for the case where plan is executed and outputting a plan which is then applied from a clean environment.

We've also experienced this in a CI environment where plan and apply are separate stages and I can also simulate the issue with this code:

data "archive_file" "this" {
 type        = "zip"
 output_path = "test.zip"
 source_file = "a.txt"
}

resource "aws_s3_bucket_object" "this" {
 bucket = "YOURBUCKETHERE"
 key    = "test.zip"
 source = "test.zip"
}

and then running something along the lines of

terraform plan -out=tfplan
rm test.zip
terraform apply "tfplan"

dawidmalina · 2019-07-04T14:55:27Z

Same issue in my case

leelakrishnachava · 2019-10-31T10:49:53Z

still same issue.

ocervell · 2019-11-27T14:04:04Z

Same issue here.

ianwremmel · 2019-12-05T23:35:22Z

Seeing the same thing on 0.12.17: when I change a file in the directory referenced below, terraform plan doesn't pick up the change unless I taint aws_s3_bucket_object.cookbook

data "archive_file" "cookbook" {
  output_path = "${path.module}/temp/cookbook.zip"
  source_dir  = "${path.module}/cookbook-archive"
  type        = "zip"
}

resource "aws_s3_bucket_object" "cookbook" {
  bucket = module.cookbook.bucket_name
  key    = "cookbook.zip"
  source = data.archive_file.cookbook.output_path

  tags = {
    ManagedBy = "Terraform"
  }
}

I'm running plan via app.terraform.io, so I assume it would generate the archive on every run and not cache it from a previous run.
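A possible explanation for the missing diff: the source argument is only a path string, and that string stays the same when the archive content changes, so the S3 object has nothing to compare. A minimal sketch of one common approach follows, assuming the object is not KMS-encrypted (so its ETag is the MD5 of the content). The variable bucket_name is hypothetical and stands in for module.cookbook.bucket_name.

```hcl
# Hypothetical variable standing in for module.cookbook.bucket_name.
variable "bucket_name" {
  type = string
}

data "archive_file" "cookbook" {
  type        = "zip"
  source_dir  = "${path.module}/cookbook-archive"
  output_path = "${path.module}/temp/cookbook.zip"
}

resource "aws_s3_bucket_object" "cookbook" {
  bucket = var.bucket_name
  key    = "cookbook.zip"
  source = data.archive_file.cookbook.output_path

  # Assumption: no KMS encryption, so the ETag equals the MD5 of the content.
  # A changed archive yields a new etag, and the plan then shows an update.
  etag = data.archive_file.cookbook.output_md5
}
```

This only makes content changes visible to the plan. The archive is still written while the data source is read, so it does not help when apply runs in a clean environment.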

ocervell · 2019-12-07T19:19:15Z

I'm adding an additional workaround below.

If you don't know which files will change, I suggest something along the following:

data "external" "hash" {
  program = ["bash", "${path.module}/scripts/shasum.sh", "${path.module}/configs", "${timestamp()}"]
}

data "archive_file" "main" {
  type        = "zip"
  output_path = pathexpand("archive-${data.external.hash.result.shasum}.zip")
  source_dir  = pathexpand("${path.module}/configs")
}

output "archive_file_path" {
  value = data.archive_file.main.output_path
}

where ${path.module}/configs is the folder to archive. We pass timestamp() to the first data resource so that the hash is recomputed on every run.

The content of the shasum.sh script is as follows (note that this will work only on UNIX-based systems, so it won't work on Windows):

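#!/bin/bash
# Print a JSON object containing a combined checksum of the files in a folder.

# Strip a trailing slash from the folder path passed as the first argument.
FOLDER_PATH="${1%/}"
# Hash each file, then hash the list of per-file hashes into a single value.
SHASUM=$(shasum "$FOLDER_PATH"/* | shasum | awk '{print $1}')
# The external data source expects a JSON object of string values on stdout.
echo -n "{\"shasum\":\"${SHASUM}\"}"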

ianwremmel · 2019-12-07T19:28:14Z

yea, a while after I posted my last comment, I came up with something like

locals {
  source_dir = "${path.module}/cookbook-archive"
}

resource "random_uuid" "this" {
  keepers = {
    for filename in fileset(local.source_dir, "**/*"):
    filename => filemd5("${local.source_dir}/${filename}")
  }
}

data "archive_file" "cookbook" {
  # threw the `/temp/` in there to gitignore it easier, but in hindsight it  
  # could be just as easy to gitignore `cookbook*.zip`
  output_path = "${path.module}/temp/cookbook-${random_uuid.this.result}.zip"
  source_dir  = local.source_dir
  type        = "zip"
}

resource "aws_s3_bucket_object" "cookbook" {
  bucket = module.cookbook.bucket_name
  key    = "cookbook.zip"
  source = data.archive_file.cookbook.output_path

  tags = {
    ManagedBy = "Terraform"
  }
}

(did this from memory, so it might not quite work as-is, but it should be close)

ocervell · 2019-12-07T19:36:38Z

This is much better, thanks ! Maybe update your code so that it's valid (need a ',' line 3, and ${filename}" line 4)

ianwremmel · 2019-12-07T19:41:17Z

good catch, thanks! also dried it up a bit :)

ocervell · 2019-12-07T20:03:27Z

Oops, just ran into a weird thing with this code (seems like a provider error):

Error: Provider produced inconsistent final plan

When expanding the plan for module.slo-pipeline-cf-errors.random_uuid.hash to
include new values learned so far during apply, provider "random" produced an
invalid new value for .keepers["slo_config.json"]: was
cty.StringVal("d8073f7f8a404661c31a3cdf66ae6f8d"), but now
cty.StringVal("b42b077fe6dd6e3a57af845c5b0c6c0d").


This is a bug in the provider, which should be reported in the provider's own
issue tracker.

ianwremmel · 2019-12-07T20:06:36Z

weird. I haven't run into that, but I've also only made one change, so maybe it'll bite me next time. Maybe try one of the other file hash methods? could be something weird about md5 on one of the systems involved?

ocervell · 2019-12-07T20:46:51Z

Ah, it's because I'm dynamically adding a file (generated by TF) to my source directory, using the local_file resource. Even with a depends_on = [local_file.main] in the random_uuid.this resource, it seems like the fileset is executed before the file is dropped in the folder, thus confusing Terraform.

ianwremmel · 2019-12-07T21:00:44Z

what if you added it explicitly somehow? something like:

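resource "random_uuid" "this" {
  keepers = merge(
    # Explicit entry for the Terraform-generated file
    { localfile = md5(local_file.main.content) },
    # One entry per file already present in the source directory
    {
      for filename in fileset(local.source_dir, "**/*") :
      filename => filemd5("${local.source_dir}/${filename}")
    }
  )
}

(untested; merge() is used because a for expression can't be mixed with literal entries in one object, so it may still need tweaking... :)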

Paulmolin · 2020-02-07T10:41:51Z

The tricks work indeed, but then, each time a new apply is made, the archive and all resources that depend on it (e.g. a lambda function) will be modified, even if the content of the lambda did not change.
The Terraform code is then not idempotent anymore.

jharley · 2020-04-27T15:46:51Z

I ran into this on Terraform Cloud, also. It would be ideal if we could persist a single directory between the plan and apply phases (or, if archive_file was smart enough to regenerate the archive during "apply" if it was missing)

warrenstephens · 2020-07-27T20:43:49Z

If I create the initial zip file manually myself then the archive_file behavior on subsequent apply runs works fine for me -- using terraform version 0.12.28

bmonty · 2020-10-01T13:52:43Z

ccayg-sainsburys wrote:

Based on your like on #3 I assume this is for the case where plan is executed and outputting a plan which is then applied from a clean environment.

We've also experienced this in a CI environment where plan and apply are separate stages and I can also simulate the issue with this code:

data "archive_file" "this" {
 type        = "zip"
 output_path = "test.zip"
 source_file = "a.txt"
}

resource "aws_s3_bucket_object" "this" {
 bucket = "YOURBUCKETHERE"
 key    = "test.zip"
 source = "test.zip"
}

and then running something along the lines of

terraform plan -out=tfplan
rm test.zip
terraform apply "tfplan"

This comment helped me solve my issue. I'm using terraform in a Gitlab CI pipeline with separate plan and apply stages. My apply stage would fail because the archive file was not found.

What's happening (and the comment above helped me understand) is the plan step is where the archive file is actually created. To make this work in my CI pipeline, I added config to cache the files created by the plan stage and make them available to the apply stage.

I'd recommend changing the archive provider to produce the zip file during apply instead of plan. This would match with how I think about Terraform working. At a minimum, the docs for the archive provider should be updated to make it clear when Terraform creates the archive file.

hugbubby · 2020-10-28T06:50:35Z

how the hell did they manage to mess up a goddamn zip command

shambhu9803 · 2021-01-08T16:27:25Z

this solution worked for me adding source code hash hashicorp/terraform#8344 (comment)
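For readers who cannot follow the link, here is a minimal sketch of the source code hash pattern, which applies to AWS Lambda only. The resource names, the src and build paths, the handler, the runtime, and the lambda_role_arn variable are all hypothetical placeholders rather than anything taken from the linked comment.

```hcl
# Hypothetical input: ARN of an existing IAM role for the function.
variable "lambda_role_arn" {
  type = string
}

data "archive_file" "lambda" {
  type        = "zip"
  source_dir  = "${path.module}/src"
  output_path = "${path.module}/build/lambda.zip"
}

resource "aws_lambda_function" "example" {
  function_name = "example"
  role          = var.lambda_role_arn
  handler       = "main.handler"
  runtime       = "python3.12"
  filename      = data.archive_file.lambda.output_path

  # Changes whenever the archive content changes, so the function is
  # updated only when the packaged code actually differs.
  source_code_hash = data.archive_file.lambda.output_base64sha256
}
```

The hash makes Terraform notice code changes, but the zip is still produced when the data source is read. If plan and apply run in separate environments, the file has to be carried across or regenerated.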

amine250 · 2021-04-07T15:47:22Z

Having the exact same issue in our Gitlab CI pipeline.
We couldn't use artifacts since we have many zips and it might just upload sensitive data to Gitlab.
As a workaround, we are obliged to rerun terraform plan in the apply step just to create the zip file.

EDIT: According to this bit of documentation, you can defer the creation of the archive file until some resource is applied (ie. in the terraform apply step). One can imagine something like this, which also works as a workaround:

data "archive_file" "zip" {
  type        = "zip"
  source_file = "${path.module}/textfile.txt"
  output_path = "${path.module}/myfile.zip"
  depends_on = [
    random_string.r
  ]
}

resource "random_string" "r" {
  length  = 16
  special = false
}

or something like this, which has an equivalent dependency graph:

data "archive_file" "zip" {
  type        = "zip"
  source_file = "${path.module}/textfile.txt"
  output_path = "${path.module}/myfile-${random_string.r.result}.zip"
}

resource "random_string" "r" {
  length  = 16
  special = false
}

josjaf · 2021-04-16T15:38:25Z

I just ran into this issue in Gitlab as well

JonnyDaenen · 2021-06-10T09:45:19Z

I managed to tweak @amine250 's solution to get it working.
The random string does not work as it will already determine it in the plan phase it seems. Hence, I used a null resource that is triggered by a timestamp as mentioned here.

The downside of this approach is that even when the underlying files haven't changed, it will trigger and update. In my case this works out nicely as I'm using this to deploy a Cloud Function (GCP), which will not redeploy when there are no changes (the zipfile I upload to Cloud Storage has a hash in its name).

Note that using the null resource directly on the archive resource and triggering the null resource with a hash of the 2 file contents does not work.

# Dummy resource to ensure archive is created at apply stage
resource "null_resource" "dummy_trigger" {
  triggers = {
    timestamp = timestamp()
  }
}

data "local_file" "py_main" {
  filename = "${path.root}/../../../../cloud_function/main.py"
  depends_on = [
  # Make sure archive is created in apply stage
    null_resource.dummy_trigger
  ]
}

data "local_file" "py_req" {
  filename = "${path.root}/../../../../cloud_function/requirements.txt"
  depends_on = [
  # Make sure archive is created in apply stage
    null_resource.dummy_trigger
  ]
}



data "archive_file" "cf_zip" {
  type        = "zip"
  output_path = "${path.root}/../../../../tmp/cf.zip"

  source {
    content  = data.local_file.py_main.content
    filename = "main.py"
  }

  source {
    content  = data.local_file.py_req.content
    filename = "requirements.txt"
  }
}

akirax-git · 2021-06-20T16:58:15Z

I also ran into the same issue in Gitlab. The resource.random_string approach did not work, but resource.null_resource worked. Thanks!

mikiisz · 2021-07-08T12:40:33Z

Is there a follow up on this? I was here one year ago, this behaviour still occurs

dstuck · 2021-11-10T19:11:36Z

Wanted to leave a warning for anyone considering the suggestion:

If I create the initial zip file manually myself then the archive_file behavior on subsequent apply runs works fine for me -- using terraform version 0.12.28

I tested this out and it does not work. It simply unbreaks the apply by putting an old version of the zip file there.

test.tf:

data "archive_file" "api" {
  type        = "zip"
  source_dir  = "${path.module}/test_files/"
  output_path = "${path.module}/test.zip"
  excludes    = ["__pycache__"]
}

resource "local_file" "zip_sha" {
  content  = data.archive_file.api.output_sha
  filename = "${path.module}/test_sha.txt"
}

Taking an old copy of the zip file with sha , and running the following shows that we end up with the old version of the zip file present during apply.

cp old_test.zip test.zip
terraform plan -out=tfplan
cp old_test.zip test.zip
terraform apply "tfplan"
cat test_sha.txt
> 33585fa47331712f37d9206c3587b6a1380db53b
shasum test.zip
> 0dd4eb3e0f51b5f659c991d1ff93ef5d2c1cc2a0  test.zip

christhomas · 2022-01-15T13:14:21Z

bmonty wrote, quoting ccayg-sainsburys's reproduction above (terraform plan -out=tfplan, rm test.zip, terraform apply "tfplan"):

This comment helped me solve my issue. I'm using terraform in a Gitlab CI pipeline with separate plan and apply stages. My apply stage would fail because the archive file was not found.

What's happening (and the comment above helped me understand) is the plan step is where the archive file is actually created. To make this work in my CI pipeline, I added config to cache the files created by the plan stage and make them available to the apply stage.

I'd recommend changing the archive provider to produce the zip file during apply instead of plan. This would match with how I think about Terraform working. At a minimum, the docs for the archive provider should be updated to make it clear when Terraform creates the archive file.

Knowing this helped solve my pipeline problem, where I also plan and then apply in separate GitLab pipeline stages. The apply would attempt to upload the lambda zip files, which had been generated in the plan stage, and it would fail. Adding the zip folder to the plan stage's artifacts fixed it, so the files are available in the apply stage.

I don't know why the planning stage is being used to generate zip files. Planning should just be about making the plan file; applying should be about creating things and performing actions. As other people have commented, it seems wrong to do it in the plan stage.

edomaur · 2022-01-21T07:39:54Z

Got hit by that problem, and I also solved it using #39 (comment)

Works well (but it would be nice if the Terraform doc contained more borderline examples like this... )


CodyPaul · 2022-02-24T00:19:36Z

#39 (comment)

also did the trick for me

micchickenburger · 2022-05-18T21:57:52Z

The archive_file artifacts are produced during the plan stage. You just need to pass the artifacts across the stages.

For instance, for Gitlab CI:

image:
  name: hashicorp/terraform:1.1.9
  entrypoint:
    - '/usr/bin/env'
    - 'PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin'

variables:
  PLAN: "plan.tfplan"
  TF_IN_AUTOMATION: "true"

.terraform_before_script:
  - terraform --version
  # Ensure directory for lambda function zip files exists
  - install -d lambda_output
  - terraform init -input=false

stages:
  - plan
  - deploy

plan:
  stage: plan
  before_script: !reference [.terraform_before_script]
  script:
    - terraform plan -out=$PLAN -input=false
  artifacts:
    name: plan
    paths:
      - $PLAN
      - lambda_output

deploy:
  stage: deploy
  before_script: !reference [.terraform_before_script]
  script:
    - terraform apply -input=false $PLAN
  dependencies:
    - plan

Then, in your Terraform file:

data "archive_file" "function" {
  type        = "zip"
  source_dir  = "${path.root}/lambda/function"
  output_path = "${path.root}/lambda_output/function.zip"
}

amine250 · 2022-05-18T22:18:19Z

The archive_file artifacts are produced during the plan stage. You just need to pass the artifacts across the stages.

FYI, it's not recommended to store plan files as artifacts because they might contain sensitive data and are not encrypted.

krishansrimal · 2022-08-03T09:08:13Z

Got hit by same problem. I wonder why there is still no proper solution from archive provider :(

christophemorio · 2023-01-25T17:07:41Z

Same issue on terraform cloud.

Workaround: keep output_path consistent as long as var.inputfile does not change, and force the data source to refresh on every run.

data "archive_file" "scenario_zip" {
  type = "zip"

  output_path = "/tmp/${filesha1(var.inputfile)}.zip"

  source {
    content  = file(var.inputfile)
    filename = "myinputfile"
  }

  source {
    # Forces a datasource refresh
    content  = timestamp()
    filename = ".timestamp"
  }
}

bendbennett · 2023-01-26T14:58:01Z

The fundamental issue here is that the archive data source has side effects (i.e., creates a .zip).

Data sources are an abstraction that allow Terraform to reference external data. Unlike managed resources, Terraform does not manage the lifecycle of the resource or data. Data sources are intended to have no side-effects.

When terraform plan -out=tfplan is executed, the Read function in the data source is called, creating the archive and updating the state. The generated tfplan file contains no changes. Consequently, executing terraform apply tfplan does nothing.

This is expected behaviour for Terraform, again the issue is the fact that the archive data source has side effects. Currently, the workarounds described which have implicit or explicit dependencies on a managed resource are the only way to try and force execution during terraform apply rather than terraform plan.

monti-python · 2023-10-12T15:54:54Z

An even better solution is to use timestamp() as part of the output_path:

data "archive_file" "zip" {
  type        = "zip"
  source_file = "${path.module}/textfile.txt"
  output_path = "${path.module}/myfile-${timestamp()}.zip"
}

This will force terraform to create the zip during the apply phase, and doesn't need any extra providers

queglay · 2023-11-04T06:35:59Z

I have the same problem, but this shouldn't be marked resolved with a timestamp workaround. Forcing zips and lambda layers to get versioned up all the time is wasteful and slows down CI. The hash of the zip or its intended contents should determine whether dependencies are retriggered, and currently it doesn't.

iVariable · 2023-11-06T20:04:39Z

This does the trick for me in combination with locals (to reuse the path of the archive down the line). It creates a new archive only if the underlying source file has changed. Notice the filemd5 in the lambda_api_archive_path.

locals {
  lambda_api_function_name = "api"
  lambda_api_binary_path   = "${path.cwd}/../build/${local.lambda_api_function_name}"
  lambda_api_archive_path  = "${path.module}/tf_generated/${local.lambda_api_function_name}-${filemd5(local.lambda_api_binary_path)}.zip"
}

data "archive_file" "lambda_api_zip" {
  type        = "zip"
  source_file = local.lambda_api_binary_path
  output_path = local.lambda_api_archive_path
}

WalterClementsJr · 2023-12-27T09:55:18Z

currently on Terraform v1.6.6 and it has happened twice this week. It's driving me insane.

lowkasen · 2024-01-08T16:55:57Z

facing the same issue

andrewedstrom · 2024-01-31T03:18:44Z

@bendbennett forgive me, but I find your response unsatisfactory.

data.archive_file is an official provider from terraform that lives in this repo. What good does it do to tell us that the code in this repo, that y'all maintain, does something non-idiomatic?

If there's a more idiomatic way to do this, please tell us. What is hashicorp's recommended approach to creating a zip from a file in source code?

mikemiller35 · 2024-02-22T18:21:54Z

Same issue here

antoinefaure · 2024-03-20T01:18:35Z

This works fine for me. Just followed this SO thread:
https://stackoverflow.com/questions/53477485/terraform-does-not-detect-changes-to-lambda-source-files

overfl0wd · 2024-03-20T13:22:59Z

Issue still present. My plan and apply stages run separately in Gitlab CI/CD pipelines, so for me the fix was caching *.zip in my pipeline config so the files were passed from one stage to another.

JacobDiChiacchio · 2024-03-21T18:42:52Z

Also facing this issue. Why is this closed?

Bruno1298 · 2024-03-26T15:57:02Z

https://stackoverflow.com/questions/53477485/terraform-does-not-detect-changes-to-lambda-source-files

only If you use AWS :/

bwhaley · 2024-04-16T19:04:43Z

One point of confusion that I have is the difference between the archive_file resource and the data source. The docs say that the resource is deprecated, but #218 says otherwise. The resource generates the zip file during apply.
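For comparison, a minimal sketch of the resource form, assuming the installed provider version still ships the archive_file resource (the docs mark it as deprecated) and that it accepts the same arguments as the data source. The name example and the file paths are placeholders.

```hcl
# Managed resource variant: the archive is part of the plan as a create
# action, so it is written during apply rather than while planning.
resource "archive_file" "example" {
  type        = "zip"
  source_file = "${path.module}/textfile.txt"
  output_path = "${path.module}/myfile.zip"
}
```

Other resources would then reference archive_file.example.output_path instead of data.archive_file.example.output_path. Whether to rely on a resource that is documented as deprecated is a judgment call.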

github-actions · 2024-05-23T13:53:30Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

ndarwincorn mentioned this issue May 28, 2019

Unable to load notify_slack.zip (no such file or directory) terraform-aws-modules/terraform-aws-notify-slack#21

Closed

ocervell mentioned this issue Nov 27, 2019

ZIP file is not recreated if running in a separate stage terraform-google-modules/terraform-google-event-function#37

Closed

ocervell mentioned this issue Dec 7, 2019

Updating SLO config does not redeploy Cloud Function terraform-google-modules/terraform-google-slo#7

Closed

josh803316 referenced this issue in sketchy/terraform-aws-lambda-at-edge Feb 19, 2022

trick from https://github.com/hashicorp/terraform-provider-archive/issues/39#issuecomment-815021702

024d440

josh803316 referenced this issue in sketchy/terraform-aws-lambda-at-edge Feb 19, 2022

trick from https://github.com/hashicorp/terraform-provider-archive/issues/39#issuecomment-815021702

0f04567

josh803316 referenced this issue in sketchy/terraform-aws-lambda-at-edge Feb 19, 2022

trick from https://github.com/hashicorp/terraform-provider-archive/issues/39#issuecomment-815021702

4685786

paulschwarzenberger mentioned this issue Jun 21, 2022

OSE-850 support separate tf plan and apply stages domain-protect/domain-protect#147

Merged

bendbennett closed this as completed Jan 26, 2023

github-actions bot locked as resolved and limited conversation to collaborators May 23, 2024
