Skip to content

DataLake file system and path are required when using Azure Data Lake Gen2 #2444

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
thomasjsweet opened this issue Mar 23, 2025 · 0 comments

Comments

@thomasjsweet
Copy link

The instruction here:
https://github.com/Azure-Samples/azure-search-openai-demo/blob/main/docs/login_and_acl.md#azure-data-lake-storage-gen2-setup
state that file system and path are optional for the data lack storage.

#'To run this script with a Data Lake Storage Gen2 account, first set the following environment variables:

AZURE_ADLS_GEN2_STORAGE_ACCOUNT: Name of existing Data Lake Storage Gen2 storage account.
(Optional) AZURE_ADLS_GEN2_FILESYSTEM: Name of existing Data Lake Storage Gen2 filesystem / container in the storage account. If empty, gptkbcontainer is used.
(Optional) AZURE_ADLS_GEN2_FILESYSTEM_PATH: Specific path in the Data Lake Storage Gen2 filesystem / container to process. Only PDFs contained in this path will be processed.
Once the environment variables are set, run the scri'

I set AZURE_ADLS_GEN2_FILESYSTEM_PATH but did not set the other two as they were supposed to be optional.
My result is Traceback (most recent call last):
File "/workspaces/Azure-Search-AI-Demo/./app/backend/prepdocs.py", line 354, in
list_file_strategy = setup_list_file_strategy(
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspaces/Azure-Search-AI-Demo/./app/backend/prepdocs.py", line 94, in setup_list_file_strategy
raise ValueError("DataLake file system and path are required when using Azure Data Lake Gen2")
ValueError: DataLake file system and path are required when using Azure Data Lake Gen2

expected result is that the two additional environment variables are optional.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant