How to Use Wildcards in Data Flow Source Activity?

Raimond Kempees 101 Reputation points
2021-09-30T13:07:20.287+00:00

In Data Factory I am trying to set up a Data Flow to read Azure AD Signin logs exported as Json to Azure Blob Storage to store properties in a DB. The problem arises when I try to configure the Source side of things. No matter what I try to set as wild card, I keep getting a "Path does not resolve to any file(s). Please make sure the file/folder exists and is not hidden.".

I use the Dataset as Dataset and not Inline. I can click "Test connection" and that works. In the Source Tab and on the Data Flow screen I see that the columns (15) are correctly read from the source and even that the properties are mapped correctly, including the complex types.

The actual Json files are nested 6 levels deep in the blob store. Is that an issue?

I've tried all kinds of wildcards:

  • **
  • /**/*.Json
  • //////*.json
  • tenantId*/*.json
  • *.json

None of it works, also when putting the paths around single quotes or when using the toString function.

Does anyone know if this can work at all?

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
3,192 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,624 questions
{count} votes

Accepted answer
  1. Raimond Kempees 101 Reputation points
    2021-10-25T11:16:58.84+00:00

    @MartinJaffer-MSFT - thanks for looking into this. It proved I was on the right track. I've now managed to get json data using Blob storage as DataSet and with the wild card path you also have. The underlying issues were actually wholly different:

    It would be great if the error messages would be a bit more descriptive, but it does work in the end.

    1 person found this answer helpful.
    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. MartinJaffer-MSFT 26,236 Reputation points
    2021-10-04T22:48:20.19+00:00

    @Raimond Kempees

    Given a filepath
    tenantId=XYZ/y=2021/m=09/d=03/h=13/m=00/anon.json

    I was able to see data when using inline dataset, and wildcard path

    */*/*/*/*/*/*.json  
    

    137554-image.png

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.