Hi,
I have a function called splitter with a blob trigger that uses Event Grid as its source. The only goal of this function is to grab a blob and partition it into several pieces so that it can be fed into an Event Hub, which has a publication limit of 1 MB.
This function is one of several functions that form an orchestration to ingest messages into Event Hub. The files we receive vary in size: 100 MB, 50 MB, 10 KB, 5 KB, etc.
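For context, the downstream publisher works roughly like this (a simplified sketch, not the real function; the connection string, hub name and payload handling are placeholders). The point is only to show why the 1 MB batch limit forces me to split the files first:

    # Simplified sketch of the downstream Event Hub publisher (placeholder names).
    from azure.eventhub import EventHubProducerClient, EventData

    producer = EventHubProducerClient.from_connection_string(
        "<EVENT_HUB_CONNECTION_STRING>", eventhub_name="<EVENT_HUB_NAME>")

    def publish_rows(rows):
        if not rows:
            return
        batch = producer.create_batch()      # the batch enforces the ~1 MB publication limit
        for row in rows:
            try:
                batch.add(EventData(row))
            except ValueError:               # current batch is full: send it and start a new one
                producer.send_batch(batch)
                batch = producer.create_batch()
                batch.add(EventData(row))
        producer.send_batch(batch)           # flush the remainder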
So, when deployed and running on Azure, I realized that the splitter function suddenly stops executing when I stress the orchestration by uploading up to 100 files, among them the 100 MB and 50 MB ones.
Locally, my integration tests run just fine. When running on Azure, the splitter function logs some messages and then suddenly stops executing.
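To give an idea of how I test it locally (this is illustrative only, not my real test file), I call splitter_fn directly with a fake InputStream and a mocked Context, and it completes without problems:

    # Illustrative local test sketch (not the actual test code).
    from unittest.mock import MagicMock

    import azure.functions as func

    class FakeBlob(func.InputStream):
        def __init__(self, data: bytes, name: str):
            self._data, self._name = data, name

        def read(self, size=-1) -> bytes:
            return self._data

        @property
        def name(self):
            return self._name

        @property
        def length(self):
            return len(self._data)

        @property
        def uri(self):
            return None

    def test_splitter_partitions_a_small_file():
        payload = b"col_a|col_b\r\n`1`|`x`\r\n`2`|`y`\r\n"
        ctx = MagicMock(spec=func.Context)
        ctx.invocation_id = "local-test"
        splitter_fn(FakeBlob(payload, "stage/sample.csv"), ctx)
        # assertions against the 'landing' container omitted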
This is the upper fragment of the splitter function:
def splitter_fn(blob: func.InputStream, context: func.Context):
    blob_content = blob.read()
    size_mb = round(len(blob_content) / (1024 * 1024), 2)
    logger.info(f'Downloaded blob | {blob.name} | {size_mb:,} MB', extra={
        'custom_dimensions': {
            'experiment': EXPERIMENT_NAME,
            'blob': blob.name,
            'size_mb': size_mb
        }
    })
    try:
        allowed_file_size_mb = float(EnvLoader.get_value('ALLOWED_FILE_SIZE_MB'))
        logger.info(f'ALLOWED_FILE_SIZE_MB set to {allowed_file_size_mb} MB')
        # We get the current metadata
        blob_service = BlobService()
        metadata = blob_service.get_blob_metadata(container='stage', blob_key=blob.name.split('/')[-1])
        partitioning = metadata['Partitioning']
        original_blob = metadata['OriginalBlob']
        if partitioning == 'Extra':
            q_files = 2
        elif partitioning == 'Normal':
            q_files = math.ceil(size_mb / allowed_file_size_mb) + 1
        else:
            raise ValueError(f'PartitionType "{partitioning}" is not allowed')
        logger.info(f'{q_files} files will be created after splitting | PartitionType: {partitioning}', extra={
            'custom_dimensions': {
                'experiment': EXPERIMENT_NAME,
                'q_files': q_files,
                'partitioning': partitioning
            }
        })
        # df = pandas.read_csv(StringIO(str(blob_content, 'utf-8')), sep='|', quotechar='`')
        df = pandas.read_csv(StringIO(blob_content.decode('utf-8')), sep='|', quotechar='`', quoting=csv.QUOTE_ALL)
        dfs = numpy.array_split(df, q_files)
        blob_service = BlobService()
        logger.info(f'Creating partitions | {len(dfs)} dataframes')
        for i, _df in enumerate(dfs):
            bytes_df = bytes(_df.to_csv(lineterminator='\r\n', index=False, sep='|', quotechar='`', quoting=csv.QUOTE_ALL), encoding='utf-8')
            blob_key = f"{original_blob}__{context.invocation_id}_partition_{i+1}.csv"
            metadata = blob_service.upload_file(
                data=bytes_df,
                blob_key=blob_key,
                container='landing',
                metadata={
                    'Source': 'Splitter',
                    'OriginalBlob': original_blob,
                    'PartitionNumber': str(i+1)
                }
            )
            size_mb = round(len(bytes_df) / (1024 * 1024), 2)
            logger.info(f'Uploaded file | landing/{blob_key} | Size: {size_mb:,} MB | Rows: {_df.shape[0]:,}', extra={
                'custom_dimensions': {
                    'experiment': EXPERIMENT_NAME,
                    'blob': f'landing/{blob_key}',
                    'size_mb': size_mb,
                    'rows': _df.shape[0],
                    'partition_number': i+1
                }
            })
        blob_service.close()
    except KeyError as ex:
        logger.exception(f'{type(ex)}: {ex}')
        logger.exception(traceback.format_exc())
    except Exception as ex:
        logger.exception(f'{type(ex)}: {ex}')
        logger.exception(traceback.format_exc())
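To give an idea of the volume, this is roughly how many partitions a 'Normal' split produces (the ALLOWED_FILE_SIZE_MB value below is made up just for illustration; the real one comes from app settings):

    import math

    allowed_file_size_mb = 0.5            # hypothetical value, for illustration only
    for size_mb in (100, 50, 5, 0.01):    # rough sizes of the files we receive
        q_files = math.ceil(size_mb / allowed_file_size_mb) + 1
        print(f'{size_mb} MB -> {q_files} partitions')
    # 100 MB -> 201, 50 MB -> 101, 5 MB -> 11, 0.01 MB -> 2

So, depending on the configured limit, a single 100 MB file can fan out into a couple of hundred blob uploads from one invocation.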
So, I went to Log Analytics to figure out what happened. This is what I found: a System.Threading.Tasks.TaskCanceledException. The error is unknown to me, since it never happened locally. In other runs on Azure, the function also stops suddenly with the same error.
I would appreciate any help, since I don't know why this is happening. The only clue I have is that the situation only happens when I pass 100 MB or 70 MB files to the function...
Regards