TableValuedFunction.python_worker_logs

Returns a DataFrame of logs collected from Python workers.

Syntax

spark.tvf.python_worker_logs()

Parameters

This function takes no parameters.

Returns

pyspark.sql.DataFrame: A DataFrame containing logs from Python workers.

Examples

import pyspark.sql.functions as sf
import logging

@sf.udf("string")
def my_udf(x):
    logger = logging.getLogger("my_custom_logger")
    logger.warning("This is a warning")
    return str(x)

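# Enable collection of Python worker logs before running the UDF.
spark.conf.set("spark.sql.pyspark.worker.logging.enabled", "true")
# Running the UDF triggers the logger.warning call on the worker.
spark.range(1).select(my_udf("id")).show()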
+----------+
|my_udf(id)|
+----------+
|         0|
+----------+
spark.tvf.python_worker_logs().select(
    "level", "msg", "context", "logger"
).show(truncate=False)
+-------+-----------------+---------------------+----------------+
|level  |msg              |context              |logger          |
+-------+-----------------+---------------------+----------------+
|WARNING|This is a warning|{func_name -> my_udf}|my_custom_logger|
+-------+-----------------+---------------------+----------------+
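
The `level`, `msg`, and `logger` columns in the output above match the fields of a standard Python logging record. The following is a minimal sketch that uses only the standard library to show which record attributes those values would correspond to. It does not use Spark and is not how Spark collects worker logs; the `ListHandler` class and the `rows` variable are hypothetical names introduced for illustration.

```python
import logging


class ListHandler(logging.Handler):
    """Hypothetical handler that stores each record as a dict (illustration only)."""

    def __init__(self):
        super().__init__()
        self.rows = []

    def emit(self, record):
        # Keep the three fields that mirror the columns selected in the example.
        self.rows.append(
            {
                "level": record.levelname,   # e.g. "WARNING"
                "msg": record.getMessage(),  # the formatted log message
                "logger": record.name,       # name passed to logging.getLogger
            }
        )


handler = ListHandler()
logger = logging.getLogger("my_custom_logger")
logger.addHandler(handler)
logger.propagate = False  # keep this demo's record out of the root logger

# Same call as inside my_udf in the example above.
logger.warning("This is a warning")

rows = handler.rows
print(rows)
```

The `context` column (`{func_name -> my_udf}` in the example) has no counterpart in this sketch; it appears only in the DataFrame returned by `spark.tvf.python_worker_logs()`.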