Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Returns the number of items collected in the KLL float sketch.
Syntax
from pyspark.sql import functions as sf
sf.kll_sketch_get_n_float(col)
Parameters
| Parameter | Type | Description |
|---|---|---|
col |
pyspark.sql.Column or str |
The KLL float sketch binary representation. |
Returns
pyspark.sql.Column: The count of items in the sketch.
Examples
Example 1: Get count of items in KLL float sketch
from pyspark.sql import functions as sf
df = spark.createDataFrame([1.0,2.0,3.0,4.0,5.0], "FLOAT")
sketch_df = df.agg(sf.kll_sketch_agg_float("value").alias("sketch"))
sketch_df.select(sf.kll_sketch_get_n_float("sketch")).show()
+------------------------------+
|kll_sketch_get_n_float(sketch)|
+------------------------------+
| 5|
+------------------------------+