indexes in databricks

Shambhu Rai 1,411 Reputation points
2024-01-18T10:18:01.5166667+00:00

Hi Expert, how we can add index on databricks using which command when the data is available like zorder and bloom indexes. pls help with sample query https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/delta-create-bloomfilter-index

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,514 questions
{count} votes

1 answer

Sort by: Most helpful
  1. PRADEEPCHEEKATLA 90,641 Reputation points Moderator
    2024-01-19T00:24:59.24+00:00

    @Shambhu Rai - Thanks for the question and using MS Q&A platform.

    A Bloom filter index is a space-efficient data structure that enables data skipping on chosen columns, particularly for fields containing arbitrary text.

    This official documentation: Bloom filter indexes explains about the what is bloom filter indexes and how to configure with sample example queries and notebook.

    Here is the article which explains how to use Bloom Filter Indexes using Databricks Delta.

    Z-ordering is a technique to colocate related information in the same set of files. This co-locality is automatically used by Delta Lake on Azure Databricks data-skipping algorithms.

    This official documentation: Data skipping with Z-order indexes for Delta Lake what is Z-ording filter indexes and how to configure with sample example queries and notebook.

    Hope this helps. Do let us know if you any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.