@Chandra Boorla Thank you again for your support.
Just to clarify the root of the issue: even though I only define "Produkt"
as the primary key when creating the feature table, Databricks automatically promotes the timestamp column ("Monat"
) to be a primary key as well — simply because it's specified as a timestamp_keys
during table creation.
You can see this in the attached screenshot:
Produkt
is marked as PK
Monat
is marked as PK(TS)
– i.e., timestamp and also implicitly primary key
This happens even though I explicitly exclude Monat
from the primary_keys
list. As a result, during AutoML processing with feature_store_lookups
, the Monat
column gets duplicated (once from the feature table, once from the training dataset), which causes the schema mismatch error — just as you described in your earlier message.
So while your recommended contract makes sense logically, the underlying platform behavior enforces a PK constraint on the timestamp column, and this seems to conflict with AutoML's expectations in the join.
Do you know of any way to prevent timestamp_keys
from becoming a primary key, or is this behavior currently hardcoded?
Thanks again for helping clarify this edge case!