From what I understand, the Bronze table in the Delta Lake architecture holds the raw and (more or less) unmodified data in table format. Does this mean that I also shouldn't partition the data in the Bronze table? Partitioning seems like something that depends on the use case, which would make it a concern for the Silver or even Gold tables.
Look at this example:

    def read():
        return spark.read\
            .format("csv")\
            .option("delimiter", "\t")\
            .option("header", True)\
            .load("file.tsv.gz")

    table_name = "file"
    location = f"/mnt/storage/{table_name}"

    read().write.partitionBy("something").format("delta").save(location)
    spark.sql(f"CREATE TABLE {table_name} USING DELTA LOCATION '{location}/'")

Notice the partitionBy("something") call. Does this belong in a Bronze table?
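To make the two options concrete, here is a minimal sketch of what I am weighing. The helper name write_delta and its partition_cols parameter are made up for illustration, and "something" is still the placeholder column from the example above. The only difference between the variants is whether partitionBy is applied when writing the Bronze table.

```python
from typing import Optional, Sequence


def write_delta(df, location: str, partition_cols: Optional[Sequence[str]] = None) -> None:
    """Write df (assumed to be a Spark DataFrame) as a Delta table at location.

    Hypothetical helper: the data is partitioned only if partition_cols is given.
    """
    writer = df.write.format("delta")
    if partition_cols:
        # DataFrameWriter.partitionBy takes the column names as varargs
        writer = writer.partitionBy(*partition_cols)
    writer.save(location)


# Variant A: Bronze stays unpartitioned, as close to the raw file as possible
# write_delta(read(), "/mnt/storage/file")

# Variant B: Bronze is already partitioned by a use-case-driven column
# write_delta(read(), "/mnt/storage/file", ["something"])
```

With variant A, any partitioning would be deferred to whatever job later builds the Silver or Gold tables.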