Should we choose a star schema or a flat table for our reporting layer?
#1
I’m trying to decide between using a star schema or a denormalized flat table for our new reporting layer, and I keep going in circles. My team’s main argument is that the flat table would simplify the initial ETL and make it easier for our analysts to write queries without complex joins. But I’m worried about long-term maintainability and how we’ll handle slowly changing dimensions down the road.
Reply
#2
We started with a flat table and it felt simple at first. ETL was quicker and analysts could run basic queries without joins. Then we saw drift when dimensions changed and we spent weeks patching reports.
Reply
#3
I kept thinking about long term concerns like slowly changing dimensions. It got gnarly to keep history in a flat table and we ended up with messy flags and inconsistent dates.
Reply
#4
One teammate pushed for a star schema and we built a small warehouse with conformed dims. It slowed initial load but later kept governance and made audits easier.
Reply
#5
The business analyst on the team said the flat table felt empowering for quick ad hoc queries. We still hit questions about history and what happened when products launched.
Reply
#6
We ran a hybrid experiment. We kept a flat view for analysts but also built a lightweight dimensional layer for history and SCD. We refreshed it nightly.
Reply
#7
Maybe the real problem is data quality and naming conventions more than the model. If metrics are inconsistent then any structure will struggle.
Reply
#8
I briefly measured query times on a sample and tracked maintenance effort. The flat table was fast to query at first but maintenance spiked later when rules changed.
Reply


[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Forum Jump: