Discussion about this post

User's avatar
Michał I.'s avatar

Hi,

As in many other times I can only agree with most of the statements. It’s more of a “this is how things are done and it’s hard to believe people do it otherwise”.

I’d like to add one thought or a hint maybe, I tend to be strict to 3NF when creating an application where I have all the CRUD coded and supported with an UI (with some MVC or Flux architecture pattern). But when I’m in analytics and reporting world with code first (SQL, Python etc.) access to data plus visualization tools, I tend to allow myself a bit of denormalization which leads to data redundancy (like for example codes kept in some dictionary- I have the dictionary but I copy over the values to the entity, not just the relationship ) and needs more work in case of a change but makes the querying easier.

The classic “it depends” I guess.

Cheers

Expand full comment
Rama krishna's avatar

Thank you for the write up and interesting facts to revise the fundamentals of data modelling.

@JOE REIS are you planning to put down a thought how the data modelling be prevalent or need to be considered if the data lakehouse approach considered for Analytical work loads. Do we still consider relational modelling for datalake solutions as well ?

how about the real time streaming data and consumption pattern from those data sets ?

Expand full comment
40 more comments...

No posts