Data warehouse? No, data kitchen! Get the right ingredients and serve on a plate!

Joined May 2019
Today's bets after lunch. More in the green, but heavier in the red.
Cushioning down a bit further @ end of day.
Datapancake retweeted
Replying to @felipehoffa
@felipehoffa is demonstrating that your cloud database is no longer just a database. I foresee that data lineage, data quality, data observability, an ML & data science engine, and data ingestion can be one product hoffa.medium.com/facebook-pr…
Sweden, ages 15-29, deaths with Covid-19 2020 vs deaths by drug overdose 1997-2019.
One of my dogs stealing my ass place
Mo Farah! Runner supreme! 🇬🇧
Scikit-like data prep in SQL is all cool; AI models are harder. I tried building an image recognition CNN in SnowflakeDB. Got as far as the first few steps before deciding “naaa...!” Looping and recursion just don’t work. It has to be in-memory dataframe ops.
On that note: Cloud computing can be a climate killer! technologyreview.com/2019/06…

Watched the CL final. Half of the excitement gone with no crowds. Football’s magic lies in the connection between pitch and terrace. Take away one and not much remains.
Bought two bottles of Champagne for home delivery at a slightly higher price from a local wine merchant rather than from Amazon. We need to keep local business going.
When you tweet, how will you be labeled? Analysis for topic “Pharma”, over a few days in Nov 2019.
Performance testing row-level security cake bake in the data kitchen: 1. python (random user generation and mapping to ranges of dim values) 2. node.js (random sessions per user, executing async queries on fact joined with user-dim-mapping) 3. @SnowflakeDB to execute on massive scale
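Step 1 above could look roughly like this. A minimal sketch only: the function names, the `user_00000` ID format, and the integer `dim_id` ranges are all assumptions for illustration, not the actual test harness.

```python
import random


def generate_user_mappings(n_users, dim_max, max_span, seed=0):
    """Return {user_id: (low, high)}; each user may see dim values low..high inclusive.

    Hypothetical helper: names and ID format are illustrative assumptions.
    """
    rng = random.Random(seed)  # seeded so a test run is repeatable
    mappings = {}
    for i in range(n_users):
        low = rng.randint(1, dim_max)
        high = min(dim_max, low + rng.randint(0, max_span))
        mappings[f"user_{i:05d}"] = (low, high)
    return mappings


def visible_rows(fact_rows, mappings, user_id):
    """Emulate the row-level filter: keep fact rows whose dim_id is in the user's range."""
    low, high = mappings[user_id]
    return [row for row in fact_rows if low <= row["dim_id"] <= high]


mappings = generate_user_mappings(n_users=1000, dim_max=500, max_span=50)
facts = [{"dim_id": d, "amount": d * 10} for d in range(1, 501)]
sample = visible_rows(facts, mappings, "user_00000")
```

In the database the same filter would be a join between the fact table and the user-to-range mapping; the list comprehension only stands in for it locally.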
Datapancake retweeted
State of California COVID-19 Data is Now Available to Public via @SnowflakeDB Data Marketplace snowflake.com/news/state-of-… #COVID19 ❄️

Brings so much hope!
Breaking News: A long-awaited blood test for Alzheimer’s is in reach, scientists say. It could speed up treatment research and aid in diagnosing dementia patients. nyti.ms/3jN5V9Q
I mistakenly started to post fact-based replies, with statistical significance accounted for, in a BBC Covid thread. Remembered the meaning of “futile” and went back to delete them all. Wasted 2 hours of good life, still saved another 20 at risk.
Fast food is the new terrorism
“They saw two people in there and they were ordering 20-odd meals at 1:30 this morning.” A big KFC order helped tip off the Australian authorities that 16 people had broken coronavirus restrictions by attending a surprise birthday in a suburb of Melbourne. nyti.ms/3iWJazZ
I’m currently trying out @StarschemaLtd’s shared Covid dataset on @SnowflakeDB, which has a new analytic UI natively available. Here’s a graph of some countries’ DailyDeaths/million numbers as a 21d moving avg.
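For anyone wanting to reproduce the metric outside the UI, here is a minimal sketch of a trailing 21-day moving average of daily deaths per million. The function name and inputs are assumptions, and so is the choice to average over fewer days at the start of the series; the graph above may handle the first 20 days differently.

```python
from collections import deque


def moving_average_per_million(daily_deaths, population, window=21):
    """Trailing moving average of daily deaths per million inhabitants.

    Assumption: the first (window - 1) days average over the days seen so far.
    """
    buffer = deque()
    running_total = 0.0
    averages = []
    for deaths in daily_deaths:
        per_million = deaths / population * 1_000_000
        buffer.append(per_million)
        running_total += per_million
        if len(buffer) > window:
            running_total -= buffer.popleft()  # drop the day that left the window
        averages.append(running_total / len(buffer))
    return averages


# Toy numbers, not real data: a constant 10 deaths/day in a population of 10 million.
flat = moving_average_per_million([10] * 30, population=10_000_000)
```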
Converting Teradata to Snowflake SQL is truly data mining. Like hammer and chisel 19th century Dickensian coal data mining. Carrying a canary in a cage.
Building AS OF JOINS for TS analysis in #SnowflakeDB. My solution uses LAG/LEAD to persist the valid period per record, to join on. Good performance so far, meaning low single seconds over 1B rows x 1M rows.
Actually 1B rows x 100M rows (one year of sensor-A data @ 100/s as-of joined with one year of sensor-B @ 10/s). And I’m running on a low-end size cluster.
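The valid-period idea can be sketched in plain Python. This is an illustration of the technique under assumptions, not the SQL actually used: each sensor-B record gets a valid_to taken from the next record's timestamp (what LEAD would give), and each sensor-A record then picks the B record whose period covers its timestamp. Function names and the (timestamp, value) tuple layout are made up for the example.

```python
from bisect import bisect_right


def add_valid_periods(records):
    """Turn (ts, value) records into (valid_from, valid_to, value) tuples.

    valid_to is the next record's timestamp (the LEAD equivalent),
    or None for the most recent record, meaning it is still valid.
    """
    ordered = sorted(records, key=lambda r: r[0])
    periods = []
    for i, (ts, value) in enumerate(ordered):
        valid_to = ordered[i + 1][0] if i + 1 < len(ordered) else None
        periods.append((ts, valid_to, value))
    return periods


def as_of_join(left, periods):
    """For each (ts, value) in left, attach the value whose period covers ts.

    The binary search finds the last period with valid_from <= ts, which is
    the same row the predicate valid_from <= ts < valid_to would select.
    """
    starts = [p[0] for p in periods]
    joined = []
    for ts, value in left:
        idx = bisect_right(starts, ts) - 1
        match = periods[idx][2] if idx >= 0 else None  # None: nothing valid yet
        joined.append((ts, value, match))
    return joined


sensor_b = [(0, "b0"), (10, "b1"), (20, "b2")]
sensor_a = [(-1, "x"), (0, "a0"), (5, "a1"), (10, "a2"), (25, "a3")]
periods = add_valid_periods(sensor_b)
joined = as_of_join(sensor_a, periods)
```

Persisting valid_from and valid_to per record is what turns the as-of lookup into an ordinary range join.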