Joined November 2010
Photos and videos
Pinned Tweet
1
1
13
882
Good fun at #measurecamptoronto talking about Privacy by design and how applying data minimization gets you both K-anonymization AND ability to efficiently construct Gramian matrices for OLS.
Pure bias reinforcing dopamine right there. @mgershoff summarizes #PrivacybyDesign at #measurecampto
2
207
Off to London for @MeasureCampLDN We have a new batch of Conductrics INC Select whiskey from @MilamandGreene over here from Texas and alfajores direct from Buenos Aires as prizes and give aways. For talks I will do something on Privacy Engineering and AB Testing.
1
1
9
382
tl;dl Privacy by design & customer first philosophy, if not two sides of the same coin, are consistent with each other. Collecting customer data is akin to asking for something. Customer first means asking from customers only when its a GOOD FAITH attempt to enrich THEIR lives.
"Collect everything" is easy to say. It can even be rationalized. Think a little deeper, though, and the dangers of this mindset are pretty easy to surface. That's the topic of our latest episode with @mgershoff! analyticshour.io/2024/09/03/…
1
6
296
I had such a great time getting to meet and chat with the Columbus analytics community. Thank you @tgwilson and the team at @cbusdaw for taking personal time to organize such a welcoming event!
21 Aug 2024
Privacy by design and more with @mgershoff at @cbusdaw !
1
1
7
471
I know people and I still haven't been able to get a bottle from the Wildlife collection. Gotta up my click submit game. @MilamandGreene prweb.com/releases/beep-beep…

ALT Working GIF

1
1
122
Ride Sharing product idea. User Setting Toggle for accept/decline "air freshener'.

ALT Decline Denied GIF

4
115
Great. As an extension to her note about effects heterogeneity. I think it is more that contextual bandits/Targeting is unlikely to be valuable unless there is a theory why there should be heterogeneity. If you don't then just run a simple RCT or Epsilon-First bandit.
These recent slides from Susan Athey and Guido Imbens at NBER are a great recent review of the most valuable data science methods I'm aware of. They cover tons of ground with lots of pointers. conference.nber.org/confer/2…
1
217
1
3
180
Love the @thegautamkamath Youtube course on Diff Privacy.
1
1
4
1,685
Do you mind if I DM you here on X @thegautamkamath ? If not no worries. Was looking for some links/follow up around a quick question on the dual view of the Diff Privacy and Pearson-Neyman Hypth Testing.
1
2
138
Oh! I have to read. I have been thinking informally about this for Tech/Data Science - that methodological preferences etc. are drawn from individuals who are clustered (in part by being socially connected in the valley via a small set of interconnected VCs)
Replying to @MishaTeplitskiy
1. osf.io/preprints/socarxiv/2a… Meta-analyses combine trials that are presumably *independent*. But social relatedness between researchers --> more similar effect estimates, suggesting dependency via same "school of thought"
4
248
We love you Ted and our hearts are broken. 💔
1
211
I had assumed that @AmericanAir rewards work like other programs with a default option if you don't select before new year. But I just discovered for AA there is no default and you just lose the rewards. So they needlessly alienate their loyal customers (I'm 6 yr Ex Plat)🤣
1
1
239
Thanks @NandoDF ! BTW this 2009 talk covers much of what @ilyasut discusses in a bit of detail but uses GZIP as the compressor as the example algo rather than a NN/LLM wiki.santafe.edu/images/9/9a…

Predicting the next word "only" is sufficient for language models to learn a large body of knowledge that enables then to code, answer questions, understand many topics, chat, and so on. This is clear to many researchers now, and there are nice tutorials on why this works by @ilyasut resorting to compression ( youtube.com/watch?v=AKMuA_TV… ) and by @geoffreyhinton ( youtube.com/watch?v=iHCeAotH… ). However, the emergence of types of understanding is not unique to language models. In arxiv.org/pdf/1804.06318.pdf by @notmisha and @brandondamos the authors trained models to predict the next few time stems of over a hundred robot hand sensors (Touch, Gyro, Accelerometer, Joint Info, Actuator Info, etc.). They ten found out that they could regress the shape of the thing the hand was touching from the activations of the neural networks using probes. That is, the model developed an internal representation of shapes even though it was simply used to predict "only" the next few senses. Awareness follows from simple predictions and interaction with the world.
1
213
I think it might be helpful if the first few sections, or first few minutes of each section, was dedicated to the history/context. So a little narrative including folks like Hume, Popper, Laplace (Bayes), Neyman/Pearson, maybe Stein etc. just to provide context. @PhDemetri
IMO, the only thing an intro stats student really must learn is that the goal of "statistics" (inference, really) is to quantify uncertainty. Too many students get so lost in the mechanics that they loose site of what they are trying to do.
3
552
Post at @Conductrics discussing a some Privacy Engineering ideas in the context of AB Testing. tl;dr Many standard stats can be calculated on aggregated data without need to link to individuals. Useful when cardinality of feature set << Individuals. blog.conductrics.com/privacy…
6
282
Has the author flown intra-Europe lately (like in the past 5 yrs? Of course main thing is to get from place to place safely and so far so good. But there is next to ZERO added niceties in the Euro Econ cabin (At least on BA, Finnair, Norwegian) nytimes.com/2023/11/22/trave…
119
Why this might be so consider that SQL is a sublanguage based on the relational algebra. Both the relational algebra and the relational calculus (see: Codd' theorem) are implementations of First Order Logic on databases. So thinking in SQL makes you think like Spock - logical.
This is so very, very true. SQL is one of the most valuable skills for anyone working with data. It’s more than just about how to run queries - it helps you understand the entities you are querying better, and it gives you stronger mental models to make sense of the world.
1
7
828