CDL, @HumanFeedbackIO, and @daisTMU are proud to jointly announce the launch of the Responsible AI Adoption for Social Impact (RAISE) pilot program, a national initiative to help nonprofit organizations across Canada adopt AI responsibly and effectively.
globenewswire.com/news-relea…
📊 Introducing Prolific’s AI user experience leaderboard—the most reliable framework for evaluating AI against human preferences.
We’re thrilled to share our benchmark on @huggingface, which assesses how well language models handle real-world tasks based on user experiences. 👇
This evaluation approach means we can tune #AI output to human preferences for better accuracy, alignment, and advancement.
Special thanks to @huggingface, @HumanFeedbackIO, @MLCommons, @collect_intel, and @FactoredAI for supporting the development of this benchmark.
AI in Philanthropy Conference speakers that stood out:
- Mark Bloomberg of Bloombergs Charity Law
- Tareq Alani of @chorus_ai_inc
- Jason Shim of CCNDR
- Elena Yunusov, @HumanFeedbackIO
Presented by @Hilborninfo and @FoundationMaga1
OpenAI's CPO Kevin Weil: Why You Need Evals to Build AI Products
"The easiest way to think about it is as a quiz or a test to gauge how well a model knows a certain set of subject material, or how good it is at responding to a set of questions.
Evals basically perform as benchmarks for how smart and capable the model is."
@kevinweil with @lennysan
Loved the @lennysan ep with @rauchg . Some non-obvious insights/themes that stood out to me:
Idea velocity going wild: AI lets you test, iterate, and fail way faster.
Roles are blurring, silos collapsing: AI means PMs, Designers, Marketers can actually build stuff, fewer hand-offs needed.
Open ecosystems are powerful: Shared code (LLM training data) shared ideas (v0 Community) = faster innovation.
Engineering grounds AI: Best AI tools spit out reliable, scalable, production-level code and practical results.
Feedback loops power AI: Direct user feedback literally trains the AI, making products get better quickly.
Human intent still king: AI is a superpowered co-pilot, but human creativity, judgment, and vision are crucial.
Tools for builders = force multiplier: Building tools that help others build stuff creates exponential impact.
At my org @HumanFeedbackIO we:
- Produce Paper Club feat. AI researchers doing talks on their papers for AI/ML builders
- Build AI Tinkerers Toronto community via meetups, tech workshops, etc
- Collaborate with leading universities on AI in Production mentorship program
Join our next Paper Club session @AITinkerers@HumanFeedbackIO - almost 700 people registered for this online session with the authors. @communicable - I just added @Paper2Audio to the page so you can listen to a summary of the paper.
This awesome team of @UofT students is about to demo the AI Tutor Chatbot they created over the course of the last couple of months as part of their course. I’m very proud of the effort you put in guys 👏🎉 @HumanFeedbackIO@AITinkerers#TorontoAI@mozilla
One reason I wish more humanities-oriented people would engage with AI is that models are writers, trained on words, producing words.
There are strengths & weaknesses in the models that can only be seen if you engage deeply with them as writers. They do not show up in benchmarks
How about applying to the automated production of reproducibility reports of existing papers? This should be easier than producing entirely new papers, and be quite useful in that this is a type of work that we probably don't do enough as a community.