librarian @msulibrary, professor, hacker, author, footballer, skateboarder, fly-fisher, dogwalker, coffee brewer - I am large, I contain multitudes. he/his
ALT Responsible AI: Building Tools and Frameworks for Transparent and Ethical AI Implementations
Artificial intelligence (AI) can be used in libraries and archives as a powerful tool for enhancing metadata, improving search and discovery, recommending resources, powering library chatbots, and more. However, AI systems may incorporate surveillance technologies that threaten user privacy, and AI often reflects the biases of our society due to biased training data—for example, facial recognition technology is worse at recognizing the faces of people of color if training data is predominantly comprised of white faces. This talk discusses the early activities of the IMLS-funded Responsible AI project, which examines this tension between innovating library services and protecting library communities. The Responsible AI team will present key takeaways from an environmental scan of AI projects in libraries and archives, and our plans for an AI Harms Analysis tool that can ground AI software develo
Every time I get *yet another* rejection of my work analyzing existing models/datasets (because it "lacks novelty"), I worry that our obsession with novelty in ML will make us repeat the same mistakes, without ever understanding why.
✏️ AI Terminology Updates 2022 ➡️ 2024
NLP ➡️ "language modeling"
Multi-task training ➡️ instruction tuning
Finetuning ➡️ "post-training"
Semantic parsing ➡️ API tool use
Robustness ➡️ Red teaming
Train/test split ➡️ train/train split
Transfer learning ➡️ "it's already in the train set"
Knowledge distillation ➡️ training on GPT-4
Data augmentation ➡️ synthetic data
Released ➡️ “open sourced”
Formatting ➡️ prompt engineering
Evaluation➡️leaderboards ➡️ GPT-4 leaderboard elo rating
Transformer ➡️ MoE ➡️ Mamba?
Compute rich ➡️ Compute poor ➡️ API researcher
Large ➡️ tiny
Small ➡️ irrelevant
ML ➡️ AI ➡️ “A[some unnecessary letter]I”
NFT bro ➡️ e/<something>
(Just kidding! Don't be offended)
What am I missing?
University of Michigan will be the first major university to offer a custom AI platform for its entire community. Just in time for the start of the fall semester, ITS is releasing a suite of custom GenAI tools unlike anything currently offered in higher education, providing our users with AI tools that firmly emphasize the importance of equity, accessibility, and privacy. Thanks to VP Ravi Pendse and team for their leadership.
This is so important. When I talk about dismantling vocational awe in our lives/workplace and I say “put yourself first” I don’t mean doing stuff like Liz says here. Taking care of yourself at the expense of others is never a good look. And I will never advocate for that.
People say, "I can't take care of other people unless I take care of myself first." And they'll use that sentence to justify vacationing in Hawai'i, or dining indoors during an airborne plague.
I've been doing news round-ups on AI ethics & policy, for TikTok/Instagram. Towards this end I've started a more organized spreadsheet so it's easier for folks to dig in and read more if they like! (And feel free to suggest other articles!) #aiethicsnewsbit.ly/ai-ethics-news
Lovely to see Haining Wang (in person!), @msulibrary #LEADING fellow, present our project on "Science out of the Ivory Tower: Scientific Abstract Summarization for Everyone." Paper is forthcoming.
Our related machine learning model is available here: huggingface.co/haining/scien…
The Connecting Communities Digital Initiative (CCDI) just announced the next round of award opportunities for Artists/Scholars-in-Residence, Higher Education Institutions, Libraries, Archives, Museums! (1/2)
ALT Photo of person holding a device amidst text about awards.
My team at Stanford Arcadia Falcone (metadata analyst) wrote a paper about how we migrated the backend of our repository , and it's been published in the latest issue of the Code4Lib Journal: journal.code4lib.org/article…
Credit to @justin_littman for the heavy lifting!
One week to go and very much looking forward to this timely and thought provoking "AI and Machine Learning Symposium" presented by @natlibscot, @RL_UK and @SCURLScottish on 25 April. It's a hybrid meeting in Edinburgh with live-streaming, book your place eventbrite.ca/e/ai-and-machi…
Join us this Fri., Apr 14 for our virtual #DLFforum session: Convene, Discuss, Collaborate: Building a Transatlantic Digital Skills Community for Research Libraries During the Covid Pandemic with Jason Clark, Susan Halfpenny, and William Nixon. Register at hubs.li/Q01KR7h30
The @GLAM_labs line of work continues now with a new practical tool - a #checklist for institutions publishing their collections as data - with contributions from @gus_candela @schambers3@mahendra_mahey and more GLAMmers!
Are you attending #Code4Lib? Check out this talk, “{key: value} : algorithmic debiasing in practice” on eliminating algorithmic bias in practice from #JSTOR Labs’ @ThatAndromeda on March 16 at 11:10am ET: bit.ly/3kQRxml. #C4L23
Call for Applications: 2023 Virtual LEADING Fellowship (#LIS Education And #DataScience Integrated Network Group). Open to early-to-mid career library professionals and PhD students. Apply by Feb. 28 at
cci.drexel.edu/mrc/leading/a…#LEADINGDataSci