Open source, data, science enthusiast. I like free, as in speech, not beer. And cappuccino. And the South. My opinions are mine alone.

Joined January 2009
Hilmar Lapp retweeted
Applications are now open for #FloraPalooza2026! Join ecologists, plant scientists, AI/ML researchers, and data experts in a collaborative workshop exploring how machine learning can unlock insights hidden within plant images. Apply by June 25. Scan the QR code to learn more.
Deadline is May 20
Only 1 week left for the TDWG 2026 Call for Abstracts: tdwg.org/conferences/2026/su… Calling on the @imageomics, #biodiversity, @GBIF, @eol communities to consider contributing to session SYM25 "From Mobilizing Data to AI-Ready Knowledge: Infrastructure for Multimodal Biodiversity Data"
Hilmar Lapp retweeted
Join us today (4/20) at 4 PM EST on Zoom with Dr. Beth Cimini to explore how image analysis unlocks hidden insights in biology. Register: go.osu.edu/beth-cimini
Hilmar Lapp retweeted
Don't miss the Imageomics Seminar “Rethinking the Semantics of Biological Trait Recognition” with Sabina Leonelli on interoperable data, bio-ontologies, and ethics in AI science on February 23rd at 4-5PM EST! Register: go.osu.edu/sabina-leonelli
16 Dec 2025
Our catalog is feature-rich, yet it is a single-page app that is easy to deploy and host sustainably. That is the benefit of making datasets, models, and codebases FAIR by using public repository infrastructure (GH, HF) and, importantly, their respective metadata support mechanisms.
📢The Imageomics Catalog is now available! Explore and discover public code, datasets, models, and spaces; all in one location! BioClip 2, TreeOfLife-toolbox, and so much more! 🖥️🔬imageomics.github.io/catalog…  #AIforNature #Imageomics #OpenScience #FAIROS #AIforScience
Hilmar Lapp retweeted
29 Jan 2025
We’re recruiting! Dryad is looking for a full-time #datacurator to join our team. Fully remote, but fully supported. If you have a passion for research data quality and open data publication, then take a look: buff.ly/3qtjoIG.
Hilmar Lapp retweeted
Here's the thing. We have **NO IDEA** how to pick good graduate students. I served on admission committees for 10 years, and chaired a few, and what I learned is that all the spreadsheets of grades and test scores and recommendations and essays and publications and interview rubrics are just an elaborate ruse to pretend we know what we're doing when we simply don't. Many of the most highly ranked applicants to our "top" program flamed out quickly, and tons of the students we summarily rejected have turned into amazing scientists. But in the name of creating meritocratic-seeming rankings that are more about creating a workforce than great scientists (a system that anyone paying attention knows is bullshit), we've created a homogeneous process adopted by nearly all institutions that has stamped out the one thing we should be striving for - given our lack of any clear understanding of what leads to success - a wide range of different talents and experiences.
Pretty crazy after reviewing the applications of some Stanford PhD applicants and feel like they can graduate right away after the admission 😄
Hilmar Lapp retweeted
We’re excited to announce the #BeetlePalooza2024 workshop coming up in August. Apply to attend before June 10! Help shape the future of biodiversity data collection with #ai and #computervision
Hilmar Lapp retweeted
3 May 2024
A very useful paper by @MegBalk with @hlapp and colleagues as we start to work up the image infrastructure needs for dissco-uk.org/ - see besjournals.onlinelibrary.wi…
26 Apr 2024
Me too!!
25 Apr 2024
Excited to start my tenure on @datadryad's strategic advisory task force next week, together with @wilbanks no less! #yaydatasharing #opendata
Hilmar Lapp retweeted
Wow, @ProjectJupyter is the winner of the White House OSTP "Technical Advancement to Enable Open Science" category. Amazing to see recognition of the project at a national level! Makes me proud of the ecosystem of interconnected tools we've built. whitehouse.gov/ostp/news-upd…
Hilmar Lapp retweeted
18 Dec 2023
ICYMI, some super exciting biology & medical ML models were recently released 🤩 They span from longitudinal EHR, to antibodies, to biological (taxonomy) classification, to large-scale unsupervised medical imaging models. Check them out! 🧵⬇️
Hilmar Lapp retweeted
11 Dec 2023
Introducing BioCLIP: A Vision Foundation Model for the Tree of Life imageomics.github.io/bioclip…
A foundation model that strongly generalizes on the tree of life (2M species), outperforming OpenAI CLIP by 18% in zero-shot classification, and supports open-ended classification over almost the entire tree of life.
What are the secret ingredients?
> Data: we curate and release TreeOfLife-10M, the largest and most diverse ML-ready dataset of organism images to date. It contains 10.4M images for over 450K taxa, sourced from iNaturalist, BIOSCAN, and Encyclopedia of Life.
> Modeling: we creatively repurpose CLIP's multimodal contrastive learning objective for hierarchical image classification. The autoregressive language model naturally encodes the hierarchy of the tree of life taxonomy, which in turn bakes the hierarchical representation into the vision transformer encoder.
Key results
> Strong zero/few-shot classification for animals/plants/fungi, including rare species, outperforming CLIP by avg 16-18% absolute.
> t-SNE visualization shows that BioCLIP's vision encoder has captured the fine-grained hierarchical structure of the tree of life.
> BioCLIP is a kind of universal classifier for the tree of life. Just give it an organism image and it will likely find the correct species (among top 5)! But use it with caution; it's not perfect yet.
Final remarks
> AI for Science is really hard but extremely rewarding! It took us a ton of time (1 year) and frustration trying to find a plausible way to integrate the tree of life taxonomy into foundation model training. But when the "Eureka!" moment came and the idea hit us (by the great @weilunchao) that CLIP's multimodal contrastive learning objective can be repurposed for that, everything just followed naturally. It was truly a moment of joy and excitement!
> BioCLIP is our first attempt at foundation models for biology, but it certainly won't be the last! There's so much more to do at the intersection of one of the oldest scientific disciplines and the young but thriving field of AI. Biological intelligence is the foundation for artificial intelligence, and artificial intelligence will in turn become the most important tool for us to unravel the mysteries of biological intelligence.
We are hiring postdocs and PhDs in the NSF @imageomics institute to explore this exciting field! Drop us an email. Also happy to chat about it at #NeurIPS2023 with any of Tanya, @weilunchao, or me.
- paper: arxiv.org/abs/2311.18803
- project: imageomics.github.io/bioclip…
- demo: huggingface.co/spaces/imageo…
- model: huggingface.co/imageomics/bi…
- data (TreeOfLife-10M): to be released on Hugging Face soon
Joint work with the amazing @imageomics team: @samstevens6860, Lisa Wu, Matt Thompson, Elizabeth Campolongo, @luke_ch_song, @Carlyn2015, @donglixp, @dahdulw, Chuck Stewart, Tanya Berger-Wolf, @weilunchao, @ysu_nlp
Hilmar Lapp retweeted
12 Sep 2023
1️⃣ Share it. 2️⃣ Use it. 3️⃣ Cite it. 10 of our most popular datasets in this month’s Open Data Digest, and over 50,000 more at datadryad.org buff.ly/47XRkkQ #OpenScience #DataReuse #DataScience
Hilmar Lapp retweeted
🗣Now open 🗣 @cziscience has partnered with @KavliFoundation @wellcometrust @ResearchSoft to open a new funding opportunity for Essential #OpenSource Software for Science. Learn more & apply czi.co/OpenSourceSoftwareRFA