We fine-tuned Alec Radford’s 1930 vintage LLM to solve SWE-bench issues.
After just ‼️250‼️ training examples, the model solves its first issue, a simple patch to the xarray library.
🧵👇
Today, napari users can’t fully leverage Xarray’s labeled metadata, as slider names, units & dimensions often get out of sync. That’s changing, as napari, Xarray & CellProfiler devs begin a collab to build true metadata-aware visualization across sciences
earthmover.io/blog/scientifi…
Check out our new API for seasonal aggregations, including support for custom seasons!
`ds.resample(time=SeasonResampler(["DJF", "MAM", "JJAS", "ON"])).mean()`
and
`ds.groupby(time=SeasonGrouper(["DJFM", "MAMJ", "JJAS", "SOND"])).mean()`
xarray.dev/blog/season-group…
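To make the season strings above concrete, here is a minimal plain-Python sketch of how a label such as "DJFM" can be read as a run of month initials and used to average values per season. It is an illustration only, not xarray's implementation; `season_months` and `seasonal_means` are hypothetical helper names, and the sample data is made up.

```python
from collections import defaultdict
from datetime import date

MONTH_INITIALS = "JFMAMJJASOND"


def season_months(season: str) -> list[int]:
    """Return the 1-based month numbers covered by a season string like 'DJF'.

    Assumption: a season is a run of consecutive month initials. Very short
    strings such as 'J' are ambiguous; this sketch takes the first match.
    """
    doubled = MONTH_INITIALS * 2  # lets a season wrap around the year end
    start = doubled.find(season)
    if not season or start < 0:
        raise ValueError(f"not a run of consecutive month initials: {season!r}")
    return [(start + offset) % 12 + 1 for offset in range(len(season))]


def seasonal_means(samples, seasons):
    """Average (date, value) samples per season label.

    A month may belong to several seasons (e.g. March is in both 'DJFM' and
    'MAMJ'), in which case its samples count toward each of them.
    """
    month_to_seasons = defaultdict(list)
    for season in seasons:
        for month in season_months(season):
            month_to_seasons[month].append(season)

    buckets = defaultdict(list)
    for day, value in samples:
        for season in month_to_seasons.get(day.month, []):
            buckets[season].append(value)
    return {season: sum(vals) / len(vals) for season, vals in buckets.items()}


samples = [
    (date(2024, 1, 15), 1.0),
    (date(2024, 3, 15), 3.0),
    (date(2024, 7, 15), 7.0),
]
print(seasonal_means(samples, ["DJFM", "MAMJ", "JJAS", "SOND"]))
```

In this sketch the March sample contributes to both "DJFM" and "MAMJ", which is what overlapping custom seasons imply; seasons with no samples are simply absent from the result.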
🎉 Zarr-Python 3 is here! 🎉
- Full support for Zarr v3 spec
- Chunk-sharding for more efficient data storage
- Major performance boosts with async I/O & parallel compression
💻 pip install --upgrade zarr
Blog post: zarr.dev/blog/zarr-python-3-…
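One way to picture chunk-sharding: several small chunks are packed into one larger stored object (a shard), so an element lives in a particular chunk inside a particular shard. The sketch below shows only that index arithmetic in plain Python. It is not Zarr-Python code; `locate` is a hypothetical helper, and it assumes each shard dimension is a whole multiple of the chunk dimension.

```python
def locate(index, chunk_shape, shard_shape):
    """Return (shard_index, chunk_index_within_shard) for an element index.

    Assumption: every shard dimension is a whole multiple of the matching
    chunk dimension, so chunks tile each shard exactly.
    """
    if not (len(index) == len(chunk_shape) == len(shard_shape)):
        raise ValueError("index, chunk_shape and shard_shape must have equal rank")
    if any(s % c for s, c in zip(shard_shape, chunk_shape)):
        raise ValueError("shard shape must be a whole multiple of chunk shape")

    # Which shard (stored object) holds the element.
    shard = tuple(i // s for i, s in zip(index, shard_shape))
    # Which chunk inside that shard holds the element.
    inner = tuple((i % s) // c for i, s, c in zip(index, shard_shape, chunk_shape))
    return shard, inner


# Element (250, 30) with 100x10 chunks packed into 200x20 shards.
print(locate((250, 30), chunk_shape=(100, 10), shard_shape=(200, 20)))
```

With these shapes each shard holds 2 x 2 = 4 chunks, so the number of stored objects drops by a factor of four while reads can still target a single chunk.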
Calculating quantiles, a common application in #geospatial workloads, used to be slow due to GIL contention in NumPy.
The new implementation in @dask_dev and @xarray_dev is up to a hundred times faster and scales independently of the number of threads 🥳.
docs.coiled.io/blog/array-qu…
At AGU I talked to NASA people about how agencies could better support open-source tools they rely on. I argued that our recent collaboration between Xarray and NASA ESDIS on xarray.DataTree was a good model to copy - read about how it happened here!
xarray.dev/blog/datatree
Beyond stoked to be sharing Icechunk with the world today! A new open source, transactional cloud-native storage engine for ND arrays, built on @zarr_dev. Check it out! 👇👇👇
🚀 We are thrilled to announce the release of the Icechunk storage engine, a new open-source library and specification for the storage of multidimensional array (a.k.a. tensor) data in cloud object storage.
Read our blog post about Icechunk here: earthmover.io/blog/icechunk
The @xarray_dev data model now supports vector data cubes! 🎉 Check out our blog post discussing and demonstrating this exciting development. Thank you to Xvec and others across the OS community for their work making this possible! 👏 earthmover.io/blog/vector-da…
Register below 👇 for Vector Data in @xarray_dev and @zarr_dev. We'll demo how to build & use vector data cubes in Arraylake by ingesting GeoParquet vector data and ERA5 reanalysis data from publicly available cloud object storage. bit.ly/3WDkgtu
Eni Awowale and @lucassterzinger from @NASA_GESDISC presenting on how they use @xarray_dev's new DataTree functionality for organizing HDF files with many groups!
Wonderful to collaborate directly with @NASA on #opensource for science!