📣 As of TODAY, Ponder is in Public Beta. 🚀 Now ANYONE can start running Python data workflows (pandas, NumPy) directly in their data warehouse in less than five minutes. Just pip install ponder.
Learn more below! 🧵[1/N]
ponder.io/ponder-in-public-b…#python#pandas#datascience
We’re excited to announce Snowflake's intent to acquire Ponder to bring Ponder's Python data science innovations to all its customers.
Thank you to all of the Ponder and Modin supporters who brought us to this point -- We're excited for what's to come!
ponder.io/snowflake-to-acqui…
Snowflake announced its intent to acquire @ponderdata to further enable Python data scientists in the Data Cloud. We look forward to welcoming the Ponder team and the Modin community to Snowflake. Learn more: okt.to/I6WXlc
You can run NumPy 🔢 at scale in your database with Ponder 💥
Through @modin_project, Ponder supports:
- Element-wise matrix ops (addition, subtraction, division, power)
- Axis-collapsing or reducing ops (min, max, sum, product, mean)
& more: ponder.io/numpy-on-snowflake…#NumPy#SQL
If you're backed into a dark alley, facing a pack of ravenous NaN dogs, you'll want a sidekick like pandas dropna.
Drop rows with n missing values using thresh=n.
Drop entirely empty rows with how='all'.
Learn more from @__mharrison__: ponder.io/professional-panda…#pandas#Python
How do you use database connections & cursors in Python?
It can be hard to figure out because database concepts & Python objects don't map 1:1.
To learn more (with an emphasis on the Snowflake-Python connector) check out: ponder.io/top-5-things-to-kn…#python#database#snowflake
If you want to automatically parallelize your pandas code, you should check out open-source Modin: github.com/modin-project/mod…@modin_project uses the pandas API, so all you need to do is change "import pandas as pd" to "import modin.pandas as pd"
#pandas#Python#opensource
How the Python Dataframe Interchange Protocol Makes Life Better
In this article, we answer three questions about the Python Dataframe Interchange Protocol: What it is what problems it solves; how it works; and how extensively it's been adopted.
ponder.io/how-the-python-dat…
We're happy to announce the release of #pandas 2.1.1. You can install it with `pip install pandas` or `mamba install -c conda-forge pandas`. You can find what's new in this version in the release notes. Thanks to all contributors and sponsors who made this release possible!
Ever had hourly data, but wished it were monthly? Daily, but wished it were weekly? 😢
pandas resample will dry your tears.
It makes upsampling downsampling really slick, & now it's possible to run it directly in your database: ponder.io/how-to-use-pandas-…#pandas#database
Pulling data from @SnowflakeDB to work w/ scikit-learn, LightGBM, XGBoost?
💥Snowpark ML lets you stay in Snowflake but use important ML classes: LGBMClassifier, Lasso, PCA, XGBClassifier, etc.
💥Ponder lets you do your pandas preprocessing in Snowflake
ponder.io/using-snowpark-ml-…
Ponder typically runs pandas code in your database by translating it to SQL. But pandas apply is different.
For apply, we create a temporary Python user-defined table function (UDTF).
Read more about apply how Ponder handles it: ponder.io/pandas-apply-in-yo…#Python#pandas#SQL
The Python Dataframe Interchange Protocol is a quiet hero 🦸♀️.
It makes it easy for libraries to accept many dataframes (Modin, Polars, Ibis, cuDF, Dask).
Read our encomium: ponder.io/how-the-python-dat…
Thanks to Marco Gorelli & @ralfgommers for your work!
#python#dataframes