Data projects often have two extremes
- no pre-merge checks, fast merge times, but constant breakages
- adopt best practices causing merge times to increase from hours to days
Keep best practices and speed up PR review with Recce
datarecce.io#dbt#dataengineering
LLM/AI sharing x LangChain Watch Party
Awesome sharings from @langchain, @InfuseAI CEO @clkao, and developers of the open-source project Accio(getaccio.ai/)!
🔓 Unlocking the Power of DBT: Visualizing Data Lineage, Diff Analysis, and Impact Analysis for Efficient Data Pipeline Management by @DaveFlynnmedium.com/inthepipeline/dbt…
Check out Alexey's thread below - he breaks down the MLOps process in this easy to understand thread
And if you need an open-source MLOps platform in 30 minutes, check out the PrimeHub 1-click AWS install:
one.primehub.io/#MLOps#MachineLearning#AI#opensource
I've been working these days with #PipeRider and it just amazed me the quickness of handling 36M records. This kind of cool tool would save me a lot of code, time and effort. Thanks to @DaveFlynn for supporting me on my doubts
Last week, we had special guests on Data Engineering Zoomcamp!
You'll want to check this out if you missed it!
Dave Flynn from @InfuseAI presented a free hands-on workshop on data profiling with dbt and PipeRider.
1/2
The PipeRider workshop that accompanies week 4 of the @datatalksclub Data Engineering Zoomcamp is now online to watch:
youtube.com/watch?v=O-tyUOQc…
You’ll learn how to use PipeRider's data comparison to understand the impact of your #dbt data model changes
The data profile comparison summary highlights things like schema change the percentage change of values within tables
The markdown-formatted summary is specially designed for pull request comments:
Don't forget that the PipeRider workshop with Data Talks Club is coming up on Wednesday Feb 22.
If you're following along with the Data Engineering Zoomcamp then the workshop will be linked with Week 4. (but really all you need is a dbt project to join!)
eventbrite.com/e/maximizing-…
We'll be joining @DataTalksClub to show how to maximize your confidence making data model changes in dbt using PipeRider
You'll learn how to use PipeRider's data profile comparison to compare production and dev data models and more
linkedin.com/events/maximizi…
In this first in a series of articles about GPT, InfuseAI Customer Success Engineer Simon Liu looks at the history of GPT models.
Mandarin content 中文:
blog.infuseai.io/gpt-model-p…#gpt#chatgpt#NLP#語言模型
We'll be joining @DataTalksClub to show how to maximize your confidence making data model changes in dbt using PipeRider
You'll learn how to use PipeRider's data profile comparison to compare production and dev data models and more
linkedin.com/events/maximizi…
📢PipeRider 0.18.0 is out now and our #dbt support is even better!
- dbt defined metrics in HTML reports
- Visualize metric differences between data profiles
- Metric comparison summary in Markdown to paste into your pull request comment
⭐️github.com/InfuseAI/piperide…#opensource
ALT PipeRider data profile comparison summary in a pull request comment. The perfect way to see the difference between data profiles such as development and production environments