I've been exploring Cryo, a blazing-fast tool that pulls raw on-chain data straight into Parquet files with zero friction.
Pair it with Apache Spark for transformation, and you've got a lean, modern pipeline that takes you from raw blockchain chaos to clean, query-ready data.