Solution Engineer @Cloudera. Formerly @Hortonworks and @SpringSource. Passionate about open source, data, integration, and analytics. Thoughts are my own.
IDC Vendor Profile | Cloudera: IoT: cloudera.com/content/dam/www… <- "Most IoT projects will require a hybrid architecture, where some data is processed and analyzed at the edge in near real time, and other data is analyzed and processed in the cloud (or other centralized datacenter)."
Amazing recognition for my team (Cloudera Machine Learning): Forrester has released it’s Notebook Based Predictive Analytics & Machine Learning Q3 2020 report with Cloudera named as a Leader in the industry bit.ly/2Fri0BD
Using CDE to Analyze the PPP Data: blog.cloudera.com/using-clou… <-- how CDE, using Apache Spark, can be used to produce reports based on the PPP data while addressing the challenges of working across a multi-stage analytics process against a very large, continuously evolving dataset
Automated Deployment of Apache Spark Jobs in Cloudera Data Engineering: youtube.com/watch?v=RA8UIgfB… <-- using CDE to extract, transform, and load data from an S3 bucket into Hive and then report off it using the Cloudera Data Warehouse
Spark Structured Streaming example with Cloudera Data Engineering (CDE): community.cloudera.com/t5/Co… <-- using CDE to stream data from Kafka with Spark Structured Streaming
Getting Started with Cloudera Data Engineering on CDP: youtube.com/watch?v=YD6VLQOH… <-- a new way for Data Engineers (both admins and users) to provision, track, schedule, deploy, monitor, and troubleshoot spark workloads in a centralized production environment
I spent last year building this code. It allows you to learn how to fully automate Cloudera Data Platform in AWS and Azure. I think it's worth sharing if you are using CDP today. #cloudera#cdp#hybridcloud#aws#azure#devopslnkd.in/gqMkn-u
Apache Submarine 0.4.0 Release: What’s New and Coming? medium.com/@apache.submarine… <— Submarine is ONE PLATFORM to support Data Scientists from exploring data pipeline creation, model training (experiments), to pushing the model to production, including model serving and monitoring
I’ve interviewed hundreds of people for numerous companies over the past 20 years of building businesses.
I’ve experimented with many interview questions and most are only semi useful, but one, above all, has been the most useful.
A thread...
The introduction of CDP Private Cloud, built on @RedHat OpenShift, accelerates data-driven #digitaltransformation across private and #hybridcloud with cloud-native speed, scale, and economics. Learn more here: bit.ly/2UzMVR1
With latest CDP-DC 7.1 release, I just spun up Kafka 2.4 compute cluster w/ SMM(monitoring),SRM(replication), Registry(schema), new Cruise Control(rebalancer) & Connect support. All integrated with SDX. Powerful new Kafka mgmt services in new release!
docs.cloudera.com/runtime/7.…
Congratulations to my son, Nick! 2020 has been a challenging year for him, but he faced that challenge, adapted to it l, and overcame it. That’s a blueprint for success in life and I couldn’t be more proud. So excited for Penn State main campus in the fall!