PolyBase is dead: Today, we are announcing the retirement of PolyBase scale-out groups in Microsoft SQL Server. Scale-out group functionality will be removed from the product in SQL Server 2022. In-market SQL Server 2019, 2017, and 2016 will continue to support the functionality to the end of support for those products. Okay, not dead-dead. But let's talk…
Interesting Resources for Chapter 8
For each chapter in Finding Ghosts in Your Data, I’ll include a few resources that I found interesting. This isn’t a bibliography, strictly speaking, as I might not use all of these in the course of writing, but they were at least worth noting. Streamlit Chapter 8 was all about visualization and ended up being an…
Getting Started with KQL
In the last exciting episode, we loaded some Mars farming data into our Data Explorer pool. Today, we're going to query that data. Navigating to https://dataexplorer.azure.com, we can see the Synapse workspace, Data Explorer pool, and table in the Query tab. Ready to query! In order to query data, we need to use the Kusto…
Upcoming Events: Triad Microsoft Data Platform UG
Key Details What: Triad Microsoft Data Platform User Group. Where: The cloud. When: Thursday, February 24th. Admission is free. Register on Meetup. What I’m Presenting 2022-02-24, 18:00 – 20:00 EST: Where We're Going, We Don't Need Servers. I enjoy giving this talk, as the serverless SQL pool is a thing of wonder. Sometimes, of course, that wonder…
PolyBase versus Flat Files
Wait, What? Today's post comes inspired by an Ed Pearson tweet that Chris Wood clued me into. https://twitter.com/ecpdata/status/1490496984544854016 Is there someone? Maybe! It took me a couple of weeks to get to this thanks to my crazy schedule but I have good news for you. PolyBase 12, Flat Files 8 I really like PolyBase's ability…
Upcoming Events: Cloud Data Driven
Key Details What: Cloud Data Driven User Group. Where: The cloud. When: Thursday, February 24th. Admission is free. Register on the Cloud Data Driven meetup. What I’m Presenting 2022-02-24, 12:00 – 13:15 EST: Locking Down Azure Synapse Analytics. This is a brand new talk. It's an area I've put a lot of focus into over the past few…
Beyond the Basics with Azure ML: Conclusion
This is part seven in a series on getting beyond the basics with Azure ML. What Did We Learn? Throughout this series, we gained a better understanding of how we can use Azure Machine Learning for more than drag-and-drop model training. In the first post of the series, we looked at using notebooks in Azure…
Interesting Resources for Chapter 7
For each chapter in Finding Ghosts in Your Data, I’ll include a few resources that I found interesting. This isn’t a bibliography, strictly speaking, as I might not use all of these in the course of writing, but they were at least worth noting. Approaching Normality Chapter 6 looked at an initial slate of tests. In…
Beyond the Basics with Azure ML: MLOps
This is part six in a series on getting beyond the basics with Azure ML. A Better Way for Deploying Code One development we've seen in software engineering over the past couple of decades has been the automation of code deployment leading to more frequent deployment of smaller sets of code changes. This in turn…
Beyond the Basics with Azure ML: ML Pipelines
This is part five in a series on getting beyond the basics with Azure ML. Pipelines Everywhere Azure ML is built around the notion of pipelines. With machine learning pipelines, we perform the process of data cleansing, data transformation, model training, model scoring, and model evaluation as different steps in the pipeline. Then, after we…