Giving Feedback on Microsoft Data Platform Tools
As a data professional, there is a set of tools that you use on pretty much a daily basis. Before I started as the editor of Simple-Talk, there were two Microsoft tools I used every day of the work...
Using Common Table Expressions: Transforming and Analyzing Data in...
In the first article in this transforming data series, I discussed how powerful PostgreSQL can be in ingesting and transforming data for analysis. Over the last few decades, this was traditionally done...
Azure OpenAI and RAG Security
The most-watched videos on my YouTube channel are related to security. Azure SQL Networking Secrets and Cloud Security: Secure Access from your Applications to Azure SQL explain private endpoints from...
How to Use Any SQL Database With Rust
Rust is emerging as a frontrunner for ensuring memory safety without sacrificing performance. Its growing popularity isn’t solely based on the “fearless concurrency” mantra but also on its expanding...
Fabric and Azure SQL in a few clicks: Mirroring a Database
Finally, mirroring is available for Fabric! You can mirror an Azure SQL database to Fabric. It works for Cosmos DB and Snowflake as well, but in this article, I will focus on Azure SQL. It is 100%, no, but it is...
Snake draft sorting in SQL Server, part 1
I recently had a restore job where I needed to split the work up into multiple parallel processes (which I’ll refer to here as “threads”). I wanted to balance the work so that the duration was...
Snake draft sorting in SQL Server, part 2
In my previous post, I showed how to borrow a snake draft concept from fantasy football, or a packing technique from the shipping industry, to distribute different portions of a workload to run in...
Exploration Experience: Saving and Creating Datasets
I have been talking about Data Exploration in Power BI in many of my sessions, especially the sessions about Data Marts. The new data exploration feature is one more feature on this expanding scenario...
Database Subsetting and Data Extraction
Let’s start by defining a subset and asking why you would require one. When dealing with the development, testing and releasing of new versions of an existing production database, developers like to...
Building a Culture of Data: The Technical Aspects
In the first part of this two-part series, I covered the mostly non-technical aspects of building a data culture. While the lion’s share of the work will be getting people to work together and embrace...
Recursive CTEs: Transforming and Analyzing Data in PostgreSQL, Part 3
The first two articles in this series demonstrated how PostgreSQL is a capable tool for ELT – taking raw input and transforming it into usable data for querying and analyzing. We used sample data from...
4 Reasons to use KustoDB for Real-Time
When implementing real-time ingestion, we usually implement an architecture called lambda. Using the lambda architecture, KustoDB in Microsoft Fabric is always recommended for the speed layer. Do you...
Working with IAsyncEnumerable in C#
IAsyncEnumerable is a powerful interface introduced in C# 8.0 that allows you to work with sequences of data asynchronously. It is a great fit for building ETLs that asynchronously stream data to get...
Building RESTful APIs in Rust With Actix and Diesel
There are many packages and tools that you can use to facilitate your API development with Rust. Rust has a rich third-party ecosystem of crates for building APIs, including web packages like Actix and...
Database technology learning resources
Before I started as the editor of Simple Talk, I worked on SQL Server. Only. (Ok, I used Redgate’s tools too). But when I started here, one of the goals was to stretch the topics farther and farther...
Updating Documents in MongoDB
This article is part of Robert Sheldon's continuing series on MongoDB. To see all of the items in the series, click here. In the previous articles in this series, I demonstrated various ways to retrieve...
PySpark Secrets to use with Fabric
PySpark is a powerful language for data manipulation and it’s full of tricks. Let’s discover some of them. Control the Type of a NULL column: If you are creating a PySpark DataFrame, but one of the...
Technology For Humanity
There was a time when I was on a team that was designing an important IT system for a multinational bank, and the testers arranged for perfectly normal office workers from the bank to try the system out....
Headless CMS (Content Management Systems): Contrasting Git-Based and API-Based
The presentation layer of a headless CMS is separated from the content management system itself, making it a backend-only system for managing, creating, and storing material. Content presentation (how...
How to Build Metadata Driven Pipelines in Microsoft Fabric
Over the past years, “traditional” ETL development has morphed into data engineering, which has a more disciplined software engineering approach. One of the benefits of having a more code-based...