DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

The Data Refinery: Why Apache Spark is the Engine Behind Real-World Big Data Use Cases

Comments
2 min read
Data Warehousing & Modeling: From Foundation to AWS Cloud Implementation

Data Warehousing & Modeling: From Foundation to AWS Cloud Implementation

Comments
3 min read
Scaling data systems: How we process millions of records with Python

Scaling data systems: How we process millions of records with Python

Comments
3 min read
Data Management Systems: Transactional to Analytical Architectures

Data Management Systems: Transactional to Analytical Architectures

Comments
7 min read
OLAP vs OLTP: A Deep Dive into Database Processing Systems

OLAP vs OLTP: A Deep Dive into Database Processing Systems

Comments
3 min read
Medallion Architecture: Designing Bronze, Silver, and Gold Layers

Medallion Architecture: Designing Bronze, Silver, and Gold Layers

Comments
6 min read
OLAP vs OLTP: Understanding the Backbone of Modern Data Systems

OLAP vs OLTP: Understanding the Backbone of Modern Data Systems

1
Comments
2 min read
How Databricks Genie Turns Plain English Into SQL Code

How Databricks Genie Turns Plain English Into SQL Code

5
Comments
11 min read
I built a DuckDB extension to handle chemistry data without pandas or RDKit

I built a DuckDB extension to handle chemistry data without pandas or RDKit

Comments
5 min read
I Analyzed 10 Million Records in 47 Seconds Using Python + DuckDB (No Spark, No Cloud)

I Analyzed 10 Million Records in 47 Seconds Using Python + DuckDB (No Spark, No Cloud)

2
Comments 1
3 min read
Performance and Apache Iceberg's Metadata

Performance and Apache Iceberg's Metadata

Comments
7 min read
Stop Using Subqueries: 3 Advanced SQL CTE Patterns That Saved My Production Database

Stop Using Subqueries: 3 Advanced SQL CTE Patterns That Saved My Production Database

Comments 1
3 min read
Meet SDI: The No-Code ETL Tool Every Data Engineer Needs in Their Toolbox

Meet SDI: The No-Code ETL Tool Every Data Engineer Needs in Their Toolbox

Comments
2 min read
Transactional Power Vs Analytical Precision: The Essential Guide to OLTP and OLAP

Transactional Power Vs Analytical Precision: The Essential Guide to OLTP and OLAP

Comments
12 min read
HCC Risk Adjustment Data Model: Building Accurate Risk Score Pipelines in SQL

HCC Risk Adjustment Data Model: Building Accurate Risk Score Pipelines in SQL

Comments 1
6 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.