Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Building a Production-Ready Serverless App on Google Cloud (Part 2: The Data Contract)
Patricio Navarro
Patricio Navarro
Patricio Navarro
Follow
for
Google Developer Experts
Apr 5
Building a Production-Ready Serverless App on Google Cloud (Part 2: The Data Contract)
#
ai
#
dataengineering
#
python
#
googlecloud
7
 reactions
Comments
Add Comment
4 min read
Quantified Self: Building a Production-Grade ETL Pipeline for 10+ Wearables
Beck_Moulton
Beck_Moulton
Beck_Moulton
Follow
Mar 9
Quantified Self: Building a Production-Grade ETL Pipeline for 10+ Wearables
#
python
#
dataengineering
#
opensource
#
airflow
2
 reactions
Comments
Add Comment
4 min read
Our Data Extraction Pipeline Worked Perfectly… Until Month 6
Baldur12
Baldur12
Baldur12
Follow
Mar 4
Our Data Extraction Pipeline Worked Perfectly… Until Month 6
#
dataengineering
#
datascience
#
datastructures
#
dataextraction
1
 reaction
Comments
Add Comment
2 min read
Share of Shelf Analysis: How to Scrape Zappos Search Results
Jerry A. Henley
Jerry A. Henley
Jerry A. Henley
Follow
Mar 5
Share of Shelf Analysis: How to Scrape Zappos Search Results
#
webdev
#
devops
#
webscraping
#
dataengineering
1
 reaction
Comments
Add Comment
4 min read
Iterator Patterns: How to Process Millions of Records Without Running Out of Memory
Kunwar Jhamat
Kunwar Jhamat
Kunwar Jhamat
Follow
Mar 5
Iterator Patterns: How to Process Millions of Records Without Running Out of Memory
#
php
#
programming
#
dataengineering
#
performance
1
 reaction
Comments
Add Comment
5 min read
Why My Metrics Pipeline with Telegraf Didn’t Work (and What I Learned)
Mohamed Hussain S
Mohamed Hussain S
Mohamed Hussain S
Follow
Apr 7
Why My Metrics Pipeline with Telegraf Didn’t Work (and What I Learned)
#
dataengineering
#
clickhouse
#
devops
#
observability
2
 reactions
Comments
Add Comment
2 min read
Python was too slow for 10M rows—So I built a C-Bridge (and found the hidden data loss)
BUKYA NARESH
BUKYA NARESH
BUKYA NARESH
Follow
Apr 7
Python was too slow for 10M rows—So I built a C-Bridge (and found the hidden data loss)
#
python
#
cpp
#
performance
#
dataengineering
Comments
Add Comment
2 min read
9 Data Engineering Challenges That Kill Pipelines in Production (And How I approached Them With Pure Snowflake SQL)
Vibhu Gupta
Vibhu Gupta
Vibhu Gupta
Follow
Mar 8
9 Data Engineering Challenges That Kill Pipelines in Production (And How I approached Them With Pure Snowflake SQL)
#
snowflake
#
dataengineering
#
sql
#
datapipeline
1
 reaction
Comments
Add Comment
14 min read
How I stopped bad data from reaching my warehouse using a single Airflow task
Vignesh
Vignesh
Vignesh
Follow
Apr 7
How I stopped bad data from reaching my warehouse using a single Airflow task
#
dataengineering
#
airflow
#
python
#
api
Comments
Add Comment
4 min read
Stop Babysitting Servers: Build a Scalable Serverless Data Lake on AWS
Rocio Baigorria
Rocio Baigorria
Rocio Baigorria
Follow
Apr 6
Stop Babysitting Servers: Build a Scalable Serverless Data Lake on AWS
#
showdev
#
aws
#
serverless
#
dataengineering
Comments
Add Comment
2 min read
History of Kafka the message broker
shubham pandey (Connoisseur)
shubham pandey (Connoisseur)
shubham pandey (Connoisseur)
Follow
Mar 8
History of Kafka the message broker
#
architecture
#
dataengineering
#
distributedsystems
#
systemdesign
2
 reactions
Comments
Add Comment
3 min read
Introducing QueryFlux: Open-Source Universal Multi-Engine Query Router and SQL Proxy
Joni Sar
Joni Sar
Joni Sar
Follow
Apr 6
Introducing QueryFlux: Open-Source Universal Multi-Engine Query Router and SQL Proxy
#
database
#
dataengineering
#
devops
#
opensource
1
 reaction
Comments
Add Comment
7 min read
How I Built a Performance Dashboard for a Multi-Office Chiropractic Practice
Lenin Mishra
Lenin Mishra
Lenin Mishra
Follow
Apr 6
How I Built a Performance Dashboard for a Multi-Office Chiropractic Practice
#
googlesheets
#
analytics
#
looker
#
dataengineering
1
 reaction
Comments
2
 comments
5 min read
Optimizing Continuous Aggregate Performance for Large Datasets
Philip McClarence
Philip McClarence
Philip McClarence
Follow
Mar 3
Optimizing Continuous Aggregate Performance for Large Datasets
#
database
#
dataengineering
#
performance
#
postgres
Comments
Add Comment
4 min read
How to Build a Real-Time DynamoDB to S3 Analytics Pipeline with Apache Iceberg
Tejaswita Soni
Tejaswita Soni
Tejaswita Soni
Follow
Mar 3
How to Build a Real-Time DynamoDB to S3 Analytics Pipeline with Apache Iceberg
#
analytics
#
aws
#
dataengineering
#
tutorial
2
 reactions
Comments
Add Comment
8 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account