Published inData Engineer ThingsPrinting Your Machine’s Specifications with PythonA short and simple Python function that will help you print out detailed information about your system.2d ago2d ago
Published inData Engineer ThingsExcel Isn’t Going Anywhere, So Let’s Automate Parsing ItImagine receiving an Excel file where the data starts somewhere random, not in the first row or column.Nov 11Nov 11
Building Document Parsing Pipelines with PythonDocling — Python library designed to parse various document formats and export content into structured formats like JSON and Markdown.Nov 3Nov 3
Introduction to Network Analysis with Neo4j, AuraDB, and Python 🕸️In this guide, we’ll learn about network analysis using Neo4j, AuraDB, and Python.Oct 28Oct 28
Published inData Engineer ThingsGenerating 1 Billion Rows of Complex Synthetic Data 🚀Friendly guide to generating 1 billion rows of complex synthetic data with `dbldatagen`.Oct 20Oct 20
Understanding Parquet File Formatჩhoosing the right data format is critical. Apache Parquet is a columnar storage format designed for performance and efficiency.Aug 23Aug 23
🐍 Introduction to Virtual Environments in PythonWhat are virtual environments and why are they crucial for developers working on multiple projects?Aug 14Aug 14
🐍 4 Books to Master Python for Data and Software EngineeringGuide to Becoming a Skilled Data and Software Engineer with Python.Aug 14Aug 14