Quantcast
Browsing latest articles
Browse All 26 View Live

Reading Delta Lakes into Dask DataFrames

This post explains how to read Delta Lakes into Dask DataFrames.  It shows how you can leverage powerful data lake management features like time travel, versioned data, and schema evolution with Dask....

View Article


Image may be NSFW.
Clik here to view.

Content creators making more than $50,000 a month

This post demonstrates how much money you can make as a content creator and contrasts the content creation and restaurant business models. Content creators can make a lot of money and enjoy a nice...

View Article


Writing NumPy Array to Text Files

This post explains the different ways to save a NumPy array to text files. After showing the different syntax options the post will teach you some better ways to write NumPy data: using binary file...

View Article

Image may be NSFW.
Clik here to view.

Scale big data pandas workflows with Dask

pandas is a great DataFrame library for datasets that fit comfortably in memory, but throws out of memory exceptions for datasets that are too large. This post shows how pandas works well for a small...

View Article

Read multiple CSVs into pandas DataFrame

This post explains how to read multiple CSVs into a pandas DataFrame. pandas filesystem APIs make it easy to load multiple files stored in a single directory or in nested directories. Other Python...

View Article


Image may be NSFW.
Clik here to view.

Ultra-cheap international real estate markets in 2022

This post explains how to identify ultra-cheap international real estate markets and when you can capitalize on deeply discounted prices. Let’s borrow Andrew Henderson’s definition of an ultra-cheap...

View Article

Image may be NSFW.
Clik here to view.

Install PySpark, Delta Lake, and Jupyter Notebooks on Mac with conda

This blog post explains how to install PySpark, Delta Lake, and Jupyter Notebooks on a Mac. This setup will let you easily run Delta Lake computations on your local machine in a Jupyter notebook for...

View Article

Convert streaming CSV data to Delta Lake with different latency requirements

This blog post explains how to incrementally convert streaming CSV data into Delta Lake with different latency requirements. A streaming CSV data source is used because it’s easy to demo, but the...

View Article


DevRel Driven Development

DevRel Driven Development is driving software development from developer advocacy activities like creating documentation, writing blog posts, and producing videos. Developers advocates frequently...

View Article


The Virtuous Content Cycle for Developer Advocates

This post explains how to scale developer advocacy by creating content in a way that answers current user questions and makes it easier to generate additional content in the future. Developer...

View Article
Browsing latest articles
Browse All 26 View Live