In December, the Libraries acquired twelve full-text corpus datasets, compiled by Mark Davies, a retired professor of linguistics from Brigham Young University. The corpora will help Columbia researchers across many disciplines to understand how language is and has been used around the world, and they serve as another mark in the Libraries’ commitment to supporting […]
Month: March 2025
Data Engineering in Python with Polars 1
Today, we begin learning Polars, an alternative data analysis Python library to pandas. We’ll learn about how Polars is similar to and different from pandas and why it is an appealing choice in 2025 for ETL (extract-transform-load) operations. […]