Python for Data Science

Teaching materials for the cusy training courses on a Python-based data science workflow: cusy.io/en/seminars

Python for Data Science (@Python4DataScience)
2026-01-19

The section on performance measurements and finding bottlenecks has been significantly expanded to include cProfile/profiling.tracing, tprof, and profiling.sampling/Tachyon: python4data.science/en/latest/
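
As a rough illustration of the kind of measurement that section covers, here is a minimal sketch using the standard-library cProfile and pstats modules. The names `slow_sum` and `profile_call` are hypothetical and chosen only for this example; they are not taken from the tutorial, and tprof and Tachyon are not shown.

```python
import cProfile
import io
import pstats


def slow_sum(n):
    """Hypothetical workload: sum squares in a plain Python loop."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def profile_call(func, *args):
    """Run func under cProfile and return its result plus a text report."""
    profiler = cProfile.Profile()
    # runcall enables the profiler only for the duration of this one call.
    result = profiler.runcall(func, *args)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # Sort by cumulative time and keep the report short.
    stats.sort_stats("cumulative").print_stats(5)
    return result, buffer.getvalue()


if __name__ == "__main__":
    value, report = profile_call(slow_sum, 100_000)
    print(report)
```

The report lists the functions with the highest cumulative time first, which is usually a reasonable starting point when looking for a bottleneck.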

Python for Data Science (@Python4DataScience)
2026-01-16

We have updated the documentation section with references to README, CONTRIBUTING, CHANGELOG, etc.
python-basics-tutorial.readthe

Python for Data Science (@Python4DataScience)
2026-01-08

@AlanSill I can confirm that this is also the primary expectation in new projects. We then present our infrastructure, which guarantees binary-compatible builds for around four years, checks the code quality of additional libraries, and so on.

Python for Data Science (@Python4DataScience)
2026-01-08

@autiomaa @AlanSill Among other things, we always recommend that data scientists in our projects check the training date of the LLMs. See also ‘Take the training cut-off date into account’: cusy.io/en/blog/how-llms-help-

Python for Data Science (@Python4DataScience)
2026-01-08

@AlanSill We emphasise in our data science courses and in our infrastructure that only actively maintained packages should be used. These are two of our efforts to make sustainable research possible.

Python for Data Science (@Python4DataScience)
2026-01-07

We have updated our overview of FastAPI extensions. We are very surprised that extensions that have not been updated for over a year are still being downloaded millions of times.
python4data.science/en/latest/
@FastAPI

Python for Data Science boosted:
Veit Schiele (@veit)
2025-12-23

I took a look at the changes coming with Python 3.15 – and I can’t wait to put them to productive use. I’ve already updated our tutorials:
• UTF-8 as the default encoding: python-basics-tutorial.readthe
• Performance measurements: python4data.science/en/latest/
• Tachyon: python4data.science/en/latest/
• Python JIT compiler: python4data.science/en/latest/
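
To make the first item concrete: Python 3.15 enables UTF-8 mode by default (PEP 686), so text files opened without an `encoding` argument are then read and written as UTF-8. On older versions the default depends on the locale, so passing the encoding explicitly keeps the behaviour the same everywhere. The following is only a sketch; the helper `roundtrip` and the file name are hypothetical and not taken from the tutorial.

```python
import tempfile
from pathlib import Path


def roundtrip(text):
    """Write text to a temporary file and read it back, always as UTF-8."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "example.txt"  # hypothetical file name
        # Explicit encoding: same result on Python versions before 3.15,
        # where the default encoding depends on the platform locale.
        path.write_text(text, encoding="utf-8")
        return path.read_text(encoding="utf-8")


if __name__ == "__main__":
    print(roundtrip("Grüße, 数据"))
```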

Python for Data Science (@Python4DataScience)
2025-11-17

We have updated the section on pytest with many exciting use cases:
* on command line options
* on generating markers
* and on parameterising exceptions
python-basics-tutorial.readthe

Python for Data Science (@Python4DataScience)
2025-10-21

We have updated our tutorial on data management with DVC. DVC also allows you to create lightweight data science and data modelling workflows and to run them with parameters: python4data.science/en/latest/

[Images: precision-recall curve, receiver operating characteristic (ROC) curve, and confusion matrix, each compared between workspace and HEAD]

Python for Data Science (@Python4DataScience)
2025-09-24

@webology Step by step, we will document our experiences with programming support from LLM agents in the tutorial. Our blog posts are somewhat more abstract, such as the one about how we use LLMs: cusy.io/en/blog/how-llms-help-

Python for Data Science (@Python4DataScience)
2025-09-24

We have now described how to create a configuration for Claude Code so that it uses uv reliably: python4data.science/en/latest/
@claudeai

Python for Data Science (@Python4DataScience)
2025-09-23

Because we have often been asked recently whether pandas is slow and whether we should use Polars, Dask or DuckDB instead, we have now written an initial overview of these technologies: python4data.science/en/latest/

Python for Data Science boosted:
Veit Schiele (@veit)
2025-08-27

We have finally documented Ruff – the tool greatly simplifies static code analysis for Python projects: python4data.science/en/latest/

Python for Data Science (@Python4DataScience)
2025-08-22

We have now updated our packaging tutorial to include PEP 639, which enables SPDX-compliant licensing: python-basics-tutorial.readthe

Python for Data Science (@Python4DataScience)
2025-07-19

The XKCD comic on reproducible scientific results fits perfectly with our tutorial 🧐 😉
python4data.science/en/latest/

XKCD #3117: Replication Crisis
