As an exam project for a data engineering course I took at university, I scraped and analyzed delay data from Deutsche Bahn for connections between a couple of random train stations.
You can find the source code on Codeberg.
Due to the very limited time I had to finish this project, I don't handle cancellations and I don't analyze the data quality.
The project is written entirely in Clojure, is deployed (via CI/CD) on Kubernetes, and uses Postgres for data storage and Clojure Clay notebooks (with Plotly graphs) for visualization.