I suggest you go to Kaggle, look through the datasets, and select one (most recent ones are about COVID19 so you may need to look for 2018 or older). Build a project around one of the datasets, integrating object storage for raw dataset persistance, a cloud database, then a client to read the data and show pertinent visuals. That would be a good start.