This is a dataset containing all of my recorded running, strength training, cycling and swimming activities from Tuesday 06 September 2022 to Saturday 23 March 2024. It was created with the goal of facilitating the open development of predictive models in the domain of sports and general fitness. I aim to update the dataset at the start of every month for the foreseeable future. Currently, the dataset contains 306 entries split in the following way:
Activity | Number of Entries | Percentage | Storage Size (MB) |
Running | 123 | 40.2 | 22.4 |
Strength | 183 | 59.8 | 29.3 |
Cycling | 0 | 0.0 | 0.0 |
Swimming | 0 | 0.0 | 0.0 |
All data in the dataset has been collected using a Garmin Forerunner 255. For most running activities, further running dynamics data is provided by a Garmin HRM-Pro Plus (though I do sometimes forget to wear it). Although the accuracy of such devices is debatable, the measurements are at least consistent.
It should be noted that I am, first and foremost, training with the purpose of becoming a better judoka, though I am also interested in hypertrophy, strength and cardiovascular endurance. With that said, my training goals often vary based on what I am currently working towards (i.e. a Judo competition, a 10k race, a swimrun, etc.)
As an accompaniment to the dataset, I have also made a static webpage generator that provides statistics and visualisations for every single activity in the dataset. This could be used for visual inspection of trends in the data or simply for inspiration. These visualisations can be accessed here.
When I have time to make them, some example projects based on the dataset will be provided.
Within this dataset, there are 2 sections / file types - the individual activity files and the activity statistics file. The activity files contain the second-by-second measurements (heart rate, speed, cadence, power, etc.) for each activity. The activity statistics file describes the training effects (calories, load, etc.) and pertinent statistics for each activity (sleep duration, HRV, etc.). The dataset (last updated on Tuesday 02 April 2024) can be downloaded below:
Checksums:
f9e92ad2ff47317fd43e9e69a0a35b5aecf8295efeca29002110f47739800d4d
650133f6cc47def3106631020d809ab2b5724e98f8d5c869751a0d1b92a65476
7abd5ad860c77fa93faa2242cc37c9ab88d1173033c4c910ba6a4175ac6dc198
There are a number of limitations with this dataset, some of which are inherent in its design, and others which will be improved in the future. They are listed below:
Inherent limitations:
Temporary limitations:
If you have any suggestions for the improvement of the dataset, feel free to contact me!