Welcome to the seventy second ever issue of Monday Morning Data Science from the Fred Hutch Data Science Laboratory. We are excited to show you what we have been working on (Fresh from the Lab), plus links that we think you would be interested in (Our Weekly Bookmarks Bar). Part of the purpose of this newsletter is to start conversations, so if you have a question or there is something you would like to share with us please let us know by responding directly to this email.
Our Weekly Bookmarks Bar
[Blog Post: How to Reduce the Size of a Large GitHub Repo] The blog post by Leonardo Collado-Torres discusses strategies to reduce the size of a large GitHub repository. It details the process of using the BFG Repo-Cleaner tool to identify and remove large files, specifically those over 10 MB, while keeping important data. The authors initially had a repository that occupied 55.9 GB and successfully reduced it to 12.8 GB. The post outlines the steps involved in preparing the workspace, diagnosing large files, and cleaning up the repository to maintain a manageable size. The complete process and more detailed steps can be found by clicking the link above.
As always you can contact us by replying directly to this email, you can contact the Data Science Lab at data@fredhutch.org, or you are welcome to join us on the Fred Hutch Data Slack Workspace. For more information about the Fred Hutch Data Science Lab, visit our website: https://hutchdatascience.org/. See you next week!
- The Fred Hutch Data Science Laboratory