Monday Morning Data Science

Share this post

🐰 Egg Salad

fhdata.substack.com

🐰 Egg Salad

Everything bloomed this week

Fred Hutch Data Science Lab
Apr 3, 2023
Share

Good morning!

Welcome to the twenty second ever issue of Monday Morning Data Science from the Fred Hutch Data Science Laboratory. We are excited to show you what we have been working on (Fresh from the Lab), plus links that we think you would be interested in (Our Weekly Bookmarks Bar). Part of the purpose of this newsletter is to start conversations, so if you have a question or there is something you would like to share with us please let us know by responding directly to this email.

Fresh from the Lab

  • [Welcome to Spring 2023!!!] We made it through the Dark and we’re back to start fresh. Now is a great time to think about spring cleaning for your data too. Also, despite Marie Kondo having a little real life adjustment lately (the parent in me is very smug about that), the sentiment of assessing whether a dataset still “sparks joy” and deleting it if it doesn’t, is still valid. Often times we move on to the next big project and don’t take the time to think about the ghosts of data past. Given recent policy changes by the NIH about data sharing, new requirements from publications for data sharing, reproducibility and documentation of analyses, it’s a great time to start thinking differently about your data stewardship skills so your future self has an easier time of it.Luckily for you, the Data Science Lab is also ramping up and available to help you think about how you might leverage all the resources you have here at FHCC to manage your research data in a way that:

    • protects it from loss or corruption (all those months of work and all those reagents wasted, ack!!!),

    • is cost effective (yes we have subsidized storage, but not all storage locations cost the same!) and most importantly,

    • helps you and yours do the best quality research you can as efficiently and reproducibly as possible so you can focus on the science, not the logistics of your work.

    We’re beginning to develop some guidance around data management and stewardship in the Data Science Lab portion of SciWiki. You can read more about where to store what and what tools we have to move data around in the Scientific Computing Data Storage section of SciWiki. Also, at any time you can schedule a Data House Call to talk about:

    • best ways to plan your lab’s data management scheme,

    • ways to move data to where they should be, and

    • ways to adjust how you do computing and access data that will inherently set up good data practices with the side effect of helping you do more reproducible analyses!!

    Remember you also always have support from Scientific Computing as well if you need to phone a friend to help you move and verify data or set up credentials for the cloud if your lab has not yet fully shifted to storing your larger (and all genomics) data to AWS S3 where IT supports PI buckets. They have office hours every Wednesday from 10a-12p on Teams or you can email them and describe your needs by emailing scicomp@fredhutch.org.

  • [New Regime for Data House Calls] Given the diversity of topics folks have brought to Data House Calls, and DaSL’s desire to meet people where they are at (in a hybrid and time-aware sort of way), we've shifted the format to ensure that everyone has some focused time to discuss their specific questions. If you have questions about anything data-related, from here on out you can, at any time, schedule a Data House Call via the link above.

Our Weekly Bookmarks Bar

Twitter avatar for @allison_horst
Allison Horst @allison_horst
📢 New course announcement: I'm teaching a free, beginner friendly "Intro to data wrangling & analysis in JavaScript" course in April. Ever wanted to dip your toes into JS? This short course (4x 1-hour sessions) is for you! ✨Learn more & register here: observablehq.com/@observablehq/…
observablehq.comCourse: Introduction to data wrangling and analysis in JavaScriptCourse Description As data practitioners increasingly create and share their data work (like interactive apps and data visualizations) on the web, it can be useful to build skills for working with data in the language of the web — JavaScript! In this course, you’ll learn basic skills and methods for…
5:31 PM ∙ Mar 31, 2023
30Likes4Retweets
Twitter avatar for @jburnmurdoch
John Burn-Murdoch @jburnmurdoch
Time for perhaps the most damning stat of all: One in 25 American five-year-olds today will not make it to their 40th birthday. No parent should ever have to bury their child, but on average across the US one set of parents from every kindergarten class most likely will.
Image
1:48 PM ∙ Mar 31, 2023
3,050Likes1,417Retweets

As always you can contact us by replying directly to this email, you can email Jeff Leek, Amy Paguirigan, and Sean Kross at data@fredhutch.org, or you are welcome to join us on the Fred Hutch Data Slack Workspace. For more information about the Fred Hutch Data Science Lab, visit our website: https://hutchdatascience.org/. See you next week!

- The Fred Hutch Data Science Laboratory

Share
Top
New

No posts

Ready for more?

Š 2023 Fred Hutch Data Science Laboratory
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing