November 6, 2024
Location: SB 145
Time: 12:00 pm
Presenter: Hadley Wickham
Data Science in Production
Abstract:
This talk will discuss what it means to put data science “in production”. In industry, any successful data science project will be run repeatedly for months or years, typically on a server that you can’t work with interactively. This poses an entirely new set of challenges that you won’t encounter in your university classes but that are vital to overcome if you want to have an impact in your job. I’ll discuss three principles that I’ve found useful for understanding data science in production: not just once, not just my computer, and not just by myself. I’ll discuss the challenges associated with each and, where possible, what solutions (both technical and sociological) are currently available.
Dr. Hadley Wickham is Chief Scientist at Posit PBC, winner of the 2019 COPSS award, and a member of the R Foundation. He builds tools (both computational and cognitive) to make data science easier, faster, and more fun. His work includes packages for data science (like the tidyverse, which includes ggplot2, dplyr, and tidyr) and principled software development (e.g. roxygen2, testthat, and pkgdown). He is also a writer, educator, and speaker promoting the use of R for data science. Learn more on his website, http://hadley.nz.
Light lunch will be served.