Subject archive for "reproducibility," page 2
Data Scientist? Programmer? Are They Mutually Exclusive?
This Domino Data Science Field Note highlights Hadley Wickham’s ACM Chicago talk, “You Can’t Do Data Science in a GUI”. In the talk, Wickham argues that code, unlike a GUI, provides reproducibility, data provenance, and the ability to track changes, so data scientists can see how an analysis has evolved. As the creator of ggplot2, Wickham unsurprisingly also advocates using visualizations and models together to help data scientists find the real signals in their data. This post also includes clips from the original video and follows the Creative Commons license attached to the original recording.
By Ann Spencer | 7 min read
The Machine Learning Reproducibility Crisis
Are We Back in the Dark Ages? Without Source Control?
By Pete Warden | 9 min read
Managing Data Science as a Capability
Nick Elprin, CEO at Domino, presented a 3-hour training workshop, “Managing Data Science in the Enterprise”, that provided practical insights and interactive breakouts. The learnings, anecdotes, and best practices shared in the workshop were based upon years of candid discussions with customers about managing and accelerating data science work. The workshop also featured reusable templates that included a pre-flight data science project checklist as well as a planning template for hiring and onboarding data scientists. We are sharing the breakout materials based on attendee feedback. If you missed Strata and are interested in joining similar discussions, then consider attending Rev.
By Domino | 5 min read
0.05 is an arbitrary cutoff: “Turning Fails into Wins”
Grace Tang, Data Scientist at Uber, presented insights, common pitfalls, and “best practices to ensure all experiments are useful” in her Strata Singapore session, “Turning Fails into Wins”. Tang holds a Ph.D. in Neuroscience from Stanford University.
By Domino | 5 min read
Reproducible Dashboards and Other Great Things to do with Jupyter
Mac Rogers, Research Engineer at Domino, presented best practices for creating Jupyter dashboards at a recent Domino Data Science Pop-Up.
By Domino | 26 min read
Domino for Good: Collaboration, Reproducibility, and Openness, in the Service of Societal Benefit
When I joined Domino Data Lab to lead the Domino for Good initiative a few months ago, it felt like the perfect next step on a path I have been on for a long time.
By Lisa Green | 6 min read