Subject archive for "data-scientists," page 7

Data Science

Horizontal Scaling for Parallel Experimentation

The amount of time data scientists spend waiting for experiment results is the difference between making incremental improvements and making significant advances. With parallel experimentation, data scientists can run more experiments faster, leaving more time to try novel and unorthodox approaches—the kind that leads to exponential improvements and discoveries.

By Eduardo Ariño de la Rubia6 min read

Product Updates

Git Integration in Domino

We recently released new functionality that provides first-class integration between Domino and git. This post describes the new feature, and describes our perspective on the unique requirements of version control in the context of data science—as distinct from software engineering—workflows.

By Eduardo Ariño de la Rubia5 min read

Data Science

Data Science on AWS: Benefits and Common Pitfalls

More than two years ago, we wrote about the misguided fear of the cloud among many enterprise companies. How quickly things change! Today, every enterprise we work with is either using the cloud or in the process of moving there. We work with companies that insisted, just two years ago, that they “can’t use the cloud” — and are now undertaking strategic initiatives to have “real work in AWS by the end of 2017.” We see this happening across industries including finance, insurance, pharmaceuticals, retail, and even government.

By Nick Elprin4 min read

Data Science

Principles of Collaboration in Data Science

Data science is no longer a specialization of a single person or small group. It is now a key source of competitive advantage, and as a result, the scale of projects continues to grow. Collaboration is critical because it enables teams to take on larger problems than any individual. It also allows for specialization and a shared context that reduces dependency on "unicorn" employees who don't scale and are a major source of key-man risk. The problem is that collaboration is a vague term that blurs multiple concepts and best practices. In this post, we clarify the differences between repeatability, reproducibility, and whenever possible the golden standard of replicability. By establishing best practices of frictionless in-team and cross-team collaboration, you can dramatically improve the efficiency and impact of your data science efforts.

By Eduardo Ariño de la Rubia17 min read

Perspective

Building a Model is the Least Important Part of Your Job

In this Data Science Popup session, Kimberly Shenk, Director of Data Science Solutions at Domino Data Lab, explains why building models is the least important part of data scientists' jobs, and what they must focus on instead.

By Grigoriy42 min read

Data Science

AI in the Enterprise: Making Corporations Smart Again

In this Data Science Popup session, Danny Lange, VP of AI and Machine Learning at Unity Technologies, gives an inside look at practical applications and challenges of AI in enterprises such as Unity, Netflix, Uber, and elsewhere.

By Grigoriy36 min read

Subscribe to the Domino Newsletter

Receive data science tips and tutorials from leading Data Science leaders, right to your inbox.

*

By submitting this form you agree to receive communications from Domino related to products and services in accordance with Domino's privacy policy and may opt-out at anytime.