Dean Wampler (@deanwampler) is an expert in streaming systems, focusing on ML/AI. He is Head of Evangelism at Anyscale.io, which is developing Ray for distributed Python. Previously, he was an engineering VP at Lightbend, where he led the development of Lightbend CloudFlow, an integrated system for streaming data applications with popular open source tools. Dean has written books for O’Reilly and contributed to several open source projects. He is a frequent conference speaker and tutorial teacher, and a co-organizer of several conferences and user groups in Chicago. Dean has a Ph.D. in Physics from the University of Washington.
Ray (ray.io) is an open-source, distributed framework from U.C. Berkeley's RISELab that easily scales Python applications from a laptop to a cluster. It was developed to solve the general challenges of reinforcement learning, but it is flexible for any demanding workload that requires the following:
Ray has been used for reinforcement learning, hyper parameter tuning, model serving, and other applications in clusters up to thousands of nodes. I'll discuss examples that illustrate how Ray can be used with Spark to build robust, scalable data applications for enterprises, when to use Ray versus alternative choices, and how to adopt it in your projects.
The Application Spotlight will highlight selected “Certified on Spark” applications that leverage Spark to help their users derive greater value from their data. For each application their will be a brief demo of key functionality followed by a fireside chat discussing the developers experience with Spark, lessons learned, and wish list for the future.