From Jupyter to Production Filip Jankovic PyData Global 2021
Video: http://youtube.com/watch?v=cFpUBiSgDwU
From Jupyter to Production: Deploying an Influenza Monitoring System at Scale With Wearable Sensors
Speaker: Filip Jankovic

Summary
This talk walks through developing and deploying a machine learning pipeline at scale to predict flu onset in a production setting. Leveraging the open-source tools nbdev and Ploomber, we developed a workflow that allows us to produce maintainable, robust, production-ready machine learning pipelines directly from Jupyter.

Description
Direct-to-individual infection monitoring programs are transforming how infections are identified, measured, and treated. Models built on permissioned wearable sensor data from devices such as Fitbit, Garmin, and Apple Watch can be used to notify individuals of potential infections early on.

Overview:
• Background and overview of the domain.
• Data and analytics architecture overview.
• The previous workflow and the challenges of taking a research model and creating a production pipeline.
  a. The complexity of translating between notebooks and a production codebase.
  b. Manual and inefficient tracking of pipeline status, outputs, and metadata.
  c. Repetitive, non-modular code that is difficult to maintain.
• How open-source tools (Jupyter, nbdev, and Ploomber) helped us solve previous challenges.
  a. Notebook-based development promotes rapid development.
  b. Ploomber orchestrates workflows and facilitates the handoff between notebooks and production-ready code.

Filip Jankovic's Bio
Filip is a Senior Data Scientist on Evidation's Research, Analytics, and Learning – Product team. Filip works on designing and developing data science products to bring novel health insights to individuals.
Filip's research areas include developing digital representations of health and wellness, the impact of interventions on health and engagement, and quality assessments of novel data streams.
Recent projects have focused on developing novel deep learning time series methods and productionizing machine learning models to deliver personalized insights to millions of individuals.
LinkedIn: / fjankovic

PyData Global 2021
Website: https://pydata.org/global2021/
LinkedIn: / pydata-global
Twitter: / pydata
www.pydata.org

PyData is an educational program of NumFOCUS, a 501(c)(3) non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.
