Table Of Contents
Welcome to week 2 of the data science learning journey. This week, we’ll focus on the developer experience as you progress through the data science lifecycle (to build models) and introduce the paradigm shift to LLM Ops when exploring generative AI solutions that work with Large Language Models (LLMs). Each day this week, we’ll introduce a tool, platform or resource that can help streamline your end-to-end workflow.
Day | Topic | Description |
---|---|---|
1️⃣ | GitHub Codespaces | Reproducible dev environments for productivity |
2️⃣ | Visual Studio Code | Shareable profiles for consistency & collaboration |
3️⃣ | GitHub Copilot | Focused learning with AI assistance |
4️⃣ | Open Datasets | Community inspiration from Kaggle, HuggingFace |
5️⃣ | Responsible AI | Explore Model Debugging & Decision-Making |
6️⃣ | Project LIDA | AI-assisted data visualization & summarization |
7️⃣ | Azure AI Platform | Paradigm shift to LLM Ops with a unified platform |
Here’s an illustrated guide to Week 2:
Data Science Day
March 14, 2024 (3.14) was Pi Day aka Data Science Day at Microsoft - a full-day of talks from Python enthusiasts and experts from Microsoft and the broader community. You can check out the livestream here for all the talks:
Simplifying Data Analysis with Developer Tools & AI
I presented a talk on Simplifying Data Analysis with Developer Tools & AI targeting the non-Python developer. My target audience was someone new to Python or Data Science, but otherwise experienced in development. And my goal was to provide a learning roadmap and quickstart environment so they could get productive quickly in their data science journey. You can check out my talk slides here - and you can watch the talk starting at the 1:02 mark Python Data Science Day livestream.
In that talk, I outlined the following roadmap for developers new to this topic, to structure their learning journey but also create a reusable, shareable and reproducible development environment for producitivity. Each of the steps has an associated Exercise that can take developers from conceptual understanding to hands-on practice.
In Week 2 of #14DaysOfDataScience, I’ll be publishing one post a day focused on one of these tools, with a link to the associated hands-on exercise. You can get a head-start on this by forking the repository identified in the roadmap at https://aka.ms/workshops/python-data-analysis - then check back for the daily posts to guide you through the exercises.
Learning Resources
Check out these resources for self-guided exploration of related topics.
- 2024: Data Science Day Collection - Skill up on Data Science Tools & Techniques
- 2024: Responsible AI for Developers - Skill up on Responsible AI Principles & Tooling
- 2024: Generative AI Code-First on Azure - Build Generative AI Apps End-to-End with Azure AI