Getting Started

  • Welcome & Overview
  • Logistics & Format
  • Prerequisites
  • Project Component

Week 1 — Agentic Cowork Tools

  • Overview & Objectives
  • How LLMs & Agents Work
  • Working with Claude & Claude Cowork
  • Claude Code at Scale

Week 2 — Agentic Systems

  • Overview & Objectives
  • REACT Loops & Harnesses
  • Building Good Skills
  • Agentic Workflow Systems

Week 3 — Deep Learning

  • Overview & Objectives
  • DL Architectures & Training
  • Open-Source Models & Evaluation
ESDS Advanced/Week 3 — Deep Learning/Open-Source Models & Evaluation
OverviewESDS CourseAll Programs

On this page

    Open-Source Models & Evaluation

    Days 3–5

    The rest of Week 3 is about standing on the shoulders of existing high-performing models: pulling them, adapting them, and rigorously evaluating the results.

    Using open-source models

    • Pulling and using open-source models and architectures.
    • Fine-tuning, plus pre- and post-training, to adapt a model to your domain.
    • Leveraging high-performing deep learning models in your own scientific area.

    Improving and adapting

    • Improving open-source architectures rather than starting from scratch.
    • Using models to inform mechanism, not just to predict.

    Validation and evaluation

    • Validating and evaluating model results.
    • Evaluating the performance of deep learning models and of agentic systems.
    • Avoiding the trap of impressive-looking results that don't hold up.

    Connecting science to impact

    • Using these models to connect Earth systems science with impact in other domains, and to communicate that impact to non-technical stakeholders.

    Outcomes

    By the end of the course you should be able to pull, use, and fine-tune open-source models, evaluate them and your agentic systems rigorously, and use them to inform mechanism and communicate impact.