Open-Source Models & Evaluation

Days 3–5

The rest of Week 3 is about standing on the shoulders of existing high-performing models: pulling them, adapting them, and rigorously evaluating the results.

Using open-source models

Pulling and using open-source models and architectures.
Fine-tuning, plus pre- and post-training, to adapt a model to your domain.
Leveraging high-performing deep learning models in your own scientific area.

Improving and adapting

Improving open-source architectures rather than starting from scratch.
Using models to inform mechanism, not just to predict.

Validation and evaluation

Validating and evaluating model results.
Evaluating the performance of deep learning models and of agentic systems.
Avoiding the trap of impressive-looking results that don't hold up.

Connecting science to impact

Using these models to connect Earth systems science with impact in other domains, and to communicate that impact to non-technical stakeholders.

Outcomes

By the end of the course you should be able to pull, use, and fine-tune open-source models, evaluate them and your agentic systems rigorously, and use them to inform mechanism and communicate impact.