Schedule

The AY 2025–26 workshop series on multimodal generative AI is now complete — browse the materials in the Past Sessions archive below. We'll return in fall 2026 with a new series for AY 2026–27.

Fall 2026

AY 2026–27 series — coming soon

New sessions will be announced in fall 2026. Topics and dates to be confirmed.

Past Sessions

Browse notebooks, slides, and other materials from previous workshops. Each session includes a read-only preview, a one-click link to run the notebook in Google Colab, and a portable version you can run in any Jupyter environment.

April 15, 2026, 3–4 pm

Multimodal AI — Video and temporal understanding

Vision-language models can process video and image series, grounding their responses in time to indicate when particular events or shifts occur in a film. This session explored the use of vision-language models for the analysis and interpretation of moving images.

vision video multimodal

April 8, 2026, 12 pm

AI for humanities research?

A collaborative session exploring how large language models and vision-language tools can support humanities scholarship — from working with archival image collections via IIIF to contextual analysis of primary sources.

humanities archives vision

March 4, 2026, 3–4 pm

DiScho Discovery Hours — Translating secondary sources

Three practical approaches to translating scholarly texts: quick paragraph-level translation via Google Translate / DeepL, offline reproducible translation with MarianMT, and context-aware scholarly translation with an LLM. Attendees compared outputs on the same passage.

translation text humanities

February 25, 2026, 3–4 pm

Multimodal AI — Visual tool calling

Multimodal AI models can include visual tools that enable them to manipulate images or retrieve external information. A zoom tool can focus on a section of a painting; reverse image search retrieves metadata. We also built a custom image restoration tool and covered practical document-to-text workflows.

vision tools multimodal

February 18, 2026, 3–4 pm

DiScho Discovery Hours — LLM Steering

An exploration of LLM activation steering — adding abstract concept vectors to a model's hidden state to alter its output. We tinkered with the technique using nnsight and sparse autoencoders, and discussed what it reveals about how models represent concepts internally.

steering interpretability llm

January 28, 2026, 3–4 pm

Multimodal AI — Visual reasoning and chain of thought

Recent models can reason about the visual contents of images, "thinking aloud" about meaning and relationships between objects. This capability enables more effective recognition of signs and contextual information within images. We explored how this might further visual analysis, interpretation, and distant viewing.

vision reasoning multimodal

Responsible AI

A series of workshops and collaborative sessions where
researchers can learn about recent developments
in generative AI.

Schedule

AY 2026–27 series — coming soon

Past Sessions

Multimodal AI — Video and temporal understanding

AI for humanities research?

DiScho Discovery Hours — Translating secondary sources

Multimodal AI — Visual tool calling

DiScho Discovery Hours — LLM Steering

Multimodal AI — Visual reasoning and chain of thought

Location

Commons Library Classroom (D112)

Responsible AI

A series of workshops and collaborative sessions where researchers can learn about recent developments in generative AI.

Schedule

AY 2026–27 series — coming soon

Past Sessions

Multimodal AI — Video and temporal understanding

AI for humanities research?

DiScho Discovery Hours — Translating secondary sources

Multimodal AI — Visual tool calling

DiScho Discovery Hours — LLM Steering

Multimodal AI — Visual reasoning and chain of thought

Location

Commons Library Classroom (D112)

A series of workshops and collaborative sessions where
researchers can learn about recent developments
in generative AI.