Large Language Models (LLMs) have evolved beyond being mere downstream Natural Language Processing (NLP) components: an increasing number of recent LLM-based systems can perform end-to-end planning, assembly, and evaluation of machine learning (ML) workflows. Advances in alignment techniques, tool integration, and program-generation methods enable LLMs to create end-to-end automated pipelines that execute from data collection through model deployment, verification, and reporting. The review exam