layout: true --- # Sequence Learning Korbinian Riedhammer --- # Today - Logistics. - Why you should take this class. - Motivation - Syllabus - What you should bring to this class. --- # Logistics - Thursdays at 8a. Online for now (Teams); starting April 24: HQ.104 - materials:
- announcements and discussion: Microsoft Teams `Sequence Learning` (code: see email) - programming languages: Java, Python for PyTorch; bash for glue - until April 20: online, individually and (mostly) async - work your way through the posted literature - work on the assignment - async workflow: post questions, get peer and instructor help - Thursdays at 8a: Q&A session - starting April 24: regular class - 60-90 min. lecture, then (pair) programming - exercises: peer programming, discussion --- # Credits - In teams of two, **write** a [4 page paper](https://journals.ieeeauthorcenter.ieee.org/create-your-ieee-article/authoring-tools-and-templates/ieee-article-templates/templates-for-transactions/) (excluding references), due **June 24** (60%) .center[
] - individually **review** 3 other papers by **July 1** (20%) - **present** your paper on **July 8** (tentative date). (20%) --- # Flashback: Verbmobil (1993-2000)
--- # Why you should take this class. ### Machine learning is the future.* ### Many applications are to sequences, not single observations. ### Understand the foundations of sequence classification.
.right[*or at least a well-paid part of it.] --- # Motivation ### Sequence Learning in Fiction --- # Star Trek (1966) und Star Trek TNG (1987)
--- # 2001: Odyssee im Weltall (1968)
--- # Star Wars: A New Hope (1977)
--- # Knight Rider (1982)
(at around 2:00) --- # Charlie's Angels (2000)
(at around 0:50) --- # 24 (S02E23, 2003) .center[
] "Cypress Recording" wurde durch Sound Engine mit angelernter Sprachsynthese erstellt. --- # Wild Hogs (2007)
--- # Her (2013)
--- # Blade Runner 2049 (2018)
--- # Motivation ## Sequence Learning in Products --- # Radio Rex (1920)
"Classic" signal processing: triggers on 500Hz ("re**ks**") --- # Worlds of Wonder's Julie Doll (1987)
--- # PenPoint OS (1991)
--- # Graffiti (Palm OS, 1997) Xerox PARC Unistrokes
Required defined stroke/trajectory for letters. --- # Microsoft Speech Recognition (2008)
--- # BMW Sprachsteuerung (2009)
--- # Siri (2011)
(at about 2:20) --- # Alexa (2014)
--- # Microsoft Cortana (2015)
--- # HELLO Barbie "She Talks" (2015)
--- # Google GBoard Predictive Text and Autocorrect
--- # Google GBoard Glide Typing
--- # Spelling and Grammar Correction
--- # Smart Home Voice Activation Alexa, Google, Cortana hotword activation and retraining. --- # Automatic Summarization
--- # Stock Market Prediction
--- # Music Composer
--- # Digital Sports: Drop Jump Classification
.right[...mit JSTK] --- # Machine Translation Moses:
([demo](http://demo.statmt.org/index.cgi)) OpenNMT:
([demo](https://demo-pnmt.systran.net/production)) --- # MarI/O
--- # MariFlow
--- # Data Sources Analog signals (digitized) - microphones - vibration - conductivity - ambient: pressure, temperature, humidity, ... - positional: GPS, gyro, distances - user input: key-press, gestures, pressure, swipe Digital or "Big Data" signals - log streams - network traffic - events (IoT, MQTT) - user-generated content --- # Toolkits ### JSTK
- basic speech processing and recognition - reference implementation for HMM and related algorithms ### PyTorch
- general purpose machine learning toolkit - extensions for sequence learning (LSTM, encoder-decoder, ...) ### Kaldi
- speech recognition toolkit with deep learning and FSTs - sequence classification of continuous signals - _not covered in this class_ --- # Syllabus - matching and comparing discrete sequences ("auto-correct") - improving auto-correct with probabilities - extending on auto-complete - basic sequence classification (HMM) - sequence kernels for instance classifiers - neural networks ("deep learning") - sequence to sequence learning - deep dive: evolution of speech recognition from the 80ies to today --- # What you should bring to this class. ### A little bit of probability theory. ### A little bit of optimization. ### Algorithms and Programming. ### Genuine interest in sequence classification.