Course taught in the ATHENS Program
saclay

Data streams are everywhere, from F1 racing over electricity networks to social media feeds. Data stream mining or Real-Time Analytics relies on and develops new incremental algorithms that process streams under strict resource limitations.

This course focuses on, as well as extends the methods implemented in open source tools as MOA and Apache SAMOA. Students will learn to how select and apply an appropriate method for a given data stream problem; they will learn how to design and implement such algorithms; and they will learn how to evaluate and compare different solutions.

Evaluation:

  • 30% Lab Assignments
  • 70% Final Test

Lecture Slides

    • 2. Introduction to Data Science Slides
    • 3. Stream Algorithmics Slides
    • 9. Distributed Streaming Slides

Session Labs

Bibliography

  • Data Stream Mining: A practical approach. PDF
  • Knowledge Discovery from Data Streams by Joao Gama Website