{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Datenvalidierung mit Voluptuous (Schemadefinitionen)\n", "\n", "In diesem Notebook verwenden wir [Voluptuous](https://github.com/alecthomas/voluptuous), um Schemata für unsere Daten zu definieren. Wir können dann die Schemaprüfung an verschiedenen Stellen unserer Bereinigung verwenden, um sicherzustellen, dass wir die Kriterien erfüllen. Schließllich können wir Ausnahmen für die Schemaüberprüfung verwenden, um unreine oder ungültige Daten zu markieren, beiseite zu legen oder zu entfernen.\n", "\n", "
| \n", " | Unnamed: 0 | \n", "timestamp | \n", "city | \n", "store_id | \n", "sale_number | \n", "sale_amount | \n", "associate | \n", "
|---|---|---|---|---|---|---|---|
| 0 | \n", "0 | \n", "2018-09-10 05:00:45 | \n", "Williamburgh | \n", "6 | \n", "1530 | \n", "1167.0 | \n", "Gary Lee | \n", "
| 1 | \n", "1 | \n", "2018-09-12 10:01:27 | \n", "Ibarraberg | \n", "1 | \n", "2744 | \n", "258.0 | \n", "Daniel Davis | \n", "
| 2 | \n", "2 | \n", "2018-09-13 12:01:48 | \n", "Sarachester | \n", "2 | \n", "1908 | \n", "266.0 | \n", "Michael Roth | \n", "
| 3 | \n", "3 | \n", "2018-09-14 20:02:19 | \n", "Caldwellbury | \n", "14 | \n", "771 | \n", "-108.0 | \n", "Michaela Stewart | \n", "
| 4 | \n", "4 | \n", "2018-09-16 01:03:21 | \n", "Erikaland | \n", "11 | \n", "1571 | \n", "-372.0 | \n", "Mark Taylor | \n", "