In the 21st
century our civilization is getting one global data based society. Large
quantities of data are collected everywhere, from traditional fields like
factories, medicine, and science to completely new areas like farming, personal
health, home sensors, kitchen appliances, traffic, cars, and of course internet
with vast collections of numeric and textual data. Yet, data itself means
nothing. We are interested in the patterns and knowledge hidden in it, which
enable us to comprehend trends and decide wisely.
Data mining
is an interdisciplinary approach to knowledge discovery. It helps us handle enormous
quantities of data and exploit data diversity. It encompasses many ideas and
methods from machine learning, statistics, artificial intelligence, and
databases. Many solutions from this field are already part of everyday life. For
example, contents of shop shelves is optimized according to preferences of
customers which articles to buy together. Internet search engines display hits different
for each individual according to the pages they visit and clicks in their
social network activities. Traffic lights in cities are tuned to traffic density
patterns. Medical treatment guidelines are formed according to the history of
successful recoveries. Insurance companies detect fraudulent claims. Various
“big brothers” detect terrorist groups, etc.
The course contents offers a review of up to date
applicative knowledge. The lectures introduce main approaches without
theoretical background. Some important types of data reviewed, like tables,
text, networks, and surveys. For many fields comprehensibility of prediction
models is very important and want to understand the causes for different
phenomena. Therefore course tackles visualization
of data, trends, patterns, and predictive models. In the lab work the gained
knowledge will be put into practice by using open-source data mining and
visualization tools.
As a saying
goes: we are drowning in data and starving for knowledge. This course can throw
you a life belt and help you swim in the sea of data. It will also teach you
how to harvest some knowledge, so that you don’t starve any more.
- nosilec: Marko Robnik Šikonja