CS 460 Big Data Analysis (4 SH)

Rapid advances in digital sensors, networks, storage, and computation, along with their availability at low cost, are leading to the creation of large data sets. This course provides an introduction to the definitions, principles, and de-facto standard and industrial frameworks for handling these large datas. Among the multitude of software platforms, the course will utilize Hadoop, Spark, Pig, and ROOT, and will interact with Python and C++ programming to resolve practical problems and experiment with data-analysis algorithms.