Have you heard of Hadoop but don’t know what it is or what it does?
Or do you know that Hadoop is used to store very large datasets but you don’t know what it can do or why it might be relevant to you?
If so, this webinar is for you.
This webinar will provide an overview of Hadoop, including:
- Hadoop and Hadoop clusters
- why Hadoop can process datasets far larger than those comfortable inside desktop applications
- some key ‘add-in’ products and what they are used for (e.g. Hive for data manipulation on a grand scale and Spark for statistical analysis)
- a demonstration of using the Hive package to process a large dataset into something more manageable on the desktop
This webinar is intended for researchers with no in-depth knowledge of programming with data.
The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.
Further information and booking: https://ukdataservice.ac.uk/news-and-events/eventsitem/?id=4434