Course intended for
Course main audience are software engineers and data analysts, who want to learn the basis of Big Data processing, which surpasses the capabilities of traditional processing, using tools of Apache Spark family. The course is both for people interested in starting to work with Big Data, as well as, people with previous experience in other Big Data systems, such as Apache Hadoop, who want to learn new technology.
The attendees will learn about new problems that arise during Big Data analysis from various sources using Apache Spark family tools. During the course a general set of typical Big Data problems and their solutions using Apache Spark will be presented. Moreover, the attendees will have a general overview of the pros and cons of using Apache Spark for their business problem solving. In addition, the course allows the attendees to familiarize themselves with fast-moving Big Data processing field and the novel approach to problem solving that Apache Spark presents.
Course intended for:
The training is aimed at developers, architects and application administrators who want to create or maintain systems based on scalable architectures such as Big Data, in particular specialists for whom performance and volume of processed data is the highest priority. The course is aimed at people currently involved in creation of relational databases who want to gain knowledge of alternative technologies, which gradually displace relational databases from different application areas. The training is also dedicated for people who want to supplement their knowledge of concepts of Big Data, MapReduce, NoSQL and their implementation using Apache Hadoop & Family software.
Course participants will gain knowledge of cross-cutting concepts such as MapReduce algorithm, Big Data, BigTable, DFS distributed file systems, NoSQL databases. After the training, course participants will be able to choose the right techniques for their projects. In addition to general introduction to Big Data, this training is focusing on the entire Apache Hadoop stack.
Course intended for
Training course is intended mainly for programmers/developers and data analysts who want to learn the basics of Big Data processing. Tools from Apache Storm and Spark family can be used to process very big sets of data in (almost) real time. The training offers a good background knowledge both for those who want to start working with Big Data stream processing and for those who are already experienced in such systems, e.g. Apache Hadoop, and want to learn this new technology.
Participants will learn about the problem of analysis of very big sets of data (Big Data) in (almost) real time from various sources. During the course a basic set of problems with stream processing of Big Data will be presented along with the solutions using the tools from Apache Storm and Spark family. What is more, participants will be made aware of the advantages and disadvantages of these technologies when approaching real-world, business-related problems.