Apache Kylin: Real-Time OLAP on Big Data Training Course
Apache Kylin is a distributed analytics engine designed to provide OLAP (Online Analytical Processing) capabilities on big data. With the release of version 5.0, Kylin introduces real-time streaming support, enabling sub-second query responses on streaming data sources like Kafka. The platform offers a powerful SQL interface, seamless integration with BI tools, and a redesigned web UI for intuitive modeling and analysis.
This instructor-led, live training (online or onsite) is aimed at intermediate-level big data professionals who wish to utilize Apache Kylin for building real-time data warehouses and performing multidimensional analysis on large-scale datasets.
By the end of this training, participants will be able to:
- Set up and configure Apache Kylin with real-time streaming data sources.
- Design and build OLAP cubes for both batch and streaming data.
- Perform complex queries with sub-second latency using Kylin's SQL interface.
- Integrate Kylin with BI tools for interactive data visualization.
- Optimize performance and manage resources effectively in Kylin.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction to Apache Kylin
- Overview of OLAP and its significance in big data analytics
- Evolution of Apache Kylin and its architecture
- Key features and capabilities of Kylin 50
Setting Up Apache Kylin
- Installation prerequisites and environment setup
- Configuring Kylin with Hadoop, Spark, and Kafka
- Understanding Kylin's web UI and command-line tools
Data Modeling in Kylin
- Designing star and snowflake schemas for OLAP cubes
- Defining dimensions and measures
- Creating and managing data models in Kylin's web UI
Building and Managing Cubes
- Cube building process and job management
- Incremental builds and auto-merge strategies
- Monitoring cube health and performance
Real-Time Streaming with Kylin
- Integrating Kafka as a streaming data source
- Setting up real-time cubes and fusion models
- Achieving low-latency analytics with streaming data
Querying and Analysis
- Executing SQL queries using Kylin's query interface
- Connecting BI tools (eg, Tableau, Power BI) to Kylin
- Performing multidimensional analysis and drill-downs
Performance Optimization
- Best practices for cube design and aggregation
- Resource management and tuning for scalability
- Troubleshooting common performance issues
Advanced Topics
- Security and access control in Kylin
- Extending Kylin with custom plugins and integrations
- Exploring Kylin's REST APIs for automation
Summary and Next Steps
Requirements
- An understanding of Hadoop and big data ecosystems
- Familiarity with SQL and data warehousing concepts
- Basic knowledge of streaming data platforms like Kafka
Audience
- Big data engineers seeking to implement real-time analytics solutions
- Data analysts aiming to leverage OLAP capabilities on large datasets
- Data warehouse architects interested in modernizing their infrastructure
Open Training Courses require 5+ participants.
Apache Kylin: Real-Time OLAP on Big Data Training Course - Booking
Apache Kylin: Real-Time OLAP on Big Data Training Course - Enquiry
Apache Kylin: Real-Time OLAP on Big Data - Consultancy Enquiry
Consultancy Enquiry
Testimonials (5)
Hands-on examples allowed us to get an actual feel for how the program works. Good explanations and integration of theoretical concepts and how they relate to practical applications.
Ian - Archeoworks Inc.
Course - ArcGIS Fundamentals
All the topics which he covered including examples. And also explained how they are helpful in our daily job.
madduri madduri - Boskalis Singapore Pte Ltd
Course - QGIS for Geographic Information System
how the trainor shows his knowledge in the subject he's teachign
john ernesto ii fernandez - Philippine AXA Life Insurance Corporation
Course - Data Vault: Building a Scalable Data Warehouse
Actual application of spotfire and all basic functions.
Michael Capili - STMicroelectronics, Inc.
Course - Introduction to Spotfire
The thing I liked the most about the training was the organization and the location
Hamid Tuama - Ability with Innovation General Contracting (DMCC Branch)
Course - ArcGIS for Spatial Analysis
Upcoming Courses
Related Courses
Advanced Data Analysis with TIBCO Spotfire
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Share visualizations among different team members.
- Secure access to software based on roles and access controls.
- Create visualizations such as map charts.
- Integrate statistical computing languages such as R with Spotfire.
ArcGIS for Spatial Analysis
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at field ecologists and conservation managers who wish to create data spatial projects in ArcGIS.
By the end of this training, participants will be able to:
- Output spatial data as visualizations.
- Conduct geostatics on actual data.
- Implement spatial data analysis, data processing, and mapping with ArcGIS.
- Analyze spatial data for projects in ArcGIS.
ArcMap in ArcGIS
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at intermediate-level environmental scientists and archaeologists who wish to learn how to use ArcMap in ArcGIS for collecting, organizing, managing, and analyzing geographic information.
By the end of this training, participants will be able to:
- Understand the fundamentals of ArcMap and ArcGIS.
- Collect, organize, manage, and analyze geographic information on the social and archaeological elements.
- Conduct spatial queries for impact evaluation.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at beginner-level to advanced-level GIS professionals and analysts who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilize ArcGIS tools for geospatial data management.
- Create and customize maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
ArcGIS Pro for Spatial Analysis
14 HoursBy the end of this training, participants will be able to:
- Understanding GIS concepts and spatial data types using the latest version of ArcGIS Pro.
- Investigate the user interface and evaluate the uses
- Explore ArcGIS* Pro and how to utilize online content
- Manage, manipulate and analyze your own data
- Display and share your data in a meaningful and creative way
Note: We will use the latest version of ArcGIS Pro.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at advanced-level GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modeling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
ArcGIS with Python Scripting
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at GIS analysts who wish to automate repetitive tasks in GIS processes.
By the end of this training, participants will be able to:
- Build GIS applications using Python and ArcGIS tools.
- Develop with the ArcGIS package ArcPy, using Python.
- Apply the ArcGIS modules for map automation using object classes in Python.
Data Vault: Building a Scalable Data Warehouse
28 HoursIn this instructor-led, live training in Panama, participants will learn how to build a Data Vault.
By the end of this training, participants will be able to:
- Understand the architecture and design concepts behind Data Vault 2.0, and its interaction with Big Data, NoSQL and AI.
- Use data vaulting techniques to enable auditing, tracing, and inspection of historical data in a data warehouse.
- Develop a consistent and repeatable ETL (Extract, Transform, Load) process.
- Build and deploy highly scalable and repeatable warehouses.
QGIS for Geographic Information System
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
QGIS functions as geographic information system (GIS) software, allowing users to analyze and edit spatial information, in addition to composing and exporting graphical maps. QGIS supports both raster and vector layers; vector data is stored as either point, line, or polygon features. Multiple formats of raster images are supported, and the software can georeference images. To summarize it allows the users to Create, edit, visualise, analyse and publish geospatial information on Windows, Mac, Linux, BSD.
This program, in its first phase, introduces the QGIS interface for general usage. In the second phase, we introduce PyQGIS - the python libraries of QGIS that allows the integration of GIS functionalities in your python code or your python application, so that you may even create your own Python Plugin around a particular GIS functionality.
Introduction to Spotfire
14 HoursThis instructor-led, live training in Panama (online or onsite) is aimed at business analysts and data analysts who wish to learn basic Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Install and configure TIBCO Spotfire.
- Combine data from different databases.
- Visualize large datasets.
- Create and share complex dashboards.