Pentaho Assessment Report
This topic contains information about the Pentaho assessment report. The assessment assesses workloads and produces in-depth insights that help plan the migration. The input format for Pentaho assessment is KJB and KTR file formats.
In This Topic:
Highlights
The highlights section gives you a high-level overview of your assessment summary of the analytics performed on the selected workloads. It includes information about the components, steps, and queries.
Summary
This section summarizes the input Pentaho scripts analyzed at job and transformation level. It includes information about files, jobs, transformations, queries, and entities.
- Files: Displays the number of source files.
- Jobs: A job is a set of tasks that control the system for scheduling, monitoring, and reporting. It can be in the form of command, file, or script.
- Transformations: Transformation is a process of moving and transforming workloads from source to target.
- Queries: Displays the number of queries.
- Entities: Displays the number of entities.
Jobs
This section showcases an overview of active and inactive components existing in the source file along with their complexity.
- Total Components: Displays the total number of components in the jobs.
- Active Components: Displays the number of components in the jobs that are engaged in the dataflow.
- Inactive Components: Inactive elements or the components that are not engaged or used in the source files. Displays the number of components in the jobs that are not engaged in the dataflow.
Transformation
This section showcases an overview of active and inactive steps along with their complexity.
- Total Steps: Displays the total number of steps in the transformation.
- Active Steps: Displays the number of steps in the jobs that are active in the dataflow.
- Inactive Steps: Displays the number of steps in the jobs that are inactive in the dataflow.
Queries
It displays a synopsis of analyzed queries along with information about unique, parsed, and unparsed queries.
- Unique Queries: Displays the number of unique queries. In Unique Queries, the duplicate queries are eliminated, and solitary queries are identified.
- Parsed Queries: Displays the number of parsed queries.
- Unparsed Queries: Displays the number of unparsed queries.
Query Database Types
This section provides an overview of different query database types within the entire inventory.
Entities
This section provides an overview of tables within the entire inventory.
Analysis
This topic provides detailed information about the source files.
Files
This section provides a comprehensive report of the source files along with information about databases, jobs, transformations, and so on.
- File Name: Displays the name of the file.
- Database: Displays the number of associated databases.
- Jobs: Displays the number of jobs. It is a set of tasks that control the system for scheduling, monitoring, and reporting.
- Transformations: Displays the number of transformations. It is a process of moving and transforming workloads from source to target.
- Queries: Displays the number of queries.
- Complexity: Displays the file complexity.
Downloadable Reports
Downloadable reports allow you to export detailed Pentaho assessment reports of your source data which enables you to gain in-depth insights with ease. To access these assessment reports, click Reports.
Types of Reports
In the Reports section, you can see various types of reports such as Insights and Recommendations, and Source Inventory Analysis. Each report type offers detailed information allowing you to explore your Pentaho assessment results.
Insights and Recommendations
This report provides an in-depth insight into the Pentaho source input files. It contains the final output including the details of queries, complexity, jobs, transformations, and so on.
Pentaho_Assessment_Complete_Report.xlsx: This report provides insights about the source inventory. It helps you plan the next frontier of a modern data platform methodically. It includes information about jobs, transformation, databases, and more.
This report contains the following information:
- Report Summary: Provides information about all the generated artifacts.
- Volumetric Info: Presents a summary of the aggregated inventory after analyzing the source files. For instance, it provides volumetric information about the total number of jobs, transformations, components, and likewise. It also provides transformation and job-level complexity.
- Job Summary: Lists all the jobs associated with the input files. It also provides statistical information about components, queries, connections, jobs, and a lot more.
- Transformation Summary: Provides transformation details along with information about the number of queries, connections, transformations called, jobs called, and so on.
- DB Summary: Provides information about databases along with statistical information about jobs, transformations, and so on.
Source Inventory Analysis
It is an intermediate report which helps to debug failures or calculate the final report. It includes all the generated CSV reports.
transformation_detail_pentaho.csv: This report provides information about transformations along with steps, step types, queries, and more.
Browse through the csv folder to access the entity.csv, external_files_detail_report.csv, ExternalFiles.csv, and query_complexity.csv reports.
entity.csv: This report provides information about tables along with its type, database type, and more.
external_files_detail_report.csv: This report provides information about external files along with its type and availability.
ExternalFiles.csv: This report provides information about external files along with its type, directory, executable file path, and command.
query_complexity.csv: This report provides information about queries including analyzed status, query parsing status, complexity, and more. If the analyzed status is TRUE, it indicates that the query is analyzed successfully. Conversely, a FALSE status indicates that the query is not analyzed.