Skip to the content
LeaplogicLeaplogic
  • Home
  • About Us
  • Contact
SIGN IN
  • Home
  • About Us
  • Contact

  • Getting Started
    • Before You Begin
    • Creating an Account
    • Logging into LeapLogic
    • Reset Password
    • Quick Tour of the Web Interface
    • LeapLogic in 15 minutes
      • Prerequisites
      • Step 1. Log into LeapLogic
      • Step 2. Create Assessment and Get Insights
      • Step 3. Create Transformation Pipeline and See Results
      • Step 4. Edit or Optimize the Transformed Code
      • Step 5: Complete the Transformation Lifecycle
  • Introduction to LeapLogic
    • Overview
    • High Level Architecture
    • Supported Legacy and Cloud Platforms
    • Key Features
  • Workload Assessment
    • Overview
    • Value Proposition
    • Creating Assessment
      • Prerequisites
      • Step 1. Provide Primary Inputs
        • Automation Coverage
      • Step 2. Add the Additional Inputs
        • Table Stat Extraction Steps
          • Teradata
          • Oracle
          • Netezza
      • Step 3. Update the Source Configuration
      • Step 4. Configure the Recommendation Settings
    • Assessment Listing
    • Understanding Insights and Recommendations
      • Volumetric Info
      • EDW
        • Oracle
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Vertica
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Snowflake
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Azure Synapse
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • SQL Server
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Teradata
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Netezza
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Google Big Query
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Redshift
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • PostgreSQL
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Duck DB
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • ClickHouse
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • Exasol
          • Highlights
          • Analysis
          • Optimization
          • Lineage
          • Recommendations
          • Downloadable Reports
        • DB2
          • Highlights
          • Analysis
          • Optimization
          • Recommendations
          • Lineage
          • Downloadable Reports
      • ETL
        • Informatica
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Ab Initio
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • DataStage
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Talend
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • SSIS
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Informatica BDM
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Oracle Data Integrator
          • Highlights
          • Analysis
          • Downloadable Reports
        • Pentaho
          • Highlights
          • Analysis
          • Downloadable Reports
        • Azure Data Factory
          • ARM Template
          • Highlights
          • Analysis
          • Downloadable Reports
        • Matillion
          • Highlights
          • Analysis
          • Downloadable Reports
        • SnapLogic
          • Highlights
          • Analysis
          • Downloadable Reports
      • Orchestration
        • AutoSys
          • Highlights
          • Analysis
          • Downloadable Reports
        • Control-M
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • SQL Server
          • Highlights
          • Analysis
      • BI
        • OBIEE
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Tableau
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • IBM Cognos
          • Highlights
          • Analysis
          • Downloadable Reports
        • MicroStrategy
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • Power BI
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • SSRS
          • Highlights
          • Analysis
          • Downloadable Reports
        • SAP BO
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
        • WebFOCUS
          • Highlights
          • Analysis
          • Downloadable Reports
      • Analytics
        • SAS
          • Highlight
          • Analysis
          • Lineage
          • Downloadable Reports
        • Alteryx
          • Highlights
          • Analysis
          • Lineage
          • Downloadable Reports
      • Integrated Assessment (EDW, ETL, Orchestration, BI)
        • Highlights
        • Analysis
        • Optimization
        • Lineage
        • Recommendations
    • Managing Assessment Reports
      • Downloading Report
      • Input Report Utility
      • View Configuration
    • Complexity Calculation Logic
    • Key Benefits
    • Ad hoc Query
  • Metadata Management
    • Overview
    • Introduction to Data Catalog
      • Managing Data Catalog
        • Building Data Catalog
        • Insights to Data Catalog
        • Managing the Repository and Data Source
      • Creating Repository (Repo)
      • Creating Data Source
    • Tag Management
    • Key benefits
  • Batch Processing using Pipeline
    • Introduction
    • Designing Pipeline
      • How to create a pipeline
        • Configuring Migration Stage
          • Schema Optimization
        • Configuring Transformation Stage
          • On-premises to Cloud
          • Cloud-to-Cloud
          • LeapLogic Express
        • Configuring Validation Stage
          • Data Validation
            • Table
            • File
            • File and Table
            • Cell-by-cell validation
          • Query Validation
            • Query Validation (When Data is Available)
            • Query Validation (When Data is Not Available)
          • Schema Validation
        • Configuring Execution Stage
        • Configuring ETL Conversion Stage
          • Ab Initio
          • Informatica
          • Informatica BDM
          • Matillion
          • DataStage
          • SSIS
          • IICS
          • Talend
          • Oracle Data Integrator
          • Pentaho
          • SnapLogic
        • Configuring Mainframe Conversion Stage
          • Cobol
          • JCL
        • Configuring Orchestration Stage
          • AutoSys
          • Control-M
        • Configuring BI Conversion Stage
          • OBIEE to Power BI
          • OBIEE to AWS QuickSight
          • Tableau to Amazon QuickSight
          • Tableau to Power BI
          • Tableau to Superset
          • Tableau to Looker
          • IBM Cognos to Power BI
        • Configuring Analytics Conversion Stage
          • SAS
          • Alteryx
        • Configuring Script Conversion Stage
    • Key Features
      • How to schedule a pipeline
      • Configuring Parameters
  • Pipeline Reports
    • Overview of Pipeline Report
    • Pipeline Listing
    • Reports and Insights
      • Migration
      • Transformation
        • On-premises to Cloud
        • Cloud-to-Cloud
        • LeapLogic Express
      • Validation
        • Data
          • File
          • Table
          • File and Table
        • Query
          • Query Validation Report (When Data is Available)
          • Query Validation Report (When Data is not Available)
        • Schema
      • Execution
      • ETL
        • Ab Initio
        • Informatica
        • Informatica BDM
        • Matillion
        • DataStage
        • SSIS
        • IICS
        • Talend
        • Oracle Data Integrator
        • Pentaho
        • SnapLogic
      • Mainframe
        • Cobol
        • JCL
      • Orchestration
        • AutoSys
        • Control-M
      • BI
        • OBIEE to Power BI
        • OBIEE to Amazon QuickSight
        • Tableau to Amazon QuickSight
        • Tableau to Power BI
        • Tableau to Superset
        • Tableau to Looker
        • IBM Cognos to Power BI
      • Analytics
        • SAS
        • Alteryx
      • Shell Script
      • Common Model
    • Automation Level Indicator
      • ETL
        • Informatica
        • Matillion
        • DataStage
        • Informatica BDM
        • SnapLogic
        • IICS
        • Ab Initio
        • SSIS
        • Talend
        • Pentaho
      • Orchestration
        • AutoSys
        • Control-M
      • EDW
      • Analytics
        • SAS
        • Alteryx
      • BI
      • Shell Script
    • Error Specifications & Troubleshooting
  • SQL Transformation
    • Overview
    • Creating and Executing the Online Notebook
      • How to Create and Execute the Notebook
      • Supported Features
    • Configuring the Notebook
      • Transformation
      • Unit Level Validation
      • Script Level Validation
    • Notebook Listing
  • Operationalization
    • Overview
      • Basic
      • Advanced
      • Cron Expression
    • Parallel Run Pipeline Listing
  • Transformation Source
    • Introduction
    • Creating Transformation Source Type
  • Governance
    • Summary of Governance - Roles and Permissions
    • User Creation
      • Creating a new User Account
    • Adding Roles and permissions
      • How to add Roles and Permissions to a new user?
    • Adding Group Accounts
    • Default Quota Limits
    • Product Usage Metrics
  • License
    • EDW
    • ETL
  • LeapLogic Desktop Version
    • Overview
    • Registration and Installation
    • Getting Started
    • Creating Assessment
      • ETL
      • DML
      • Procedure
      • Analytics
      • Hadoop
    • Reports and Insights
      • Downloadable Reports
      • Reports for Estimation
    • Logging and Troubleshooting
    • Sample Scripts
    • Desktop vs. Web Version
    • Getting Help
  • LeapLogic (Version 4.8) Deployment
    • System Requirements
    • Prerequisites
    • Deployment
      • Extracting Package
      • Placing License Key
      • Executing Deployment Script
      • Accessing LeapLogic
    • Uploading License
    • Appendix
    • Getting Help
  • Removed Features
    • Configuring File Validation Stage
    • Variable Extractor Stage
      • Variable Extractor Report
    • Configuring Meta Diff Stage
      • Meta Diff
    • Configuring Data Load Stage
      • Data Load
    • Configuring Multi Algo Stage
  • FAQs
  • Tutorial Videos
  • Notice
Home   »  Workload Assessment  »  Complexity Calculation Logic

Complexity Calculation Logic

This topic describes the logic for calculating complexity. Based on the count of certain applicable patterns as per the workload type, the intelligent assessment engine calculates the workload complexity. As per the input workload, it may also consider variables, lines of code, expression length, or number of queries per job, etc.

In This Topic:

  • EDW – Query (Teradata, Oracle, SQL Server, Vertica, Netezza, BigQuery, Snowflake, Redshift)
  • EDW – Procedural Code
  • ETL (Informatica, DataStage, Ab Initio, Talend, SSIS, Matillion, ODI)
  • Analytics (SAS, Alteryx)
  • BI (OBIEE, IBM Cognos, MicroStrategy, SAP BO, Tableau, SSRS, Power BI)
  • Orchestration (AutoSys, Control-M)
  • Hadoop


EDW – Query

The below table describes the logic for calculating query complexity. It calculates the query complexity on the basis of minimum/maximum occurrences of functions, operations, sub-queries, joins, total length of the query, case clauses, and nested sub-queries.

Category Sub Category Min Max Complexity Number
Recursive Recursive 1 1 5
Window Functions windowfunctions 1 4 1
windowfunctions 5 10 2
windowfunctions 11 3
Aggregate Functions aggregatefunctions 1 4 1
aggregatefunctions 5 10 2
aggregatefunctions 11 3
Union union 1 4 0.5
union 5 2
Operation minus 1 1 1
minus 2 2
Sub Query clause subquery with select clause 1 2 1
subquery with select clause 3 4 1.5
subquery with select clause 5 5 2
subquery with select clause 6 6 2.5
subquery with select clause 7 3
subquery with where clause 1 2 0.5
subquery with where clause 3 4 1
subquery with where clause 5 6 2.5
subquery with where clause 7 3
subquery with from clause 1 2 0.5
subquery with from clause 3 4 1
subquery with from clause 5 2
subquery with other clause 1 2 1
subquery with other clause 3 4 2
subquery with other clause 5 3
subquery with in /anyall/exists 1 2 0.5
subquery with in /anyall/exists 3 4 1.5
subquery with in /anyall/exists 5 3
Join cross join 1 2 0.5
cross join 3 4 1
cross join 5 6 2
cross join 7 3
inner join 1 2 0.5
inner join 3 5 1
inner join 6 8 2
inner join 9 3
outer join 1 2 0.5
outer join 3 4 2
outer join 5 6 3
outer join 7 8 4
outer join 9 5
Order By order by columns/(having/groupby) derived columns 2 1
Length length 5001 10000 1
length 10001 20000 2
length 30001 40000 3
length 40001 50000 4
length 50001 5
Join Condition where and join condition 1 5 0
where and join condition 6 10 0.5
where and join condition 10 20 1
where and join condition 21 30 2
where and join condition 31 40 3
where and join condition 40 4
Case Clause case 1 2 0.5
case 3 6 1
case 6 1.5
Sub Query nested subquery 1 1 1
nested subquery 2 2 2
nested subquery 3 3 3
nested subquery 4 4 4
nested subquery 5 5
select* 1


EDW – Procedural Code

The below table describes the logic for calculating PL/SQL complexity. It calculates the PL/SQL complexity based on the number of queries, conditions, cursors, goto statements, loops, and lines of code.

Complexity Total Query Count Condition Count Cursor Count GoTo Count Line Of Code Loop Count
Trivial <=10 <=2 <=1 <=2 <=100 <=1
Simple >=11 and <=20 >2 and <=5 =2 >2 and <=5 >=101 and <=350 =2
Medium >=21 and <=40 >=6 and <=10 >=3 and <=5 >=6 and <=10 >=351 and <=1000 >=3 and <=5
Complex >=41 and <=60 >=11 and <=20 >=6 and <=10 >=11 and <=20 >=1001 and <=2000 >=6 and <=10
Very Complex >60 >20 >10 >20 >2000 >10


ETL

The below tables describe the logic for calculating ETL complexity.

Informatica

The below table describes the logic for calculating Informatica complexity. It calculates the complexity on the basis of components, transformations, expressions, variables, running aggregate functions, and length of columns.

Script Severity Components & Inputs/output Active/Passive Transformations Expressions Variables Running Agg Out Column Len Other Transformations
Simple <=30 <=24 <=5 <=10 <=2 <=1000 <= 1
Medium >30 and <= 100 >24 and <= 88 >5 and <=10 >10 and <=30 >2 and <=6 >1000 and <=4000 >1 and <= 2
Complex >100 >88 >10 >30 >6 >4000 >2


DataStage

The below table describes the logic for calculating DataStage complexity. It calculates the complexity based on number of components, job type, transformations, and more.

Complexity Job Type Total Components Active/ Passive Transformations Other Transformation CTransformation Stage
Simple Parallel/ Mainframe/ Server job <=30 <=24 <=1 <=5
Medium Parallel/ Mainframe/ Server job <=100 <=88 <=2 <=10
Complex Parallel/ Mainframe/ Server job >100 >88 >2 >10


Ab Initio

The below table describes the logic for calculating Ab Initio complexity. It calculates the complexity on the basis of number of components, lookup files, ebcdic files, dml, and xfr directory.

Script Severity Component Count Lookup File Count Ebcdic File Count Complex DML Count Complex XFR Count
Simple <=20 <=2 <=2 <=2 <=2
Medium >20 and <=50 >2 and <=3 >2 and <=3 >2 and <=3 >2 and <=3
Complex > 50 >=4 >=4 >=4 >=4


Talend

The below table describes the logic for calculating Talend complexity. It calculates the complexity on the basis of tmaps, expression length, and components.

Graph Severity Tmap Count Expression Length Component Count
1 <=4 <=500 <=15
2 >4 & <=10 >500 & <=1000 >15 & <=30
3 >10 & <=30 >1000 & <=4000 >30 & <=100
4 >30 & <=60 >4000 & <=10000 >100 & <=200
5 >60 & <=120 >10000 & <=20000 >200 & <=300


SSIS

The below table describes the logic for calculating SSIS complexity. It calculates the complexity on the basis of components, event handlers, constraints, loops, and microsoft.pipeline.

Graphs Severity (1-5 Scale) Total Components Total Event Handlers Total Constraint For Each Loop/For Loop Microsoft.Pipeline
Trivial (1) <=30 <=5 <=5 0 <=1
Simple (2) >30 & <=80 >5 & <=10 >5 & <=10 1 <=5
Medium (3) >80 & <=150 >10 & <=15 >10 & <=15 <=3 <=20
Complex (4) >150 & <=200 >15 & <=20 >15 & <=20 <=6 <=40
V Complex (5) >200 >20 >20 >6 >40


Matillion

The below table describes the logic for calculating Matillion complexity. It calculates the complexity on the basis of joins, formulas, components, and more.

Category Simple Medium Complex Very Complex
Number of SQL Queries <5 5-8 9-12 >12
Number of Joins <5 5-10 11-30 >30
Number of Iterators <5 5-10 11-20 >20
Number of Custom Logic 0 1-4 5-20 >20
Number of Formulas / Calculators <10 10-12 13-40 >40
Number of Aggregations <5 5-10 11-30 >30
Number of Filters/ Conditions <10 11-15 16-50 >50
Number of Python Scripts <5 5-10 11-20 >20
Number of Orchestration <5 5-10 11-12 >12
Number of Components <60 61-100 101-250 >250


ODI

The below table shows the complexity matrix for ODI packages.

Complexity Trivial Simple Medium Complex V Complex V. V. Complex
Complexity Number Range <=3 <=6 <=9 <=15 <=20 >20


Analytics

The below tables describe the logic for calculating the complexity of analytics scripts such as SAS and Alteryx.

SAS

The below table describes the logic for calculating SAS complexity. It calculates the SAS complexity based on the number of SQL statements, external resources, SQL lines, conditional procedural statements, and more.

Complexity Total SQL Statement External Resource Total Sql Lines Total Create Macro Lines External Acccess Conditional Procedural Statement Proc Base SAS Proc Other Advanced Proc SAS ETS
Simple (1) <4 <4 200 200 4 3 4 1 1
Medium (2) <8 10 1000 500 8 50 10 2 2
High (3) <50 30 2000 1000 50 100 40 5 5
Very High (4) <100 80 3000 2000 100 500 100 10 10
Very Very High (5) >500 500 10000 10000 500 1000 500 50 50


Complexity Proc SAS STAT Proc SAS OR Proc SAS ODS Proc SQL Count Total Source Code Lines Total Statement Count Data Statement Count Create Macros Total Statement Count SAS Script Generation Statement Count
Simple (1) 1 1 1 4 100 50 3 8 50 0
Medium (2) 2 2 2 8 500 100 10 15 200 0
High (3) 5 5 5 50 1000 200 50 50 500 0
Very High (4) 10 10 10 100 2000 500 200 100 1000 0
Very Very High (5) 50 50 50 500 5000 1000 500 500 5000 1


Alteryx

The below table describes the logic for calculating Alteryx complexity. It calculates the complexity on the basis of nodes, length of expressions, and more.

Script Severity Total Nodes Expression Length AIML NLP Reporting SAS
Trivial <=15 <=500 0 0 0 0
Simple >15 & <=30 >500 & <=1000 0 0 0 0
Medium >30 & <=100 >1000 & <=4000 <=1 0 0 0
Complex >100 & <=200 >4000 & <=10000 >1 & <=5 <=1 <=1 0
Very Complex >200 >10000 >5 >1 >1 >0

BI

The below tables describe the logic for calculating BI complexity.

OBIEE

The complexity of OBIEE reports depends on the number of columns, filters, views, or prompts. There are three complexity levels: simple, medium, and complex. The overall complexity of the report is determined by the highest complexity among the individual components.

The below table shows the complexity matrix for OBIEE reports.

Complexity Number of columns Number Of Filters Number Of Views Number Of Prompts
Simple <= 5 <=1 <=2 <=1
Medium >5 AND <= 10 >1 AND <=2 >2 AND <=4 >1 AND <=2
Complex > 10 >2 >4 >2

For example, if the report has 4 columns (which falls under simple complexity), 1 view (which falls under simple complexity), and 3 filters (which falls under complex complexity), then overall report complexity is considered complex. This is because the system will always consider the highest complexity level among the individual components when determining overall report complexity.



IBM Cognos

The complexity of IBM Cognos is determined by calculating the object’s corresponding score and comparing it to predefined complexity values.

The following table describes the predefined values for complexity calculation for IBM Cognos.

Complexity Overall Score
Simple <=3
Medium >3 and <=8
Complex > 8

The following table displays the objects and its corresponding values.

Objects Corresponding Score
Page 1
Visual 1
Field count in report across the queries <= 5 0
Field count in report across the queries > 5 and <= 10 1
Field count in report across the queries > 10 and <= 20 2
Field count in report across the queries > 20 4
1 package 0
2 packages 2
> 2 packages 4
Custom Queries 4 + sum of complexity score of queries involved



MicroStrategy

The table below describes the logic for calculating the dataset complexity of MicroStrategy. It calculates the complexity based on the count of columns, expressions, tables, and join statements.

Dataset Complexity Total Column Count Expression Count (Filter Included) Table Count Join Count
Simple 100 30 1-10 <=10
Medium 101-200 31-60 11-20 <=20
Complex 201+ 61+ 21+ >21

The table below describes the logic for calculating the visual complexity of MicroStrategy. It calculates the complexity based on the count of columns, measures, and filters.

Visual Complexity Column Count (Dimensions and Masures) Measure Count Filter Count
Simple         <5                  <3           <3
Medium        5-10                 3-6          3-5
Complex         >10                   >6            >5



SAP BO

The table below describes the logic for calculating the dataset complexity of SAP BO. It calculates the complexity based on the count of columns, expressions, tables, and join statements.

Dataset Complexity Total Column Count Expression Count (Filter Included) Table Count Join Count
Simple 100 30 1-10 <=10
Medium 101-200 31-60 11-20 <=20
Complex 201+ 61+ 21+ >21

The table below describes the logic for calculating the visual complexity of SAP BO. It calculates the complexity based on the count of columns, measures, and filters.

Visual Complexity Column Count (Dimensions and Masures) Measure Count Filter Count
Simple         <5                  <3           <3
Medium        5-10                 3-6          3-5
Complex         >10                   >6            >5



Tableau

The table below describes the logic for calculating the dataset complexity of Tableau. It calculates the complexity based on the count of columns, expressions, tables, and join statements.

Dataset Complexity Total Column Count Expression Count (Filter Included) Table Count Join Count
Simple 100 30 1-7 <=6
Medium 101-200 31-60 8-15 <=14
Complex 201+ 61+ 16+ >15

The table below describes the logic for calculating the visual complexity of Tableau. It calculates the complexity based on the count of columns, measures, and filters.

Visual Complexity Column Count (Dimensions and Masures) Measure Count Filter Count
Simple         <5                  <3           <3
Medium        5-10                 3-6          3-5
Complex         >10                   >6            >5



SSRS

The table below describes the logic for calculating the visual complexity of SSRS. It calculates the complexity based on the count of columns, calculated columns, filters and parameters.

Visual Complexity Total Column Count Calculated Columns Count Filter and Parameters count Comments
Simple         <5                  <3           <3 Sub-reports should be considered as separate reports
Medium        5-10                 3-6          3-5
Complex         >10                   >6            >5



Power BI

The table below describes the logic for calculating the dataset complexity of Power BI. It calculates the complexity based on the count of columns, measures, tables, and relationships.

Dataset Complexity Total Column Count Measure Count Table Count Relationship Count
Simple 0-50 0-14 1-5 <=4
Medium 51-150 15-50 06-15 <=14
Complex 150+ 50+ 15+ >14

The table below describes the logic for calculating the visual complexity of Power BI. It calculates the complexity based on the count of columns, measures, and filters.

Visual Complexity Column Count Measure Count Filter Count
Simple         0-3                  <3           <4
Medium        4-8                 3-6          4-9
Complex         >8                   >6            >9



Orchestration

The below tables describe the logic for calculating the complexity of orchestration scripts such as AutoSys and Control-M.

AutoSys

The below table describes the logic for calculating AutoSys job complexity.

Job Complexity Condition Custom Calendar Box scheduled jobs (Job and Parent scheduled at diff time) Comment
Simple No No No If the three objects (Condition, Custom calendar, and Box scheduled jobs) are absent, then the complexity is Simple
Medium If one or two of the objects (Condition, Custom calendar, Box scheduled jobs) are present, then the complexity is Medium
Complex Yes Yes Yes If all three objects (Condition, Custom calendar, Box scheduled jobs) are present, then the complexity is Complex

The box complexity is determined based on the highest complexity factor of the box or the jobs within the box. If the box contains multiple jobs, the job complexity factor is the average of the complexities of all the jobs in the box.

The below table describes the logic for calculating AutoSys box complexity.

Box Complexity Condition Custom Calendar Box scheduled jobs (Job and Parent scheduled at diff time) Comment
Simple No No No If the three objects (Condition, Custom calendar, and Box scheduled jobs) are absent, then the complexity is Simple
Medium If one or two of the objects (Condition, Custom calendar, Box scheduled jobs) are present, then the complexity is Medium
Complex Yes Yes Yes If all three objects (Condition, Custom calendar, Box scheduled jobs) are present, then the complexity is Complex
* If the box contains multiple jobs, then the box complexity is determined based on the highest complexity factor of the box or the jobs within the box

The complexity levels are categorized as Simple (1), Medium (2), and Complex (3). Let’s see a few scenarios to calculate the box complexity if it contains multiple jobs.

If a box has Simple (1) complexity and that box contains 3 jobs – J1, J2, and J3 with 1, 2, and 3 complexity values respectively, then the box and job complexity factors are as follows:

  • The box complexity factor is 1, because the box complexity is Simple.
  • The job complexity factor is 2, because average job complexity is 2 (sum of job complexity values/number of jobs).

Here, the complexity of box is considered as Medium (2) because the job complexity factor (2) is higher than the box complexity factor (1).

Similarly, if box complexity factor is 3 and the job complexity factor is 2, then the box complexity is considered as Complex (3) because the box complexity factor (3) is higher than the job complexity factor (2).


Control-M

The below table describes the logic for calculating Control-M folder and job complexity.

Folder and Job Complexity Frequency Dependencies Comments
Simple Every Day: Scheduled the job to run on every day <=1 The complexity of regular folder is always simple.
Medium Specific Dates: Scheduled the job to run on specific dates such as Dec 30, June 15, etc. 2 or 3
The complexity of Smart folder is determined as Medium or Complex based on the highest number of complexity of jobs present in it.
  • If the frequency of a Smart folder is Advanced schedule (scheduled to run on weekdays or specific days of the month), then the complexity is Complex.
  • If the frequency of a Smart folder is Every Day (scheduled to run on every day), then the complexity is Medium. If that Smart folder contains jobs with complexity higher than the Smart folder, then its complexity is determined based on the highest number of jobs’ complexity.
  • For instance, if the frequency of a Smart folder is Medium and that Smart folder contains 3 jobs – J1, J2, and J3 with Complex complexity for each job, then the complexity of Smart folder is Complex. Similarly, if the frequency of a Smart folder is Medium and that Smart folder contains 3 jobs – J1 and J2 with Medium complexity, and J3 with Complex complexity, then the complexity of Smart folder is Medium because the number of Medium complexity jobs (2) are greater than Complex complexity jobs (1).
 
Complex
Advanced schedule: Scheduled the jobs to run based on an advanced schedule, such as:
  • Run on weekdays: Schedules the job to run on specific weekdays. For example, WEEKDAYS="1,4".
  • Run on specific days of the month: Schedules the job to run on specific days of the month. For example, DAYS="1,6,4,17,27"
 
>=4



Hadoop

The below tables describe the logic for calculating Hadoop complexity.


Hive/ Impala Job to Spark SQL

The below table describes the logic for calculating the complexity of Hive/ Impala jobs with the target as Spark SQL. It calculates the complexity based on the number of queries per job.

Hive/ Impala Job to Spark SQL Number of queries per job
1 <=5
2 <=10
3 <=20
4 <=30
5 >30


PySpark/ Spark Job to PySpark/ Spark

The below table describes the logic for calculating the complexity of PySpark/ Spark jobs with the target as PySpark/ Spark. It calculates the complexity on the basis of data frame operations.

PySpark/ Spark Job to Pyspark/ Spark Data Frame Operations
1 <=5
2 <=10
3 <=20
4 <=30
5 >30


Hive/ Impala Job to Redshift/ Snowflake

The below table describes the logic for calculating the complexity of Hive/ Impala jobs with the target as Redshift/ Snowflake. It calculates the complexity based on the number of queries per job.

Hive/ Impala Job to Redshift/ Snowflake Number of queries per job
1 <=5
2 <=10
3 <=20
4 <=30
5 >30


Spark SQL Job to Spark SQL

The below table describes the logic for calculating the complexity of Spark SQL jobs with the target as Spark SQL. It calculates the complexity based on the number of queries per job.

Spark SQL Job to Spark SQL Number of queries per job
1 <=5
2 <=10
3 <=20
4 <=30
5 >30


MapReduce/ Pig Latin Job to PySpark/ Spark

The below table describes the logic for calculating the complexity of MapReduce/ Pig Latin jobs with the target as PySpark/ Spark. It calculates the complexity on the basis of lines of code.

MapReduce/ Pig Latin Job to Pyspark/ Spark LOC
1 <=100
2 <=500
3 <=1000
4 <=2500
5 >2500


Spark SQL Job to Redshift/ Snowflake

The below table describes the logic for calculating the complexity of Spark SQL jobs with the target as Redshift/ Snowflake. It calculates the complexity based on the number of queries per job.

Spark SQL Job to Redshift/ Snowflake Number of queries per job
1 <=5
2 <=10
3 <=20
4 <=30
5 >30


Sqoop/ Nifi Job to Spark SQL

The below table describes the logic for calculating the complexity of Sqoop/ Nifi jobs with the target as Spark SQL. It calculates the complexity on the basis of transformations.

Sqoop/ Nifi Job to Spark SQL Number of transformations
1 <=5
2 <=10
3 <=20
4 <=30
5 >30

To learn more, contact our support team or write to: info@leaplogic.io

Copyright © 2025 Impetus Technologies Inc. All Rights Reserved

  • Terms of Use
  • Privacy Policy
  • License Agreement
To the top ↑ Up ↑