Configuring SSIS – Leaplogic

Configuring SSIS

This topic provides steps to configure the SSIS conversion stage.

Choose the ETL Type as SSIS.
In Input Artifacts, upload the source data via:
- Browse Files: To select the source files from the local system.
- Select From Data Source: To select the source files from the data source. To do so, follow the steps below:
  - Click Select From Data Source.
  - Choose repository.
  - Select data source.
  - Select entities.
  - Click to save the source data source.

In Target Type, select the target to which you need to transform the source scripts. The Target types are:

Databricks Notebook
Databricks Lakehouse
AWS Glue Job
Spark
Matillion ETL

Click Data Configuration.

The Input column in the table below provides the input requirements based on the Target Type selection.

Target Type	Input
Databricks Notebook	In Output Type, the default output type for the transformation is set to Python. In Source Database Connection, select the required source database to load the data such as Oracle, SQL Server, Teradata, Netezza, etc. If the database is selected, the converted code will have connection parameters related to the database. In case the database is not selected you need to manually add the database connection details to the parameter file to execute the dataset. In DBFS File Base Path, specify the DBFS (Databricks File System) location where you need to fetch the input files and store the transformed data. In other words, it is a base path for input files and output data. In Default Database, select default database for the queries for which the database type is not defined in the uploaded artifacts. Selecting Not Sure will convert only those queries whose database type is available. In Attainable Automation, select the way you want the system to calculate achievable automation for the transformation of the source scripts. Assessment-Based: Calculates the level of automation based on assessment logic. The conversion-config.json file contains a pre-defined automation percentage for each component and you can also modify it as required. Transformation-Based: Calculates the level of automation based on actual conversion. In this method, automation percentage is calculated for each component based on the used, supported, and unsupported properties.
Databricks Lakehouse
AWS Glue Job	In Output Type, the default output type for the transformation is set to Python. In Source Database Connection, select the required source database to load the data such as Oracle, SQL Server, Teradata, Netezza, etc. If the database is selected, the converted code will have connection parameters related to the database. In case the database is not selected you need to manually add the database connection details to the parameter file to execute the dataset. In Default Database, select default database for the queries for which the database type is not defined in the uploaded artifacts. Selecting Not Sure will convert only those queries whose database type is available. In Attainable Automation, select the way you want the system to calculate achievable automation for the transformation of the source scripts. Assessment-Based: Calculates the level of automation based on assessment logic. The conversion-config.json file contains a pre-defined automation percentage for each component and you can also modify it as required. Transformation-Based: Calculates the level of automation based on actual conversion. In this method, automation percentage is calculated for each component based on the used, supported, and unsupported properties. In Artifacts Location, specify the location from where you need to call external files such as parameter files, orchestration scripts. In S3 Bucket Base Path, provide the S3 storage repository path where you need to store the source and target files.
Spark	In Output Type, the default output type for the transformation is set to Python. In Source Database Connection, select the required source database to load the data such as Oracle, SQL Server, Teradata, Netezza, etc. If the database is selected, the converted code will have connection parameters related to the database. In case the database is not selected you need to manually add the database connection details to the parameter file to execute the dataset. Choose the Validation Type - None or Cluster. If the Validation type is Cluster, upload the Data source. In Default Database, select default database for the queries for which the database type is not defined in the uploaded artifacts. Selecting Not Sure will convert only those queries whose database type is available. In Attainable Automation, select the way you want the system to calculate achievable automation for the transformation of the source scripts. Assessment-Based: Calculates the level of automation based on assessment logic. The conversion-config.json file contains a pre-defined automation percentage for each component and you can also modify it as required. Transformation-Based: Calculates the level of automation based on actual conversion. In this method, automation percentage is calculated for each component based on the used, supported, and unsupported properties. In File Base Path, specify the base path for input files and output data. In Artifacts Location, specify the location from where you need to call external files such as parameter files, orchestration scripts.
Matillion ETL	In Output Type, the default output type for the transformation is set to JSON. In Data Interaction Technique, select your data interaction method. Following are the options: Snowflake - Native: Select Snowflake - Native to fetch, process, and store data in Snowflake. Snowflake: External: Select this data interaction technique to fetch input data from an external source such as Oracle, Netezza, Teradata, etc., and process that data in Snowflake, and then move the processed data or output to an external target. For instance, if the source input file contains data from any external source like Oracle, you need to select Oracle as the Source Database Connection to establish the database connection and load the input data. Then data is processed in Snowflake, and finally the processed or output data gets stored at an external target (Oracle). However, if you select Oracle as the Source Database Connection but the source input file contains data from an external source other than Oracle, such as Teradata, then by default, it will run on Snowflake. If the selected data interaction technique is Snowflake: External, you need to specify the source database of your data. In the Source Database Connection, select the database you want to connect to. This establishes the database connection to load data from external sources like Oracle, Teradata, etc. If the database is selected, the converted code will have connection parameters (in the output artifacts) related to the database. If the database is not selected, you need to add the database connection details manually to the parameter file to execute the dataset; otherwise, by default, it executes on Snowflake. In Default Database, select default database for the queries for which the database type is not defined in the uploaded artifacts. Selecting Not Sure will convert only those queries whose database type is available.

Click Save to update the changes.

An alert pop-up message appears. This message prompts you to refer your respective assessment to determine the anticipated quota deduction required when converting your scripts to target. Then click Ok.

Click to provide a preferred pipeline name.
Click to execute the pipeline. Clicking (Execute) navigates you to the listing page which shows your pipeline status as Running state. It changes its state to Success when it is completed successfully.
Click pipeline card to see reports.

To view the SSIS conversion report, visit SSIS Conversion Report

Next:

Configuring IICS