Databricks Notebook
- In Data Interaction Technique, select your data interaction method. The following options are available:
- Databricks-Native: Select Databricks-Native to fetch, process, and store data in the Databricks Lakehouse.
- Databricks: Unity Catalog: Select Databricks: Unity Catalog to access data through Databricks Unity Catalog, which serves as a centralized metadata repository; data is fetched, processed, and stored within the catalog.
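Unity Catalog addresses tables through a three-level namespace (`catalog.schema.table`). As an illustration only (the catalog, schema, and table names below are hypothetical placeholders, not values the tool produces), a query against a Unity Catalog table might be assembled like this:

```python
# Hypothetical sketch: building a fully qualified Unity Catalog identifier
# (catalog.schema.table). "main", "sales", and "orders" are placeholders.

def qualified_name(catalog: str, schema: str, table: str) -> str:
    """Return the three-level Unity Catalog table identifier."""
    return f"{catalog}.{schema}.{table}"

table = qualified_name("main", "sales", "orders")
query = f"SELECT * FROM {table}"
# In a Databricks notebook this would typically be executed as:
#   df = spark.sql(query)
```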
- Databricks: External: Select this data interaction technique to fetch input data from an external source such as Oracle, Netezza, or Teradata, process it in Databricks, and then move the processed output to an external target. For instance, if the source input file contains data from an external source like Oracle, select Oracle as the Source Database Connection to establish the database connection and load the input data; the data is then processed in Databricks, and the processed output is stored at the external target (Oracle). However, if you select Oracle as the Source Database Connection while the source input file contains data from a different external source, such as Teradata, the job runs on Databricks by default.
- If the selected data interaction technique is Databricks: External, specify the source database of your data. In Source Database Connection, select the database you want to connect to; this establishes the connection used to load data from external sources such as Oracle or Teradata. If a database is selected, the converted code includes the related connection parameters in the output artifacts. If no database is selected, you must add the connection details to the parameter file manually to execute the dataset; otherwise, it executes on Databricks by default.
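The exact connection parameters written to the parameter file are product-specific, but conceptually they resemble standard JDBC options. A minimal sketch, assuming an Oracle source (host, port, service name, and credentials below are placeholders, and the parameter names in the generated artifacts may differ):

```python
# Hypothetical sketch of Oracle JDBC connection options such as a converted
# job might read from its parameter file. All values are placeholders.

def oracle_jdbc_options(host: str, port: int, service: str,
                        user: str, password: str) -> dict:
    """Assemble JDBC options for a Spark read from an external Oracle source."""
    return {
        "url": f"jdbc:oracle:thin:@//{host}:{port}/{service}",
        "driver": "oracle.jdbc.OracleDriver",
        "user": user,
        "password": password,
    }

opts = oracle_jdbc_options("db-host", 1521, "ORCLPDB1", "app_user", "secret")
# In Databricks this would typically feed a JDBC read, for example:
#   df = spark.read.format("jdbc").options(**opts).option("dbtable", "SRC.T1").load()
```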
- Convert Orchestration Jobs to Databricks Workflows converts Matillion orchestration jobs to their Databricks Workflows equivalents and generates the corresponding JSON artifacts. This feature is enabled by default.
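The generated JSON follows the Databricks Jobs/Workflows schema, in which each orchestration step becomes a task with explicit dependencies. A simplified sketch of what such an artifact can look like (the job name, task keys, and notebook paths are illustrative, and the fields the tool actually emits may differ):

```python
import json

# Illustrative (not tool-generated) Databricks Workflows job definition:
# two tasks, where "transform" depends on "extract". Paths are placeholders.
workflow = {
    "name": "converted_orchestration_job",
    "tasks": [
        {
            "task_key": "extract",
            "notebook_task": {"notebook_path": "/Converted/extract"},
        },
        {
            "task_key": "transform",
            "notebook_task": {"notebook_path": "/Converted/transform"},
            "depends_on": [{"task_key": "extract"}],
        },
    ],
}

artifact = json.dumps(workflow, indent=2)  # the JSON artifact as written to disk
```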
- In Validation Type, select None or Cluster as the mode of validation.
- None: Select this option if you do not want to perform any validation.
- Cluster: Select this option to perform syntax validation of the transformed queries.
- In Data Source, upload the corresponding data source. To successfully perform syntax validation of the transformed queries, ensure that the required input tables already exist on the target side and that all user-defined functions (UDFs) are registered on the target data source.
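As a hedged sketch of that preparation step (the table name, columns, and UDF below are hypothetical; on Databricks the statements would actually be run via `spark.sql` and `spark.udf.register`):

```python
# Hypothetical pre-validation setup. build_ddl only assembles the statement;
# on Databricks it would be executed with spark.sql(ddl).

def build_ddl(table: str, columns: dict) -> str:
    """Build a CREATE TABLE IF NOT EXISTS statement for a required input table."""
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns.items())
    return f"CREATE TABLE IF NOT EXISTS {table} ({cols})"

ddl = build_ddl("sales.orders", {"order_id": "BIGINT", "amount": "DECIMAL(10,2)"})

# Any UDF referenced by the transformed queries must also be registered
# before validation, for example:
#   spark.udf.register("normalize_amount", lambda x: round(x, 2))
```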