Adding Prefix to Metadata Created from Excel File

CloverDX Customer Portal
Forum
Adding Prefix to Metadata Created from Excel File

yichuansancun

January 16, 2015 00:00

Answered

I create metadata by reading the headline from an excel file, however, I want to add prefix to the headline. Is there a way to accomplish this? Right now, I will add prefix in the excel file then create metadata from it.

Thanks,
Perri

Comments 5

imriskal

January 19, 2015 12:03
0

Comment actions Permalink
Hi Perri,

You can of course edit the result metadata as the last step of the metadata extraction process. In case this is not the right solution for you, can you please give an example of the desired input and output?

Thanks.
yichuansancun

January 19, 2015 18:55
0

Comment actions Permalink
I attached a graph where I have this need. Take a look at the metadata: Input_LegalName and Input_PrimaryName. These two meta data are derived from the same excel file. However, I need to separate LegalName and PrimaryName then merge them back to do further transformation. Adding a prefix to one of the metadata can be tedious if were to do it manually. Is this something can be achieved within Clover?
- UL_Name_n_Contact_P2.grf
imriskal

January 20, 2015 14:07
0

Comment actions Permalink
I am affraid there is no direct way of achieving this in CloverETL Designer. But there is one indirect way.

You can externalize the metadata. The result is an XML file (*.fmt). This file can be read by UniversalDataReader as one string value and used as an argument of replace function in CTL.
```
$out.0.field1=replace($in.0.field1,"app","LEG");
```
The result of this function can be saved into another fmt file and used by other graphs.

By the way, looking into your graph, I recommend replacing ExtHashJoin components with ExtMergeJoins which are more suitable for joining two inputs of similar size.

I hope this helps
yichuansancun

January 20, 2015 20:01
0

Comment actions Permalink
Thanks! What is the diff between merge join and ExtHashJoin? Merge join requires same data structure?
imriskal

January 21, 2015 08:55
0

Comment actions Permalink
ExtHashJoin

Number of records on a slave port should be small enough because all slave records are stored in memory during the whole joining process. This speeds up the process significantly but it also has increased memory requirements. A common example is let's say 10 million records on the master port and 10 thousand records on the slave port. 10 thousand records can be stored in memory which causes the 10 million master records to be processed faster than in case of ExtMergeJoin. An advantage is also the fact that the input records do not have to be sorted first. You can find more information on our ExtHashJoin documentation page.

ExtMergeJoin

Both input sets can be larger than your memory because none of them is held in memory as a whole. This fact, on the other hand, enforces that both inputs have to be sorted first. But it has an advantage as well because sorted inputs can be joined much faster than unsorted inputs. More information is again available on ExtMergeJoin documentation page.

Please sign in to leave a comment.

Quick links

Access my products

SUPPORT & SERVICES

Community

RESOURCES