Customer Portal

Join by substring match

Comments 1

  • Avatar
    dpavlis
    0
    Comment actions Permalink
    You are perfectly right. CloverETL, being LGPL, allows you to take existing component (its code) and create a new one.

    Said that, I would be careful how you implement the join if you want it to perform well. You will need some kind of pre-generated key to bring together groups of records which may potentially match/join and then just refine the pairing based on substring match.
    You better start with something like ApproximativeJoin which does exactly this by using edit distance for doing the fine-pairing.

    What exactly is your use-case for which you need this functionality ? It is a bit uncommon in ETL arena.

Please sign in to leave a comment.