Transaction Labelling Tool

About the project:

We had access to all client transactions, in the form of bank statements, but the transactions were not labelled; so we had to devise a plan to label the transactions. I developed a tool that utilised transaction embeddings (using an implementation of BERT) and clustering algorithms to label massive amounts of transactions on bank statements. Previously it would take quite a while to label bank statements that had hundreds of transactions, but with this tool, we could label similar transactions within seconds.


Technology used:

BERT, Python, SKLearn