Using an on-demand in-memory SQL database in PDI

h2_logo

Anybody who finds themselves working on a client’s environment will usually face the fact that access to databases is restricted to what’s absolutely required to get the job done. The source files and target systems will be available, but creating helper tables or databases may be completely out of the question, or it may involve overcoming . . . → Read More: Using an on-demand in-memory SQL database in PDI

GeoIP lookup using MaxMind’s Country Database and Kettle

geoip_screenshot

Reliable location information is a valuable asset when looking at internet traffic. Among other uses it can be utilized for fraud prevention or help in estimating foreign market potential. This article explains how you can lookup location information for an IP address using Kettle and MaxMind’s free GeoIP database.

Edit: As Daniel Einspanjer points out, there’s a . . . → Read More: GeoIP lookup using MaxMind’s Country Database and Kettle

Custom Database Lookups using the JavaScript Step and fireToDb

fire2db

Suppose you need to do a non-trivial lookup operation.  For arguments sake, let’s assume you need to check the database for a set of regular expressions, and you need to determine for a row field how many of those expressions match. In fact, a similar scenario has been brought up in the Pentaho forums in this post. One way to solve this is to use a JavaScript step. When initialized, it queries the database for the regular expressions. For each passing row it then loops over the expressions and counts how many match, writing the result to a row field. Sounds easy enough, but how do you query the database using JavaScript? . . . → Read More: Custom Database Lookups using the JavaScript Step and fireToDb