Flume: sending data via stream
It is possible to capture streaming data in HDFS files. A tool to do this is Flume. The idea is that we have 3 elements: sources that provide a stream,…
It is possible to capture streaming data in HDFS files. A tool to do this is Flume. The idea is that we have 3 elements: sources that provide a stream,…
It is possible to partition the tables in Hive. Remember the data are stored in files. So we expect the files to be partitioned. This is accomplished by a split…
Avro files are binary files that contain data and the description of the files. Thereby it is a very interesting file format. One may send this file to any application…
As we know, we may store table definitions in the metastore. These table definitions then refer to a location where the data are stored. The format of the data might…
In Hive, we see a situation where a table definition is stored in a metastore. This table definition is linked to a directory where the data are stored. It is…
In this little note, I want to show three different ways to create a table on Hive. The first one starts with a file on HDFS that is available and…
In this small note, it is described how an HDFS file can be stored in a Hive context. In it stored in a Hive context, it can be accessed from…
The successor to OWB is the Oracle Data Integrator. This tool has more functionalities than OWB. Next to that, it has an interface that more or less steers the user…
I know that Oracle Warehouse Builder (OWB) is at end of life. On the other hand, I encounter OWB quite often and it is interesting to see how it works.…
Only this weekend I downloaded a Docker package from https://docs.docker.com/docker-for-windows. This package allows you to run very small light weight containers on your server than act as components to perform…