NAME
ETL::Yertl - ETL with a Shell
SYNOPSIS
### On a shell...
# Convert file to Yertl's format
$ yfrom csv file.csv >work.yml
$ yfrom json file.json >work.yml
# Mask document
$ ymask 'field/inner' work.yml >masked.yml
# Convert file to output format
$ yto csv work.yml
$ yto json work.yml
# Parse HTTP logs into documents
$ ygrok '%{LOG.HTTP_COMMON}' httpd.log
# Read data from a database
$ ysql db_name 'SELECT * FROM employee'
# Write data to a database
$ ysql db_name 'INSERT INTO employee ( id, name ) VALUES ( $.id, $.name )'
### In Perl...
use ETL::Yertl;
# XXX: To do: Perl API
DESCRIPTION
ETL::Yertl is an ETL (Extract, Transform, Load) toolkit for the shell. It is designed to accept data in multiple formats (CSV, JSON), manipulate it using simple tools, and then convert it to an output format.
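The extract, transform, and convert steps described above can be sketched as one short script. This is only an illustration built from the command forms shown in the SYNOPSIS: the sample data, the `workdir` variable, and the single-field mask `'name'` are assumptions made for the example, not documented behavior.

```shell
# Hypothetical sample data, for illustration only
workdir=$(mktemp -d)
printf 'id,name,email\n1,Alice,alice@example.com\n2,Bob,bob@example.com\n' \
    > "$workdir/employees.csv"

# Run the pipeline only when the Yertl tools are installed
if command -v yfrom >/dev/null 2>&1 \
    && command -v ymask >/dev/null 2>&1 \
    && command -v yto >/dev/null 2>&1; then
    # Extract: convert the CSV rows into Yertl documents
    yfrom csv "$workdir/employees.csv" > "$workdir/work.yml"
    # Transform: keep only the "name" field (assumed mask syntax)
    ymask 'name' "$workdir/work.yml" > "$workdir/masked.yml"
    # Convert: write the masked documents out as JSON
    yto json "$workdir/masked.yml"
else
    echo "Yertl tools not found on PATH; skipping pipeline" >&2
fi
```

Each step writes an intermediate file, so the output of any stage can be inspected before the next one runs.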
Yertl will have tools for:
- Extracting data from databases (MySQL, Postgres, MongoDB)
- Loading data into databases
- Extracting data from web services
- Writing data to web services
- Distributing data through messaging APIs (ZeroMQ)
SEE ALSO
- http://preaction.me/yertl
-
The Yertl home page.
Yertl Tools
- yfrom
-
Convert incoming data (CSV, JSON) to Yertl documents.
- yto
-
Convert Yertl documents into another format (CSV, JSON).
- ygrok
-
Parse lines of text into Yertl documents.
- ysql
-
Read/write documents from SQL databases.
- ymask
-
Filter documents with a mask, letting only matching fields through.
- yq
-
A powerful mini-language for munging and filtering.
Other Tools
Here are some other tools that can be used with Yertl:
- recs (App::RecordStream)
-
A set of tools for manipulating JSON (contrast with Yertl's YAML). For interoperability, set the YERTL_FORMAT environment variable to "json".
- jq
-
A filter for JSON documents. The inspiration for yq. For interoperability, set the YERTL_FORMAT environment variable to "json".
- jt
-
JSON Transformer. Allows multiple ways of manipulating JSON, including JSONPath. For interoperability, set the YERTL_FORMAT environment variable to "json".
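As a rough sketch of the interoperability setting mentioned above, the following sets YERTL_FORMAT to "json" and pipes Yertl output into jq. The sample data, the `workdir` variable, and the `.name` filter are hypothetical, and the example assumes that `yfrom` writes its documents to standard output when no redirect is given.

```shell
# Hypothetical sample data, for illustration only
workdir=$(mktemp -d)
printf 'id,name\n1,Alice\n2,Bob\n' > "$workdir/employees.csv"

# Ask the Yertl tools to use JSON instead of YAML as their document format
export YERTL_FORMAT=json

# Run only when both yfrom and jq are installed
if command -v yfrom >/dev/null 2>&1 && command -v jq >/dev/null 2>&1; then
    # jq reads the stream of JSON documents and prints each "name" value
    yfrom csv "$workdir/employees.csv" | jq '.name'
else
    echo "yfrom or jq not found on PATH; skipping example" >&2
fi
```

Exporting the variable once at the top of a script keeps every later Yertl command in the same format, which is convenient when several JSON tools are mixed into one pipeline.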