CPAN Testers Has a New API
As part of the MetaCPAN hackathon, meta::hack, I was invited to work on the CPAN Testers integration. CPAN Testers is a community of CPAN users who send in test reports for CPAN modules as they are uploaded. MetaCPAN adds a summary of those test reports to every CPAN distribution to help you determine which module you'd most like to use. For quite a few months, this integration was broken, and the nature of the current integration (a SQLite database) means it is not as generally useful as it could be.
So, I decided that the best way to improve the CPAN Testers / MetaCPAN integration was to build a new CPAN Testers API. This API uses the CPAN Testers schema to expose CPAN Testers data using a JSON API. This API is built using the Mojolicious web framework, and an OpenAPI specification (using Mojolicious::Plugin::OpenAPI.
Documenting the API using the OpenAPI language automatically generates the application routes, and makes input and output validation very simple (all handled by Mojolicious::Plugin::OpenAPI). Having the API specification be a simple JSON document also makes it possible to automatically generate documentation and display it using Swagger-UI. The Swagger-UI tool even allows users to try out the API right in the browser, displays the input parameters and their documentation, and gives you real output based on the request you input.
Right now, the CPAN Testers API has access to only some of the data CPAN Testers has:
- The /release data collects test report summaries for all of CPAN, or you can limit releases by dist or releases by author (good for making CPAN dashboards).
- The /upload data shows all uploads to CPAN, which you can limit to uploads by dist or uploads by author.
These APIs can be easily used by any HTTP client, and since they return
JSON, can be easily manipulated with simple tools like
jq
or
ETL::Yertl:
# Get all test data for every release of the CPAN-Testers-API distribution
curl 'http://api.cpantesters.org/v1/release/dist/CPAN-Testers-API'
# Get all Statocles release test data updated since the beginning of 2016
curl 'http://api.cpantesters.org/v1/release/dist/Statocles?since=2016-01-01T00:00:00'
# Select all of my releases where there was a failure report
curl http://api.cpantesters.org/v1/release/author/PREACTION | jq '.[] | select( .fail > 1 )'
# Build my own database for personal use!
sqlite3 data.db 'CREATE TABLE release ( dist VARCHAR, version VARCHAR, pass INTEGER, fail INTEGER, na INTEGER, unknown INTEGER )'
curl http://api.cpantesters.org/v1/release/author/PREACTION | yfrom json | yq '.[]' | ysql --dsn dbi:SQLite:data.db --insert release
# ... and then find failed dists
ysql --dsn dbi:SQLite:data.db --select release --where 'fail > 100'
In addition to these standard JSON API calls, the new CPAN Testers API also provides a WebSocket API for getting push notifications of important CPAN Testers events. With just a WebSocket and a JSON parser, you can subscribe to a WebSocket feed of new uploads to CPAN. In the future, all internal CPAN Testers communication will be done through the API using the Mercury message broker on the backend, allowing you to use simple WebSockets to build your own data alerts and updates.
Any WebSocket client can connect to get updates, like so:
# feed.pl from https://github.com/cpan-testers/cpantesters-api/blob/master/eg/feed.pl
$ perl feed.pl http://api.cpantesters.org/v1/upload
Got upload: Mercury (0.011) by PREACTION
Once the message is send on the websocket, the archive is available on CPAN Testers CPAN mirror (http://cpan.cpantesters.org), so you could immediately start testing it and sending in reports (hint hint)!
This is just the start of what this API could be made to do. If you'd like to use some CPAN Testers data, but it is not yet available from the API, check our contributing guide for how to add new features, and if you have questions here's how to get in contact.
If you'd like to help, but don't know how, here's some possible features that could be implemented to make the API easier to use:
- You can make the post-filters I use
jq
and ETL::Yertl for into API query parameters so that I can directly request only the releases with more than 10 failure reports - You could make a special filter to only include the latest version of the dist
- You could add more endpoints to filter the data:
/release/dist/{dist}/{version}
and/upload/dist/{dist}/{version}
to only get a single version/release/dist/{dist}/latest
and/upload/dist/{dist}/latest
to get the latest version
As always thanks to all the sponsors of CPAN Testers, and specifically to the sponsors of meta::hack for providing the means to get this API started.