The application consists of two repositories: a Spark-powered Java backend and a JavaScript frontend written with React and Redux. To install and deploy the application, you will need Java 8, Maven, Node/npm, yarn, and mastarm.
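Before going further, it can help to confirm that the required tools are on your PATH. The following is a minimal sketch; the function name missing_tools is hypothetical, and it checks only that each command exists, not its version (for example, it does not verify Java 8).

```shell
# Print the name of every tool (given as arguments) that is not found on PATH.
# Returns non-zero if at least one tool is missing.
missing_tools() {
  status=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
      status=1
    fi
  done
  return "$status"
}
```

For this guide, it could be called as: missing_tools java mvn node npm yarn mastarm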

User authentication is done via Auth0. You will need an Auth0 account and application to use the Data Manager.

Two databases are required for the application: MongoDB and a SQL database (PostgreSQL is recommended).

Installation and Basic Configuration

Clone both repositories (datatools-server and datatools-ui):

$ git clone
$ git clone

Copy the included configuration env templates for both the server and UI repos:

$ cp datatools-ui/configurations/default/env.yml.tmp datatools-ui/configurations/default/env.yml
$ cp datatools-server/configurations/default/env.yml.tmp datatools-server/configurations/default/env.yml
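If you re-run setup on an existing checkout, a plain cp would overwrite an env.yml you have already edited. A minimal sketch of a safer copy, assuming the configurations/default layout shown above (the function name copy_env_templates is hypothetical):

```shell
# For each repo directory given, copy configurations/default/env.yml.tmp to
# env.yml, skipping repos where env.yml already exists so local edits survive.
copy_env_templates() {
  for repo in "$@"; do
    dir="$repo/configurations/default"
    if [ -e "$dir/env.yml" ]; then
      echo "skipping $dir/env.yml (already exists)"
    else
      cp "$dir/env.yml.tmp" "$dir/env.yml" || return 1
    fi
  done
}
```

Usage: copy_env_templates datatools-ui datatools-server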

You'll then need to supply Auth0 account information (see below) and API keys for various services used in the application.

The default server.yml (for datatools-server) and settings.yml (for datatools-ui) should work out of the box, but you may want to specify alternative settings files outside of these repositories. These can be specified as a directory during datatools-ui build with mastarm:

$ mastarm build --config /path/to/configurations/dir

and as individual file arguments for datatools-server:

$ java -jar target/dt-v1.0.0.jar /path/to/env.yml /path/to/server.yml

In datatools-server:server.yml, be sure to update the paths for where the databases will be stored:

    gtfs: /path/to/gtfs/storage

Database setup

GTFS data storage

GTFS data storage is handled by whichever standard RDBMS you prefer. However, the application has been significantly tuned and optimized for PostgreSQL 9, so we highly recommend using PostgreSQL.

Once PostgreSQL is installed and the service has been started, create the database:

$ createdb gtfs_storage_db
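createdb fails if the database already exists, which matters when this step is scripted. One way to make it repeatable is to check the output of psql -lqt first. This is a sketch only: the helper name db_listed is hypothetical, and it assumes the default pipe-separated psql listing format.

```shell
# Succeeds if database "$1" appears in `psql -lqt` style output read from stdin.
# The database name is the first pipe-separated column, padded with spaces.
db_listed() {
  cut -d '|' -f 1 | sed 's/^ *//; s/ *$//' | grep -qxF "$1"
}
```

Usage: psql -lqt | db_listed gtfs_storage_db || createdb gtfs_storage_db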

Pass the URL of the database in the server's env.yml (and optionally add additional connection variables):

GTFS_DATABASE_URL: jdbc:postgresql://localhost/gtfs_storage_db

Application data storage

Application data storage (i.e., where projects, feed sources, and feed versions are stored) is handled by MongoDB. There is no need to manually initialize a database in MongoDB; it creates the database automatically on first use. Connection details for MongoDB are also set in the server's env.yml:

MONGO_URI: # defaults to localhost:27017 (MongoDB default) if empty
MONGO_DB_NAME: application_db

Setting up Auth0

Creating account and application (client)

  1. Create an Auth0 account (free).
  2. Once you've created an Auth0 account, create an application (client) in Auth0 to use with the Data Manager with the following settings:
    • enable only Username-Password-Authentication connections (i.e., turn off Google)
    • set Use Auth0 instead of the IdP to do Single Sign On to true
    • update the following application- and account-level settings to include http://localhost:9000 (or the domain where the application will be hosted):
      • Account level (Account Settings > Advanced)
        • Allowed logout URLs
      • Application level
        • Allowed Callback URLs
        • Allowed Origins (CORS)
    • keep all other default settings

Creating your first user

Create your first Auth0 user through the Auth0 web console (Users > Create User). In addition to an email and password, you'll need to supply the user with the following default application admin app_metadata (user_metadata should remain empty):

  {
    "datatools": [{
      "permissions": [{
        "type": "administer-application"
      }],
      "subscriptions": [],
      "projects": [],
      "client_id": "your-auth0-client-id"
    }]
  }

Update env.yml for server and UI repos

Update the following properties in both env.yml files to reflect the public Auth0 application settings:

AUTH0_CLIENT_ID: your-auth0-client-id

Update the following properties in datatools-server env.yml to reflect the secure Auth0 application settings:

AUTH0_SECRET: your-auth0-client-secret
AUTH0_TOKEN: your-auth0-api-token

Note: to generate the api_token, go to Documentation > Management API. After adding scopes, your token will appear in the input field.

Auth0 token generator

To allow for the creation, deletion and editing of users you must generate a token for the following scopes:

  • users:
    • read, update, create and delete
  • users_app_metadata:
    • read, update, create and delete
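Spelled out, the list above comes to eight scopes. The sketch below enumerates them, assuming the Auth0 Management API's action:resource scope naming (for example, read:users); the function name auth0_scopes is hypothetical, and the output is only a checklist to compare against what you select in the token generator.

```shell
# Print one "action:resource" scope per line for each resource given.
auth0_scopes() {
  for resource in "$@"; do
    for action in read update create delete; do
      echo "$action:$resource"
    done
  done
}
```

Usage: auth0_scopes users users_app_metadata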

Building and Running the Application

Install the JavaScript dependencies using yarn (run from the datatools-ui directory):

$ yarn

Build and deploy the frontend to S3 using the npm script (which calls mastarm):

$ npm run deploy -- s3://$S3_BUCKET_NAME/dist

Package the application using Maven (run from the datatools-server directory):

$ mvn package

Deploy the application with Java:

$ java -jar target/dt-v1.0.0.jar /path/to/env.yml /path/to/server.yml

The application backend should now be running at http://localhost:9000 (or whichever port you specified in server.yml). The backend points to the frontend assets in the S3 bucket named in server.yml at application.assets_bucket.
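Because the server takes its two configuration files as positional arguments, a mistyped path only surfaces once the JVM has started. A small pre-flight check can catch that earlier. This is a sketch; the function name check_config_files is hypothetical, and it checks only that the files are readable, not that their contents are valid.

```shell
# Verify that every configuration file passed as an argument exists and is
# readable; report the first problem on stderr and return non-zero.
check_config_files() {
  for f in "$@"; do
    if [ ! -r "$f" ]; then
      echo "missing or unreadable config file: $f" >&2
      return 1
    fi
  done
}
```

Usage: check_config_files /path/to/env.yml /path/to/server.yml && java -jar target/dt-v1.0.0.jar /path/to/env.yml /path/to/server.yml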

Configuring Modules

The application contains a number of optional modules, each with its own configuration settings and resources. To use a module, set it to enabled: true; some modules also require additional configuration.

Note: for datatools-server v3.0.0, the editor and r5_network should be disabled because they have not been refactored to handle updates to the loading of GTFS data into an RDBMS. Please use v2.0.0 or wait for releases following v3.0.0.


Editor (editor)

Enables the GTFS Editor module.

List of configuration settings

  • R5_URL (optional parameter for r5 routing in editor pattern drawing)

R5 network validation

While the application handles basic validation even without the r5_network module enabled, this module allows for enhanced accessibility- and map-based validation.

List of configuration settings

  • OSM_VEX - set in datatools-server:env.yml; the validator requires the URL of a running instance of the OSM vex server.

Sign Configurations

Enables the sign configuration module.

List of configuration settings

  • use_extension - extension key to use for sign configuration
  • url
  • Requires gtfsapi module


Alerts

Enables the real-time alerts module.

List of configuration settings

  • use_extension - extension key to use for alerts
  • url
  • Requires gtfsapi module

GTFS+ (gtfsplus)

Enables the GTFS+ module.


Deployment

Enables the OTP automated deployments module.

List of configuration settings

  • OSM_VEX - set in datatools-server:env.yml; the URL of a running instance of the OSM vex server.


GTFS API (gtfsapi)

Supports other modules with API endpoints for getting GTFS entities (routes, stops, etc.)

List of configuration settings

  • load_on_fetch - whether to load GTFS feeds when new feed is detected
  • load_on_startup - whether to load GTFS feeds on application startup
  • use_extension - which extension to connect to
  • update_frequency - update frequency for GTFS API (in seconds). Comment out to disable updates.

Configuring Extensions

The application supports integration with several third-party resources for retrieving feeds.

Integration with Transitland

Ensure that the extensions:transitland:enabled flag is set to true in config.yml:

    enabled: true

Integration with TransitFeeds

Ensure that the extensions:transitfeeds:enabled flag is set to true in config.yml, and provide your API key:

    enabled: true
    key: your-api-key