Skip to content

Tool for automating Git-based Quality Standards, inspired by the Transifex Engineering Manifesto.

License

Notifications You must be signed in to change notification settings

transifex/totem

Repository files navigation

Totem

Totem is a Health Check library that checks whether or not certain quality standards are followed on Pull Requests or local Git repositories.

It is inspired by the Transifex Engineering Manifesto (TEM), a document that defines the Quality Standards used in Transifex. Totem was created as an automated way to ensure high quality in Git-related guidelines described in the TEM.

Currently it supports Github Pull Requests only, but can also be used locally.

Features

  • Multiple quality checks on Pull Requests
  • Multiple quality checks on local Git repositories
  • Comes with pre-commit support, and can also be added as a pre-push Git hook
  • Configurable: you can only enable the checks you want, and define the configuration parameters for each check
  • Detailed report in the console, makes it easy to spot issues
  • Compact summary shown as a comment created on the Pull Request, with configurable content (disabled by default)
  • Ability to disable all checks on a specific commit or a specific line of a commit message body

Checks

Totem supports the following checks:

  • branch_name: the name of the branch must follow a certain regex pattern
  • pr_title: the title of the Pull Request must follow a certain regex pattern
  • pr_body_checklist: the body of the Pull Request must not contain any unfinished checklist item
  • pr_body_excludes: the body of the Pull Request must not contain certain strings
  • pr_body_includes: the body of the Pull Request must contain certain strings
  • commit_message: the message of each commit must follow these guidelines:
    • subject:
      • has a minimum and maximum allowed length
      • must follow a certain regex pattern, e.g. start with an uppercase character and not end with a '.'
    • body
      • if there is a body, each line has a maximum allowed length
      • if the number of total changed lines in a commit is above a certain threshold, a body must be present and must have a minimum number of lines

With a custom configuration, you can define which checks will be executed. All of the checks have at least a certain level of configuration.

Failure level

If a check is executed but fails to pass, it can either provide a failed status check (exit status = 1) or simply print out a warning. The former can be used in order to prevent a Pull Request from being merged, a local commit to be completed, or local changes to be pushed to the remote, until all Totem checks are fixed. The latter is mainly used as a sign that something might not be right, and can be useful when comitting in or pushing from a local repo, or when reviewing a Pull Request. The warning level is necessary because in some repos a rule may not be always applicable, so it should be judged on a case-by-case basis.

Manual ignore

The configuration file defines which checks will run.

However, if you want to disable the checks that run on a specific commit, you can use one of two special flags ([!totem], #!totem) as follows:

Ignoring checks on the whole commit

The following commit message will pass, although the subject and the first line of the body are both too long. The reason it doesn't fail is the global [!totem] flag, which ignores all errors on a particual commit message.

Note that this flag needs to be on the message body; it does nothing if added in the message subject.

this is the subject and its too long, also doesn't start with a capital letter

This line is also way too long but you know what, it doesn't matter because the message
includes this special (global) ignore flag: [!totem]

Ignoring checks on a specific body line

The following commit message will pass, although the first line of the message body is too long. The reason it does not fail is the #!totem flag.

This is the subject

This line includes a_very_long_string_that_cannot_be_broken_down_into_smaller_chunks #!totem
but this line is just fine and should be checked.

URLs

Please note that URLs are handled as special cases, so you don't need to add an ignore flag if a URL makes a line too long; checks are automatically ignored if a line contains a URL.

Supported formats are: http://..., https://..., ftp://....

Installation

Totem can be installed by running pip install totem. It requires Python 3.6.0+.

Running on a PR

Command line

Totem provides a console command and requires only the URL of the pull request to check. By default, it will attempt to read the .totem.yml file on the repo as configuration. If it is not found, it defaults to ./contrib/config/sample.yml on the Totem repo.

totem -p https://www.github.com/:owner/:repo/pulls/:number

Example:

totem -p https://github.com/transifex/totem/pull/17

NOTE: the default configuration will not create a comment on the Pull Request being checked. If you use a custom config, you can enable the comment feature.

A custom config can be provided and supports a lot of options.

totem -p https://www.github.com/:owner/:repo/pulls/:number -c ./totem_config.yml

The project includes a sample configuration file, which can be found at ./contrib/config/sample.yml.

CI

When running from a CI service, you need to retrieve the pull request URL from the environment variables the service provides. Also, you can set the URL of the CI build page, in which case a link appears on the PR comment that the Totem creates.

For example, you make a call like this:

totem --pr-url "<pull_request_url>" --config-file ".totem.yml" --details-url "<ci_service_build_page>"

where <pull_request_url> is a variable given from the CI service. For example, for CircleCI it's $CIRCLE_PULL_REQUEST.

Github authentication

In order to run Totem on pull requests of private projects, as well as in order to be able to enable reporting in PR comments, the tool needs to be authenticated when contacting Github. In order to do that, you need to add an environment variable with the Github access token to your CI service: GITHUB_ACCESS_TOKEN=<my_super_secret_token>

You also need to authorize add a deploy key on the CI service. For example, on Circle CI go to the project Settings > Permissions > Checkout SSH keys and click on [Add Deploy key].

An example of a complete setup on a CI, together with GitHub authentication, looks like this:

jobs:
  totem:
    docker:
    - image: python:alpine
      environment:
    steps:
    - checkout
    - add_ssh_keys:
        fingerprint:
          # a public deploy key of the current repository on GitHub, something like:
          "8a:32:b1:d4:24:12:c4:33:8f:ac:0f:37:c8:84:c4:cc"
    - run:
        name: Install git/openssh-client and add github to the list of known hosts
        command: apk add git openssh-client && mkdir ~/.ssh && ssh-keyscan github.com > ~/.ssh/known_hosts
    - run:
        name: Install totem
        command: pip install totem
    - run:
        name: Run Totem
        command: totem --pr-url "<pull_request_url>" --config-file ".totem.yml" --details-url "<ci_service_build_page>"

CircleCI

Keep in mind that because of a bug in CircleCI, sometimes the $CIRCLE_PULL_REQUEST variable is empty. If the pull request argument in the totem CLI command is empty, Totem runs in local mode because there is no pull request to check. This can create false positives (that everything is OK when in fact it's not). Therefore, in order to run Totem without the false positives, the following workaround can be used:

if [[ "$CIRCLE_BRANCH" == "devel" || "$CIRCLE_BRANCH" == "master" ]]; then
  echo "Totem is disabled on branch '$CIRCLE_BRANCH'. Won't execute."
else
  if [[ "$CIRCLE_PULL_REQUEST" == "" ]]; then
    echo "\$CIRCLE_PULL_REQUEST is empty. It's probably due to CircleCI's bug"
    echo "(https://discuss.circleci.com/t/circle-pull-request-not-being-set/14409)."
    echo "Please rerun the workflow until the PR variable is populated by CircleCI."
    exit 1
  else
    totem --pr-url "$CIRCLE_PULL_REQUEST" --config-file ".totem.yml" --details-url "$CIRCLE_BUILD_URL"
  fi
fi

The script above does not run Totem if the current branch is devel or master, which means that it's running on a merge commit. Of course, these are just sample branches and may differ from the base branches you have in your workflow.

Running on a local repository

You can call the command without any arguments. In this case it reads the .totem.yml file on the repo as configuration. If this file does not exist, the tool cannot run.

totem

You can also define a custom config file to use.

totem -c <file>

The local mode of Totem runs only a subset of the available (and enabled) checks:

  • branch_name: the name of the branch must follow a certain regex pattern
  • commit_message: the message of each commit must follow certain guidelines

The reason is that the rest of the checks require a Pull Request, which is not available locally.

Pre-commit hook

In order to use it as a pre-commit hook, add the following in your .pre-commit-config.yaml file.

- repo: https://github.com/transifex/totem/
  rev: master
  hooks:
  - id: totem

Make sure you follow the instructions given in pre-commit on how to install and use the hooks. As soon as you do that, Totem will run every time you attempt to create a new commit and will abort the command in case any checks fail. Note that it will not abort in case of warnings.

Pre-push hook

In order to use it as a pre-push hook, add the following in the .git/hooks/pre-push file:

#!/bin/sh
totem

Note: Make sure the file is executable (chmod +x .git/hooks/pre-push).

This way, totem will run every time you call git push, and will abort the command in case any checks fail. Note that it will not abort in case of warnings.

Configuration

The following is a sample of configuration that contains all available options.

settings:
  pr_comment_report:
    enabled: True
    show_empty_sections: True
    show_message: True
    show_details: True
  console_report:
    show_empty_sections: True
    show_message: True
    show_details: True
    show_successful: True
  local_console_report:
    show_empty_sections: False
    show_message: True
    show_details: True
    show_successful: False
checks:
  branch_name:
    pattern: ^[\w\d\-]+$
    pattern_descr: Branch name must only include lowercase characters, digits and dashes
    failure_level: warning
  pr_title:
    pattern: ^[A-Z].+$
    pattern_descr: PR title must start with a capital letter
    failure_level: warning
  pr_body_checklist:
    failure_level: error
  pr_body_excludes:
    patterns:
    - excl1
    - excl2
    failure_level: error
  pr_body_includes:
    patterns:
    - incl1
    - incl2
    failure_level: error
  commit_message:
    subject:
      min_length: 10
      max_length: 50
      pattern: ^[A-Z].+(?<!\.)$
      pattern_descr: Commit message subject must start with a capital letter and not
        finish with a dot
    body:
      max_line_length: 72
      smart_require:
        min_changes: 100
        min_body_lines: 1
    failure_level: error

Note that the checks part supports both dictionaries and lists. The example above uses a dictionary, while the one below uses a list.

The list structure allows you to include more multiple checks of the same type, which can be convenient in certain cases.

Here is a sample of just the checks part that contains 2 different pr_body_excludes checks, one of which is reported as an error and the other as a warning.

checks:
  - type: branch_name
    pattern: ^[\w\d\-]+$
    pattern_descr: Branch name must include lowercase characters, digits and dashes, e.g. `update-something`
    failure_level: error
  - type: pr_body_excludes
    patterns:
    - error_pattern1
    - error_pattern2
    failure_level: error
  - type: pr_body_excludes
    patterns:
    - warning_pattern1
    - warning_pattern2
    failure_level: warning

Sample report

This is how a report created as a comment on the pull request may look like:

Checking if this PR follows the expected quality standards. Powered by totem.

failures warnings successful
2 1 3

‼️ Failures (2) - These need to be fixed!

  • pr_body_includes Required strings in PR body are missing: "Problem and/or solution"
  • commit_message Found 2 commit message(s) that do not follow the expected format (errors: "smart_body_size", "subject_pattern", "subject_length") errors:

✴️ Warnings (1) - Fixing these may not be applicable, please review them case by case

  • pr_title PR title "Fix things" does not match pattern: "^XX-[0-9]+ .+$". Explanation: PR title must start with the Jira ID

Successful (3) - Good job on these!

  • branch_name
  • pr_body_checklist
  • pr_body_excludes

About

Tool for automating Git-based Quality Standards, inspired by the Transifex Engineering Manifesto.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published