Ansible for Developers

This repository will allow you to run any of the major stages of the Alliance data pipeline on an AWS instance using Docker images from AWS ECR, code from GitHub branches, or a combination of both.

Launching an AWS instance also creates a publicly accessible URL for demonstration and testing purposes (e.g. running a test version of the Alliance website for curator review).

Additional requirements before using this repository.

Please use Docker version 20 or earlier. We've had issues with Docker version 21, which we hope will be resolved in the near future.
Contact someone from the DevOps team in order to:
- Obtain access to EC2 servers running on us-east-1 (requires your IP address).
- Obtain access to AWS ECR for our Docker images.
- Obtain access to the AnsibleDevelopers AWS secret for the Ansible vault.
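
The Docker version requirement above can be checked from a terminal before going any further. The following is a minimal sketch and not part of this repository: the function name docker_version_ok is hypothetical, and it only compares the major version number.

```shell
# Hypothetical helper (not part of this repository): succeed only when the
# major component of a Docker version string is 20 or lower.
docker_version_ok() {
  dvo_major="${1%%.*}"          # "20.10.7" -> "20"
  case "$dvo_major" in
    ''|*[!0-9]*) return 2 ;;    # empty or non-numeric input
  esac
  [ "$dvo_major" -le 20 ]
}

# Example (requires a local Docker client):
#   docker_version_ok "$(docker version --format '{{.Client.Version}}')" \
#     || echo "Docker is newer than version 20" >&2
```

The function returns 0 for an accepted version, 1 for a newer one, and 2 when the input cannot be parsed.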

Clone the repository.

Clone agr_ansible_developers to your local machine.

Copy the template configuration files.

Create your own directory in the environments folder.
- This directory can be committed to GitHub for future use.
Copy the file main.yml from the environments/template directory to your newly created directory.
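
The clone and copy steps can be sketched as a small shell helper. This is an illustration only: create_env is a hypothetical name, olin is the example value used later in this README, and the clone URL in the comment is inferred from the repository name alliance-genome/agr_ansible_developers.

```shell
# Hypothetical helper: create environments/<name>/main.yml from the template.
# Usage: create_env REPO_ROOT ENV_NAME
create_env() {
  ce_template="$1/environments/template/main.yml"
  ce_target_dir="$1/environments/$2"

  if [ -z "$2" ]; then
    echo "usage: create_env REPO_ROOT ENV_NAME" >&2
    return 1
  fi
  if [ ! -f "$ce_template" ]; then
    echo "template not found: $ce_template" >&2
    return 1
  fi
  # Never clobber an environment that has already been edited.
  if [ -e "$ce_target_dir/main.yml" ]; then
    echo "refusing to overwrite $ce_target_dir/main.yml" >&2
    return 1
  fi

  mkdir -p "$ce_target_dir" && cp "$ce_template" "$ce_target_dir/main.yml"
}

# Example, run from the directory that will contain your clone:
#   git clone https://github.com/alliance-genome/agr_ansible_developers.git
#   create_env agr_ansible_developers olin
```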

Edit the configuration files before running.

main.yml

In your newly created directory, edit the main.yml file.
The NET value is used for the DNS name of your server. Please change it from main to another value, e.g. olin.
- The suffix -dev is appended to this value.
- The resulting address has the form olin-dev.alliancegenome.org.
- Once launched, this name will appear in the #aws channel of Slack along with your new server's IP address.
The ALLIANCE_RELEASE value is used for the data snapshot from the FMS. Please change it to the appropriate release depending on your desired source data.
For the remaining values, most of the configuration options allow the pipeline to be run using either code from GitHub or images from AWS ECR.
- Please choose the appropriate configuration values based on the code you are testing.
- If assistance is required, please post a message in the #devops channel on Slack and we'll be happy to help.
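
To make the NET naming rule concrete, here is a small sketch that prints the address a given NET value would produce. The helper name dev_hostname is hypothetical, and rejecting main is our reading of the instruction above rather than something the playbooks are known to enforce.

```shell
# Hypothetical helper: print the DNS name that a given NET value produces.
# Rejects an empty value and the template default "main".
dev_hostname() {
  case "$1" in
    ''|main)
      echo "choose a NET value other than 'main'" >&2
      return 1
      ;;
  esac
  printf '%s-dev.alliancegenome.org\n' "$1"
}

dev_hostname olin   # prints olin-dev.alliancegenome.org
```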

Makefile

Before running Ansible, edit the Makefile variable ENV at the top of the file to match the name of the folder you've created in environments.
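
A quick way to confirm that ENV matches a folder in environments is sketched below. Both function names are hypothetical, and the sketch assumes ENV is set with a plain one-line assignment (ENV=olin, ENV = olin or ENV := olin) with no trailing comment; the actual Makefile may be laid out differently.

```shell
# Hypothetical helper: print the value assigned to ENV in a Makefile.
# Assumes a one-line assignment such as "ENV=olin" or "ENV := olin".
makefile_env() {
  sed -n 's/^ENV[[:space:]]*:\{0,1\}=[[:space:]]*//p' "$1" | head -n 1
}

# Succeed only when environments/<ENV>/main.yml exists under REPO_ROOT.
# Usage: check_env_dir REPO_ROOT
check_env_dir() {
  ced_env="$(makefile_env "$1/Makefile")"
  [ -n "$ced_env" ] && [ -f "$1/environments/$ced_env/main.yml" ]
}

# Example, run from the root of your clone:
#   check_env_dir . || echo "ENV does not match a folder in environments" >&2
```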

Launch the AWS EC2 instance.

Run the command make launch from the root directory to launch your AWS instance.
Check the Slack #aws channel for your server IP address and URL.
Logs are viewable online:
- http://{YOUR_NET_VALUE}-dev.alliancegenome.org:5601/app/logtrail
- Click the All Systems button at the bottom of the LogTrail screen to view output from different Docker containers on your server.
- After launching new services, the browser window may need to be refreshed before the output appears in the All Systems dropdown.

Launch additional software on your AWS EC2 instance.

The following commands are available (use make before each command):

| Make Command | Description |
| --- | --- |
| `launch` | Launch the AWS EC2 instance. |
| `terminate` | Terminate the AWS EC2 instance. |
| `startdb` | Start the Neo4J database. Required before most other steps. |
| `stopdb` | Stop the Neo4J database. This also removes the container. |
| `restartdb` | Restart the Neo4J database. This removes the container and creates a new one. |
| `startcurationdb` | Start the curation database. This database must be started before running the indexer. |
| `stopcurationdb` | Stop the curation database. |
| `restartcurationdb` | Restart the curation database. |
| `run_loader` | Run the loader. |
| `run_loader_tests` | Run the loader's integrated tests. This requires a populated Neo4J database. |
| `run_file_generator` | Run the file generator. Will attempt to upload files to the FMS. |
| `run_file_generator_no_upload` | Run the file generator without uploading files to the FMS. |
| `run_indexer` | Run the indexer. Requires both Neo4J (`startdb`) and the curation database (`startcurationdb`). |
| `run_mod_variant_indexer` | Run the MOD variant indexer. |
| `run_human_variant_indexer` | Run the human variant indexer. |
| `start_infinispan` | Start Infinispan. |
| `run_cacher` | Run the cacher. Requires starting Infinispan first. |
| `start_api` | Start the API. |
| `start_ui` | Start the UI. |
| `start_nginx` | Start Nginx. Should always be run last, after all other services have started. |
| `restartelk` | Restart the ELK stack (ElasticSearch / Cerebro / Logstash / Kibana). |
| `run_jbrowse` | TODO ~~Run a JBrowse instance~~. |
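
The explicit prerequisites in the table can be captured in a small lookup so they are not forgotten. This is only a sketch: prereqs_for is a hypothetical name, and it covers just the two targets whose prerequisites the table names outright.

```shell
# Hypothetical helper: print the make targets that the table names as
# prerequisites for a given target, one per line. Targets without an
# explicitly named prerequisite print nothing.
prereqs_for() {
  case "$1" in
    run_indexer) printf '%s\n' startdb startcurationdb ;;
    run_cacher)  printf '%s\n' start_infinispan ;;
  esac
}

# Example: run the prerequisites, then the target itself.
#   for t in $(prereqs_for run_indexer) run_indexer; do
#     make "$t" || break
#   done
```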

Important Note regarding the Indexer and generating indexes.

Once the indexer is run, it will generate a timestamped index using your ENV name, e.g. site_index_chris_1615817944264.
You'll need to launch Cerebro via the web interface on your server and assign an alias for this index in order to launch a functioning website.
- Visit http://{YOUR_NET_VALUE}-dev.alliancegenome.org:9000/
- Log in with the node address http://elasticsearch:9200
- Click more in the top navigation bar and choose aliases.
- Under changes on the right, type site_index in the alias box and then choose your newly created index from the select index dropdown.
- Click the plus symbol to the far right.
- Click the apply button to the far right.
This process will need to be repeated each time the indexer is run. We are currently working to automate this process and will update this README with any changes in the near future.
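
For reference, the Cerebro steps above should correspond to an alias update through the Elasticsearch _aliases endpoint. The sketch below only builds the request body; alias_payload is a hypothetical name, and the curl call in the comment assumes you are in a shell where http://elasticsearch:9200 resolves, which this README does not document. Treat it as an illustration, not a supported workflow.

```shell
# Hypothetical helper: print the JSON body for an Elasticsearch _aliases
# request that adds ALIAS to INDEX.
# Usage: alias_payload INDEX ALIAS
alias_payload() {
  printf '{"actions":[{"add":{"index":"%s","alias":"%s"}}]}\n' "$1" "$2"
}

# Example (assumes Elasticsearch is reachable from the current shell):
#   curl -X POST -H 'Content-Type: application/json' \
#     -d "$(alias_payload site_index_chris_1615817944264 site_index)" \
#     http://elasticsearch:9200/_aliases
```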

Terminate the AWS EC2 instance.

When you are finished working with your instance, be sure to shut it down by running make terminate from the agr_ansible_developers directory.

Example use cases

Running the loader using a GitHub branch in the `stage` environment.

Be sure to complete all the preliminary steps at the top of this README.
Ensure the following variables are set in your main.yml file:
- Neo4J
  - NEO_ENV_IMAGE_FROM_AWS_TAG: stage
  - DOWNLOAD_NEO4J_DATA_IMAGE_FROM_AWS: false
- Loader
  - DOWNLOAD_LOADER_IMAGE_FROM_AWS: false
  - GITHUB_LOADER_BRANCH: "AGR-1234" (Set AGR-1234 to your GitHub branch.)
Run the following command to bring your server online:
- make launch
Logs can be viewed from the web address: http://{YOUR_NET_VALUE}-dev.alliancegenome.org:5601/app/logtrail
Start Neo4J as an empty database:
- make startdb
Run the loader:
- make run_loader
If you've pushed changes to your GitHub branch and need to re-run the loader:
- make restartdb
- make run_loader
When finished, terminate your server:
- make terminate
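
The sequence above can be wrapped so that it stops at the first failing step. This is a minimal sketch: run_targets and the MAKE_CMD override are hypothetical and not part of the Makefile in this repository.

```shell
# Hypothetical helper: run make targets in order and stop at the first failure.
# MAKE_CMD can be overridden (for example with "echo") for a dry run.
run_targets() {
  for rt_target in "$@"; do
    ${MAKE_CMD:-make} "$rt_target" || {
      echo "target failed: $rt_target" >&2
      return 1
    }
  done
}

# First run:
#   run_targets launch startdb run_loader
# After pushing new commits to your branch:
#   run_targets restartdb run_loader
# When finished:
#   run_targets terminate
```

Setting MAKE_CMD=echo prints the targets instead of running them, which is a cheap way to check the order before touching AWS.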

Running the indexer using a GitHub branch in the `stage` environment with a prepopulated `stage` Neo4J.

Be sure to complete all the preliminary steps at the top of this README.
Ensure the following variables are set in your main.yml file:
- Neo4J
  - DOWNLOAD_NEO4J_DATA_IMAGE_FROM_AWS: true
  - NEO4J_DATA_IMAGE_FROM_AWS_TAG: stage
- Curation Database
  - CURATION_IMAGE_FROM_AWS_TAG: stage
  - CURATION_RELEASE_VERSION: v0.15.0
- Indexer, Cacher, and API settings
  - DOWNLOAD_JAVA_SOFTWARE_IMAGE_FROM_AWS: false
  - GITHUB_JAVA_SOFTWARE_BRANCH: "AGR-1234" (Set AGR-1234 to your GitHub branch.)
- Elasticsearch, Kibana, & Logstash settings
  - ES_IMAGE_FROM_AWS_TAG: stage
Run the following command to bring your server online:
- make launch
Logs can be viewed from the web address: http://{YOUR_NET_VALUE}-dev.alliancegenome.org:5601/app/logtrail
Start Neo4J as a prepopulated database:
- make startdb
Start the curation database as a prepopulated database:
- make startcurationdb
Run the indexer with your custom branch:
- make run_indexer
If you've pushed changes to your GitHub branch and need to re-run the indexer, simply run the same command again:
- make run_indexer
When finished, terminate your server:
- make terminate

Launch a website using a GitHub branch for the UI with prepopulated data from `stage`.

Be sure to complete all the preliminary steps at the top of this README.
Ensure the following variables are set in your main.yml file:
- Neo4J
  - DOWNLOAD_NEO4J_DATA_IMAGE_FROM_AWS: true
  - NEO4J_DATA_IMAGE_FROM_AWS_TAG: stage
- Curation Database
  - CURATION_IMAGE_FROM_AWS_TAG: stage
  - CURATION_RELEASE_VERSION: v0.15.0
- Indexer, Cacher, and API settings
  - DOWNLOAD_JAVA_SOFTWARE_IMAGE_FROM_AWS: true
  - JAVA_SOFTWARE_IMAGE_FROM_AWS_TAG: stage
- Elasticsearch, Kibana, & Logstash settings
  - ES_IMAGE_FROM_AWS_TAG: stage
- Infinispan settings
  - DOWNLOAD_INFINISPAN_DATA_IMAGE_FROM_AWS: true
  - INFINISPAN_DATA_IMAGE_FROM_AWS_TAG: stage
- UI settings
  - DOWNLOAD_UI_IMAGE_FROM_AWS: false
  - GITHUB_UI_BRANCH: "AGR-1234" (Set AGR-1234 to your GitHub branch.)
- Nginx settings
  - NGINX_IMAGE_FROM_AWS_TAG: stage
Run the following command to bring your server online:
- make launch
Logs can be viewed from the web address: http://{YOUR_NET_VALUE}-dev.alliancegenome.org:5601/app/logtrail
Start Neo4J as a prepopulated database:
- make startdb
Start the curation database as a prepopulated database:
- make startcurationdb
Run the indexer:
- make run_indexer
- After the indexer is finished, be sure to update the site_index alias as described in the section above, "Important Note regarding the Indexer and generating indexes."
Start Infinispan with prepopulated data:
- make start_infinispan
Start the API:
- make start_api
Start the UI with your custom branch:
- make start_ui
If you've pushed changes to your GitHub branch and need to restart the UI, simply run the same command again:
- make start_ui
Start Nginx:
- make start_nginx
Your site should now be online at the following address:
- http://{YOUR_NET_VALUE}-dev.alliancegenome.org
When finished, terminate your server:
- make terminate
