Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow "bare metal" deployment for testing #136

Draft
wants to merge 12 commits into
base: main
Choose a base branch
from

Conversation

dbutenhof
Copy link
Collaborator

@dbutenhof dbutenhof commented Nov 20, 2024

Type of change

  • Refactor
  • New feature
  • Bug fix
  • Optimization
  • Documentation Update

Description

The UI is currently hardcoded to support two environments: a development deployment from localhost:3000 to localhost:8000, and the API explicitly allows http://localhost:3000 as a cross-site origin; and an OpenShift deployment where cross-site scripting is unnecessary as the cluster API reverse proxy hides the port numbers.

Partly for more general testing and deployment, but specifically because the RHEL AI InstructLab project requires CPT dashboard access now before our code has been integrated into the production OpenShift deployment, it's convenient to support a third "bare metal" mode where the containerized UI and backend are hosted at ports 3000 and 8000 on some host (e.g., in the RDU3 Performance Lab).

For this, the UI needs to recognize that a non-localhost window.location with a 3000 port needs to call the API at port 8000 on the same host (for our "bare metal" deployment) while an empty port indicates we're using the OpenShift API reverse proxy routing.

Similarly, the backend code cross-site scripting protection needs to allow port 3000 from the same host as a valid origin.

This is chained from #122 (crucible) -> #123 (ilab API) -> #124 (ilab UI) -> #125 (multi-run API) -> #127 (multi-run UI) -> #129 (statistics aggregation) -> #130 (statistics display) -> #131 (metadata flyover) -> #132 (multiple metrics selection) -> #133 (compare multiple metrics) -> #134 (improve delta time x axis labels) -> #136 (bare metal CORS)

Related Tickets & Documents

PANDA-653 Fix CORS for non-cluster deployment

Checklist before requesting a review

  • I have performed a self-review of my code.
  • If it is a core feature, I have added thorough tests.

Testing

Tested on a Performance Lab host.

Verified when UI is loaded from http://<host>:3000 that API calls are made to http://<host>:8000 and that the backend generates the proper access-control-allow-origin: http://<host>:3000 response header expected by the browser.

Verified that localhost dev deployment with backend/scripts/start-reload.sh and UI npm run dev continues to work.

dbutenhof and others added 12 commits October 18, 2024 12:51
This encapsulates substantial logic to encapsulate interpretation of the
Crucible Common Data Model OpenSearch schema for the use of CPT dashboard API
components. By itself, it does nothing.
This builds on the `crucible_svc` layer in cloud-bulldozer#122 to add a backend API.
This relies on the ilab API in cloud-bulldozer#123, which in turn builds on the crucible
service in cloud-bulldozer#122.
When graphing metrics from two runs, the timestamps rarely align; so we add a
`relative` option to convert the absolute metric timestamps into relative
delta seconds from each run's start.
This adds the basic UI to support comparison of the metrics of two InstructLab
runs. This compares only the primary metrics of the two runs, in a relative
timeline graph.

This is backed by cloud-bulldozer#125, which is backed by cloud-bulldozer#124, which is backed by cloud-bulldozer#123,
which is backed by cloud-bulldozer#122. These represent a series of steps towards a complete
InstructLab UI and API, and will be reviewed and merged from cloud-bulldozer#122 forward.
This PR is primarily CPT dashboard backend API (and Crucible service) changes
to support pulling and displaying multiple Crucible metric statistics. Only
minor UI changes are included to support API changes. The remaining UI changes
to pull and display statistics will be pushed separately.
Add statistics charts for selected metric in row expansion and comparison
views.
Extract the "Metadata" into a separate component, which allows it to be reused
as an info flyover on the comparison page to help in identifying target runs
to be compared.
Modify the metrics pulldown to allow multiple selection. The statistical
summary chart and graph will show all selected metrics in addition to the
inherent benchmark primary benchmark (for the primary period).
Support selection of multiple metrics using the pulldown in the comparison
page. The update occurs when the pulldown closes.

To simplify the management of "available metrics" across multiple selected
runs, which might have entirely different metrics, the reducer no longer
tries to store separate metric selection lists for each run. This also means
that the "default" metrics selection remains when adding another comparison
run, or expanding another row.
The Plotly graphing package doesn't directly support a "delta time" type, and
in the comparison view we want to use delta time to compare two runs that will
generally have different absolute timestamps. (It turns out that the native
PatternFly graphing package, Victory, has the same limitation.)

Initially, this just reported numeric delta seconds, but that's unnatural for
a reader. This PR adds support for a `absolute_relative` option which reports
the delta times as small absolute timestamps, like `1970-01-01 00:01:00` for
60 seconds, formatting ticks using `"%H:%M:%S"` ("00:01:00") for readability.

I also made the X axis title appear, which necessitated some refactoring of
the layout to avoid overlaying the legend on the axis label; and in the
process I moved the "presentation specific" width parameter into the UI and
the others into the API so they don't have to be duplicated in the two action
calls.
The UI is currently hardcoded to support two environments: a development
deployment from `localhost:3000` to `localhost:8000`, and the API explicitly
allows `http://localhost:3000` as a cross-site origin; and an OpenShift
deployment where cross-site scripting is unnecessary as the cluster API
reverse proxy hides the port numbers.

Partly for more general testing and deployment, but specifically because the
RHEL AI InstructLab project requires CPT dashboard access now before our code
has been integrated into the production OpenShift deployment, it's convenient
to support a third "bare metal" mode where the containerized UI and backend
are hosted at ports 3000 and 8000 on some host (e.g., in the RDU3 Performance
Lab).

For this, the UI needs to recognize that a non-`localhost` `window.location`
with a `3000` port needs to call the API at port `8000` on the same host (for
our "bare metal" deployment) while an empty port indicates we're using the
OpenShift API reverse proxy routing.

Similarly, the backend code cross-site scripting protection needs to allow
port 3000 from the same host as a valid origin.
@dbutenhof dbutenhof self-assigned this Nov 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants