Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

web visibility of GO xrefs list #607

Open
zhilianghu opened this issue Jan 15, 2021 · 9 comments
Open

web visibility of GO xrefs list #607

zhilianghu opened this issue Jan 15, 2021 · 9 comments

Comments

@zhilianghu
Copy link

When I searched for biology related xref lists, on the first 2 Google pages are UniProt and NCBI db_xrefs lists. It's hard to find the GO consortium dbxref list although it has 266 databases while other 2 resources have only 183 and 129 databases respectively. The GO consortium dbxref list needs better visibility. Would the public web page need better META descriptions (view-source:http://amigo.geneontology.org/xrefs) or some other means can also help?

@pgaudet
Copy link

pgaudet commented Feb 3, 2021

@lpalbou Any suggestion on this ?

Thanks, Pascale

@pgaudet
Copy link

pgaudet commented Feb 3, 2021

@cmungall

@zhilianghu
Copy link
Author

Currently the page META "description" is "AmiGO 2". Suggest to use something like: "Definitions, syntax, and list of cross-references in genomics, genetics, and life sciences databases; db_xef; dbxref; xref; database; "
The page title: "Information about Cross References" could also be altered to reflect the nature of the content better, like: "A Guide to Database Cross-references in Life Sciences"
Page TITLE is not picked up by Google any more? Or it could also be updated.

@kltm kltm transferred this issue from geneontology/go-site Feb 4, 2021
@cmungall
Copy link
Member

cmungall commented Feb 4, 2021

Hi @zhilianghu good to hear from you!

Note that this has been low on our priorities as our xref metadata has primarily been for within GOC use. However, I think it's a generally useful piece of metadata and we may want to publicise more, e.g. via SEO/tags, putting a page on our main website, even a publication.

But I actually think the approach here should be to make the go xref metadata a spoke in a larger more distributed model. Some other things happening of relevance broader than GO

  • various other prefix registeries like identifiers.org and n2t.net have standardized prefixes and expansions. however, there are some differences in design choices
    • e.g for historic reasons we use UniProtKB, but they use 'uniprot'
    • differences in what case is canonical - this is important
    • we tend to group by database whereas they group by database-type tuple
  • The prefixcommons projects attempts to align, find inconsistencies, and merge different prefix registries: https://github.com/prefixcommons/biocontext
  • some of this may be in scope for @matentz's mapping-commons project

A lot of this is out of scope for our GO registry so I suggest opening a ticket on the prefixcommons repo to continue there

@lpalbou
Copy link
Contributor

lpalbou commented Feb 4, 2021

@zhilianghu it seems you are referring to two different things:

  • how easy it is for a user to find the go db-xrefs.yaml on google (Chris answer on SEO)
  • how much META data we provide about those cross references

The second is a much larger discussion and if that's what you would like, it would need some clarification / examples. So I will only answer for the first part "how to find".

  1. The latest go dbxrefs will always be accessible with:
  1. Currently, the only mention of dbxrefs on the GO website is: http://geneontology.org/docs/GO-term-elements#database-cross-references and it only refers to xrefs for GO terms. I will add some text there as well as a link to the download-mappings section.

  2. We also have a page about cross-references and mappings: http://geneontology.org/docs/download-mappings/ which probably should be linked from the GO Downloads menu (todo). Unfortunately, it doesn't describe the dbxrefs, but that's where it should be. When this is done, this will be indexed by google and other SEO.

Now, if your usage is bioinformatic in nature, I also recently published a lightweight dbxrefs handler on NPM:
https://www.npmjs.com/package/@geneontology/dbxrefs

I still have to write the docs, but essentially:

import * as dbxrefs from "@geneontology/dbxrefs";

// This will return a promise while the latest dbxrefs is retrieved from the URL above
dbxrefs.init(() => {
    this.dbXrefsReady = true;
 })

// When ready, you can simply do the following to get the full URL:
let url = dbxrefs.getURL(databaseName, entityType, entityID);

entityType can be undefined in which case it will take the first record for the database databaseName in the dbxrefs.yaml.

If your usage is more about CURIE <-> IRI conversion, we also provide another library, both in javascript and java:

Please note however that IRIs are different from URLs and IRIs do not necessarily resolve to a web page. They are meant to be permanent identifiers over the web to simplify the interconnection of knowledge over multiple data providers (aka semantic web). All GO terms or GO-CAM have IRIs (see https://geneontology.github.io/docs/sparql/#resource-description-framework-rdf).

@lpalbou
Copy link
Contributor

lpalbou commented Feb 4, 2021

Here is a proposal: geneontology/geneontology.github.io#278

And a new menu item:
Screen Shot 2021-02-04 at 2 46 59 PM

@zhilianghu
Copy link
Author

zhilianghu commented Feb 5, 2021 via email

@lpalbou
Copy link
Contributor

lpalbou commented Feb 5, 2021

Got it, thanks for the feedback Zhiliang.

When the PR is approved, the GO dbxrefs.yaml file will be referenced on the GO website and from there, it should be indexed / searchable by google. There would be more SEO optimization to do in the future though.

@zhilianghu
Copy link
Author

zhilianghu commented Feb 5, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants