jorge-martinez-gil/similarity-data-catalogs

An Overview of Approaches to Quantify Open Data Catalog Similarity

(work in progress)

Overview

This paper, authored by Jorge Martinez-Gil, surveys methods for measuring the similarity between open data catalogs (ODCs). It notes the growing relevance of open data initiatives and argues that effective catalog similarity metrics are needed to improve data management and accessibility.

Contents

  1. Introduction
    • Explores the role of ODCs in data transparency and innovation.
    • Introduces the concept of catalog similarity.
  2. Related Works
    • Reviews literature on ODC similarity and standard data catalog vocabularies.
  3. Similarity Methods for Open Data Catalogs
    • 3.1: Repositories of Triples
    • 3.2: Repositories of Tokens
    • 3.3: Character Sequences
  4. Conclusion
    • Summarizes findings and suggests future research directions.
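
To make the three representations listed under section 3 more concrete, here is a minimal sketch that treats a catalog as a set of triples, a bag of tokens, or a raw character sequence. It is an illustration only, not the paper's method: the function names, the choice of Jaccard overlap for triples and tokens, and the use of Python's `difflib.SequenceMatcher` for character sequences are all assumptions made for this example.

```python
from difflib import SequenceMatcher


def jaccard(a, b):
    """Jaccard overlap of two collections, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # two empty collections are considered identical
    return len(a & b) / len(a | b)


def triple_similarity(triples_a, triples_b):
    """Catalogs as sets of (subject, predicate, object) triples (hypothetical helper)."""
    return jaccard(triples_a, triples_b)


def token_similarity(text_a, text_b):
    """Catalogs as lowercase, whitespace-separated tokens (hypothetical helper)."""
    return jaccard(text_a.lower().split(), text_b.lower().split())


def char_similarity(text_a, text_b):
    """Catalogs as raw character sequences, compared with difflib (hypothetical helper)."""
    return SequenceMatcher(None, text_a, text_b).ratio()


# Toy catalog fragments, invented for illustration only.
catalog_a = {("d1", "dct:title", "Air quality"), ("d1", "dcat:keyword", "environment")}
catalog_b = {("d1", "dct:title", "Air quality"), ("d1", "dcat:keyword", "health")}

print(triple_similarity(catalog_a, catalog_b))
print(token_similarity("Air quality data", "air quality measurements"))
print(char_similarity("Air quality data", "Air quality dataset"))
```

Each function returns a score between 0 and 1. Which representation is appropriate depends on how the catalogs are published, which matches the paper's point that the method should be chosen according to catalog characteristics and objectives.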

Key Points

  • Highlights various strategies for ODC similarity measurement, including traditional and advanced semantic-based approaches.
  • Stresses the importance of selecting an appropriate method based on catalog characteristics and objectives.
  • Discusses future research possibilities in dynamic and real-time similarity assessment.

Conclusion

The paper emphasizes the diversity of methodologies in ODC similarity measurement and the need for customized approaches depending on specific catalog requirements.

For full details, refer to the complete paper.

Citation

If you use this work, please cite:

@article{martinez2023overview,
  title={An Overview of Approaches to Quantify Open Data Catalog Similarity},
  author={Martinez-Gil, Jorge},
  year={2023}
}

📄 License

The material is provided under the MIT License.
