Skip to content

Introduction about metadata of dataset

Overview

Metadata is a bundle of pre-declare information about data.

This metadata was created for:

[1] Provide information on data overview

[2] Register data to a registered data manager

[3] Make it appeal to the data universe

[4] Metadata captured data in their cycle

The Business Case for Metadata — Federal Geographic Data Committee

Data Life Cycle https://www.fgdc.gov/policyandplanning/a-16/stages-of-geospatial-data-lifecycle-a16.pdf

Below is some captured description of the term metadata.

[1] Open Metadata

Metadata

[Short] Descriptive information about data

[Long] is critical to helping visitors find and use published data effectively. Good metadata reduces the need for visitors to seek personal assistance, helps prevent misinterpretation of data, and encourages higher data quality.

There are 2 types, adopted by Open Metadata:

Implement Metadata

Component

Description

Agent

Metadata Registry

Categories

Dataset Catalog

Column Catalog

Open Metadata Registry Step-By-Step Instruction - Metadata-Registry

Categories As an open data portal grows beyond 25-30 datasets, it is more helpful to visitors if they can browse for data by subject matter or theme. These categories are short, at most a word or two, and allow related data to be grouped together. Categories also empower visitors to explore available data for inspiration, rather than requiring them to use a search tool to find something specific.

Creating categories is often a fundamental step when implementing an open data portal. Categories do not need to be permanent; it makes sense to have three to four categories for a small number of datasets, and re-evaluate them on an annual basis as more data is published. Most mature open data portals have 8 to 12 categories. Having too many might mean that the categories are not broad enough. Having too few, especially when combined with a large number of datasets, might mean that the categories are too broad and less helpful for visitors.

Although there is no consistent set of categories between open data portals, the following are quite common and might serve as a starting point: Business, Education, Environment, Finance, Health, Human (or Social) Services, Property, Public Safety, Recreation, and Transportation. A librarian or information architect can provide insight and assistance when creating or revising a list of categories.

Included Data Points

As-reported financial data Standardized financial data Company reference and metadata Metrics & ratios Institutional holdings Insider transactions Price $9,600/yr History Back to 2006 Access Methods API, CSV, Snowflake, s3 Source Directly from the SEC Volume N/A Update Frequency Real time Licensing Business Use Support Ticketing, Instant Chat Free Trial Yes

https://intrinio.com/products/us-fundamentals

  • Business Design with metadata:

a) Provider Name b) Provider Focus c) Provider Pricing Models d) Sample of Provider Products e) URL link of Provider

The USP:

  • Defined the brand of the company

  • Working like Bloomberg Terminal Anywhere

  • Monetize at the App Buy In

Source Reference

[1] Open Data Metadata Guide Introduction · Open Data - Metadata Guide

[2] DCAT - US Schema DCAT-US Schema v1.1 (Project Open Data Metadata Schema) | resources.data.gov

[3] https://www.fgdc.gov/technical-guidance/metadata/fgdc-technical-guidance-datagov-geoplatform-ngda.pdf