docs/source/reference/esm-catalog-spec.md
17 additions & 17 deletions
@@ -43,8 +43,8 @@ The descriptor is a single json file, inspired by the [STAC spec](https://github
 ### Catalog
 
 The collection points to a single catalog.
-A catalog is a CSV file.
-The meaning of the columns in the csv file is defined by the parent collection.
+A catalog is a CSV or parquet file.
+The meaning of the columns in the csv/parquet file is defined by the parent collection.
 
 ```
 activity_id,source_id,path
@@ -65,29 +65,29 @@ They should be either [URIs](https://en.wikipedia.org/wiki/Uniform_Resource_Iden
 | id | string |**REQUIRED.** Identifier for the catalog. |
 | title | string | A short descriptive one-line title for the catalog. |
 | description | string |**REQUIRED.** Detailed multi-line description to fully explain the catalog. [CommonMark 0.28](http://commonmark.org/) syntax MAY be used for rich text representation. |
-| catalog_file | string |**REQUIRED.** Path to the CSV file with the catalog contents.|
-| catalog_dict | array | If specified, it is mutually exclusive with `catalog_file`. An array of dictionaries that represents the data that would otherwise be in the csv. |
+| catalog_file | string |**REQUIRED.** Path to the CSV/parquet file with the catalog contents. |
+| catalog_dict | array | If specified, it is mutually exclusive with `catalog_file`. An array of dictionaries that represents the data that would otherwise be in the csv/parquet.|
 | attributes |[[Attribute Object](#attribute-object)]|**REQUIRED.** A list of attribute columns in the data set. |
-| assets |[Assets Object](#assets-object)|**REQUIRED.** Description of how the assets (data files) are referenced in the CSV catalog file.|
+| assets |[Assets Object](#assets-object)|**REQUIRED.** Description of how the assets (data files) are referenced in the CSV/parquet catalog file. |
 | aggregation_control |[Aggregation Control Object](#aggregation-control-object)|**OPTIONAL.** Description of how to support aggregation of multiple assets into a single xarray data set. |
 
 ### Attribute Object
 
-An attribute object describes a column in the catalog CSV file.
+An attribute object describes a column in the catalog CSV/parquet file.
 The column names can optionally be associated with a controlled vocabulary, such as the [CMIP6 CVs](https://github.com/WCRP-CMIP/CMIP6_CVs), which explain how to interpret the attribute values.
-| column_name | string |**REQUIRED.** The name of the column containing the path to the asset. Must be in the header of the CSV file.|
+| column_name | string |**REQUIRED.** The name of the column containing the path to the asset. Must be in the header of the CSV/parquet file. |
 | format | string | The data format. Valid values are `netcdf`, `zarr`, `zarr2`, `zarr3`, `opendap` or `reference` ([`kerchunk`](https://github.com/fsspec/kerchunk) reference files). If specified, it means that all data in the catalog is the same type. |
 | format_column_name | string | The column name which contains the data format, allowing for variable data types in one catalog. Mutually exclusive with `format`. |
 
@@ -128,11 +128,11 @@ If `zarr2` or `zarr3` is specified in the `format` field, the `async` flag will
 
 An aggregation control object defines necessary information to use when aggregating multiple assets into a single xarray data set.
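
The field rules touched by this diff (`catalog_file` versus `catalog_dict`, `format` versus `format_column_name`, and a catalog file that may now be CSV or parquet) can be illustrated with a small standalone check. This is only a sketch under stated assumptions, not code from the project: the helper names `catalog_kind` and `check_descriptor` are hypothetical, guessing the file type from the file name is an assumption (the spec text shown here does not say how the type is detected), and the check assumes that exactly one of `catalog_file` or `catalog_dict` should be present.

```python
from pathlib import PurePosixPath

# Valid values for the `format` field, as listed in the assets table above.
VALID_FORMATS = {"netcdf", "zarr", "zarr2", "zarr3", "opendap", "reference"}


def catalog_kind(catalog_file: str) -> str:
    """Guess the catalog file type from its name (hypothetical helper).

    Assumption: a ".parquet" suffix means parquet; anything else,
    including compressed names such as "cat.csv.gz", is treated as CSV.
    """
    suffixes = PurePosixPath(catalog_file).suffixes
    return "parquet" if ".parquet" in suffixes else "csv"


def check_descriptor(descriptor: dict) -> list[str]:
    """Return a list of problems; an empty list means none were detected."""
    problems = []

    # catalog_file and catalog_dict are mutually exclusive.
    has_file = "catalog_file" in descriptor
    has_dict = "catalog_dict" in descriptor
    if has_file and has_dict:
        problems.append("catalog_file and catalog_dict are mutually exclusive")
    elif not (has_file or has_dict):
        problems.append("one of catalog_file or catalog_dict is needed")

    # In the assets object, column_name is required, and format is
    # mutually exclusive with format_column_name.
    assets = descriptor.get("assets", {})
    if "column_name" not in assets:
        problems.append("assets.column_name is required")
    if "format" in assets and "format_column_name" in assets:
        problems.append("format and format_column_name are mutually exclusive")
    elif "format" in assets and assets["format"] not in VALID_FORMATS:
        problems.append(f"unknown format: {assets['format']!r}")

    return problems
```

For example, `check_descriptor({"catalog_file": "cat.parquet", "assets": {"column_name": "path", "format": "zarr"}})` returns an empty list, while a descriptor that sets both `format` and `format_column_name` yields one reported problem. Returning a list of messages rather than raising on the first failure is a design choice made here so that all violations can be reported at once.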