climateR Catalogs
Mike Johnson (Lynker), Justin Singh-Mohudpur (Lynker)
Source: vignettes/02-catalogs.Rmd

Catalogs
In order to provide an evolving, federated collection of datasets, climateR makes use of a preprocessed catalog that is updated on a monthly cycle. This catalog is generated and hosted from the climateR-catalogs repository.
This catalog contains over 100,000 datasets from over 2,000 data providers/archives. The following section describes the design of the catalog and its data pipeline.
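As a point of reference, the released catalog can be read directly into R with arrow. A minimal sketch is below; "catalog.parquet" is a placeholder path for the Parquet asset attached to a climateR-catalogs release.

# Minimal sketch: read the released catalog with arrow.
# "catalog.parquet" is a placeholder for the Parquet asset
# attached to a climateR-catalogs release.
library(arrow)

cat_tbl <- read_parquet("catalog.parquet")

nrow(cat_tbl)   # number of catalog items
names(cat_tbl)  # the catalog schema fields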
Design
The catalog data pipeline uses the targets package to establish a declarative workflow using data sources as target creators. In particular, data sources are treated as dynamic plugins to the data pipeline, such that data sources are composable within the pipeline through a framework utilizing R6 classes.
The data source R6 classes expose a simple interface to plugin creators, where adding a new data source is defined by giving a data source three things:
- an id
- a pull function
- a tidy function
The id represents a unique identifier for the data source that is carried into the final catalog. The pull function is a function with any number of arguments that gathers catalog items from an endpoint and collects them into a data.frame. The tidy function accepts at least a single argument, the output of the pull function, and performs any actions necessary to conform that output as closely as possible to the catalog schema.
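To make the interface concrete, a minimal sketch of such a plugin is shown below. The class, its fields, the endpoint, and the column names are hypothetical; the real framework in climateR-catalogs defines its own base class and helpers.

library(R6)

# Hypothetical data source plugin following the id / pull / tidy pattern.
# The endpoint URL and column names are illustrative only.
ExampleSource <- R6Class("ExampleSource",
  public = list(

    # unique identifier carried into the final catalog
    id = "example-provider",

    # pull(): gather catalog items from an endpoint into a data.frame
    pull = function(url = "https://example.org/catalog.csv") {
      utils::read.csv(url, stringsAsFactors = FALSE)
    },

    # tidy(): conform the pulled data.frame to (an approximation of)
    # the catalog schema
    tidy = function(raw) {
      data.frame(
        id    = self$id,
        asset = raw$name,
        URL   = raw$url,
        stringsAsFactors = FALSE
      )
    }
  )
)

A plugin written this way can be instantiated with ExampleSource$new() and exercised on its own, which makes a new source straightforward to test before it is registered with the pipeline.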
Using the data sources built on top of this R6-based framework, the pipeline is then given targets that correspond to (1) loading the R6 class, (2) calling the pull function, and (3) calling the tidy function. These three steps are mapped across all available data sources loaded into the pipeline environment, and the results are joined together to create a single table representing the catalog. Finally, the table's schema is adjusted to ensure it conforms to the catalog specification, and outputs for JSON and Parquet are released.
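A rough sketch of how these three steps might be wired up with targets and tarchetypes is below. The new_source() loader and the source names are hypothetical stand-ins for the real plugin machinery; the final target simply illustrates writing a Parquet output for release.

# _targets.R (sketch); new_source() and the source names are hypothetical.
library(targets)
library(tarchetypes)

mapped <- tar_map(
  values = list(source = c("example_a", "example_b")),

  # (1) load the R6 data source class
  tar_target(src, new_source(source)),

  # (2) call its pull() function
  tar_target(pulled, src$pull()),

  # (3) call its tidy() function
  tar_target(tidied, src$tidy(pulled))
)

list(
  mapped,

  # join the per-source tables into a single catalog table
  tar_combine(catalog, mapped$tidied, command = dplyr::bind_rows(!!!.x)),

  # write the Parquet output for release
  tar_target(
    catalog_parquet,
    { arrow::write_parquet(catalog, "catalog.parquet"); "catalog.parquet" },
    format = "file"
  )
)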
Technical Details
Targets Serialization
A key point to highlight is that, with the targets R package, individual targets are serialized to a specific format when completed, and dependent targets read from this serialization format back into R as necessary. The default format for targets is the R RDS format. However, since this pipeline already requires an Apache Arrow dependency due to the Parquet output, we take advantage of the Arrow IPC file/stream formats for serializing these targets. Specifically, the pull and tidy targets always return the data source R6 class, and the succeeding targets for catalog generation return a data frame. For the targets returning R6 classes, a custom serializer is implemented that performs I/O between the R6 class and its metadata in the Arrow IPC Stream format. For the targets returning data frames, we use the Arrow IPC File format.
The Arrow IPC formats were chosen in this fashion due to the smaller memory footprint and the performance gained from zero-copy passes between targets. This also enables data sources to be built in various programming languages and access the same data if needed, again due to the zero-copy property of Arrow’s IPC formats.
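The sketch below shows one way such serialization choices could be expressed with targets: a custom format built with tar_format() and arrow's IPC Stream reader/writer for the R6-returning targets, and the built-in "feather" (Arrow IPC File) format for data frame targets. The as_metadata() and from_metadata() helpers are hypothetical stand-ins for the actual R6-to-metadata conversion.

library(targets)

# Custom target format: serialize an R6 data source's metadata to the
# Arrow IPC Stream format. as_metadata()/from_metadata() are hypothetical
# conversion helpers between the R6 class and a data frame of metadata.
ipc_stream_format <- tar_format(
  write = function(object, path) {
    arrow::write_ipc_stream(as_metadata(object), path)
  },
  read = function(path) {
    from_metadata(arrow::read_ipc_stream(path))
  }
)

list(
  # targets that return the R6 class use the custom IPC Stream format
  tar_target(src, new_source("example_a"), format = ipc_stream_format),

  # data frame targets use the built-in Arrow IPC File ("feather") format
  tar_target(tidied, src$tidy(src$pull()), format = "feather")
)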
Pipeline Infrastructure
With the catalog data pipeline built on top of R and the targets package, we use GitHub Actions to generate the catalog. Although GitHub Actions is primarily a CI/CD service, the concept of CI/CD can be generalized to data as well. In data engineering, for example, Apache Airflow is a predominant application for constructing data workflows. The primary difference between the two is that GitHub Actions is more general purpose and offers fewer direct integrations for data engineering.
With that context in mind, the GitHub Actions workflow for the catalog data pipeline is, in essence, a runner that calls targets::tar_make() to run the pipeline. When all of the targets are complete, the workflow takes the resulting catalog files and uploads them to the GitHub repository as a release. Furthermore, this workflow is scheduled to run on a monthly basis, ensuring that the catalog stays consistently up to date with the latest datasets offered by the data providers described in the data source plugins.
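On the R side, such a runner boils down to a couple of calls. The sketch below uses piggyback::pb_upload() as one possible way to attach the outputs to a release; the upload mechanism and file names are illustrative, not the actual workflow.

# Sketch of the R steps a scheduled workflow runner might execute.
# The upload mechanism and file names are illustrative only.
library(targets)

# build every target in the catalog pipeline
tar_make()

# attach the released catalog files to the GitHub repository
piggyback::pb_upload("catalog.parquet", repo = "mikejohnson51/climateR-catalogs")
piggyback::pb_upload("catalog.json",    repo = "mikejohnson51/climateR-catalogs")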