# Welcome to Sparkitecture!

![Sparkitecture logo](https://1150130598-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-LbIjUyYyn8XW78ocYcy%2F-Lq7rYC1cxUSy2n_oFW0%2F-Lq7rZqsWSBg5dSpLFJ8%2FSparkitecture_logo.png?alt=media\&token=7d7069b3-aa9c-4cd6-842a-f51c8657397f)

[![DOI](https://zenodo.org/badge/167874964.svg)](https://zenodo.org/badge/latestdoi/167874964)

**Created by:** Colby T. Ford, Ph.D.

> PySpark Edition | *A work in progress...* | Created using [GitBook.com](https://www.gitbook.com/)

## About

*Sparkitecture* is a collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.

{% hint style="info" %}
This is an open-source project (GPL v3.0) for the Spark community. If you have ideas or contributions you'd like to add, submit a [Feature Request](https://github.com/colbyford/sparkitecture/issues/new/choose), or write your code/tutorial/page and create a [Pull Request](https://github.com/colbyford/sparkitecture/pulls) in the GitHub repo.
{% endhint %}

## How to Cite

| BibTeX | Text Citation |
| --- | --- |
| <p><code>@misc{sparkitecture,</code></p><p><code>author = {Colby T. Ford},</code></p><p><code>title  = {Sparkitecture - {A} collection of "cookbook-style" scripts for simplifying data engineering and machine learning in {Apache Spark}.},</code></p><p><code>month = oct,</code></p><p><code>year = 2019,</code></p><p><code>doi = {10.5281/zenodo.3468502},</code></p><p><code>url = {https://doi.org/10.5281/zenodo.3468502}</code></p><p><code>}</code></p> | Colby T. Ford. (2019, October). Sparkitecture - A collection of "cookbook-style" scripts for simplifying data engineering and machine learning in Apache Spark (Version v1.0.0). Zenodo. <http://doi.org/10.5281/zenodo.3468502> |
