The parallelizability study informed the design of the annotation language, which is described below.
> _N.b.: We welcome contributions to the study and annotations for common commands._
## Main Parallelizability Classes
PaSh introduces four major parallelizability classes:
If parallelized on a single input, each stage would need to wait on the results of the previous stage.
The last class, `side-effectful`, contains commands that have side-effects across the system -- for example, updating environment variables, interacting with the filesystem, and accessing the network.
Such commands are not parallelizable without finer-grained concurrency control mechanisms that can detect side-effects across the system.
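To make the classification concrete, the sketch below models it as a toy lookup. The command classifications mirror examples from the study, but the mapping and the helper function are hypothetical illustrations, not PaSh's actual annotation database:

```Python
# Illustrative sketch only: a toy mapping of commands to the four classes.
# The helper below is hypothetical and not part of PaSh's codebase.
STATELESS = "stateless"            # each input line handled independently (e.g., tr)
PURE = "pure"                      # deterministic, but needs all of its input (e.g., sort)
NON_PARALLELIZABLE = "non-parallelizable"
SIDE_EFFECTFUL = "side-effectful"  # affects system state (e.g., chmod)

EXAMPLE_CLASSES = {
    "tr": STATELESS,
    "grep": STATELESS,
    "sort": PURE,
    "wc": PURE,
    "chmod": SIDE_EFFECTFUL,
}

def is_parallelizable(command):
    """Stateless and pure commands are the ones amenable to parallelization."""
    return EXAMPLE_CLASSES.get(command) in (STATELESS, PURE)
```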
## Parallelizability Study of Commands in GNU & POSIX
The parallelizability study of commands in GNU and POSIX comprises two parts: a coarse-grained parallelizability study and a set of annotations for commands.
Annotations can be thought of as defining a bidirectional correspondence between a command and a dataflow node.
Since command behaviors (and correspondence) can change based on their arguments, annotations contain a sequence of predicates.
Each predicate is accompanied by information that instantiates the correspondence between a command and a dataflow node.
## A Simple Example: `chmod`
As a first example, below we present the annotations for `chmod`.
The annotation for `chmod` is very simple, since it only needs to establish that `chmod` is side-effectful and therefore cannot be translated to a dataflow node.
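The annotation file itself is not reproduced here; the dictionary below is a hedged sketch of its likely shape. The field names (`command`, `cases`, `predicate`, `class`) are assumptions for illustration, not the exact schema:

```Python
# Hypothetical sketch of a chmod annotation as a Python dict; field names
# are assumed for illustration, not PaSh's exact annotation schema.
chmod_annotation = {
    "command": "chmod",
    "cases": [
        {
            "predicate": "default",     # applies regardless of arguments
            "class": "side-effectful",  # hence: never a dataflow node
        }
    ],
}

def default_class(annotation):
    """Return the class of the default (first) case in the sketch above."""
    return annotation["cases"][0]["class"]
```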
## Another Example: `cut`
As another example, below we present the annotations for `cut`.
Inputs are always assigned to the non-option arguments and the output is always the command's stdout.
The option "stdin-hyphen" indicates that a non-option argument that is just a dash `-` represents the stdin, and the option "empty-args-stdin" indicates that if non-option arguments are empty, then the command reads from its stdin.
The list identified by "short-long" contains a correspondence of short and long argument names for this command.
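To illustrate these options, here is a hedged sketch of how a consumer of the annotation might resolve a command's inputs; `resolve_inputs` and `SHORT_LONG` are hypothetical helpers, not PaSh's implementation (though the flag pairs shown are real `cut` flags):

```Python
def resolve_inputs(non_option_args, stdin_hyphen=True, empty_args_stdin=True):
    """Map non-option arguments to input sources, mirroring the two options
    described above: a lone '-' stands for stdin, and no non-option
    arguments at all means the command reads from stdin."""
    if not non_option_args and empty_args_stdin:
        return ["stdin"]
    return ["stdin" if (arg == "-" and stdin_hyphen) else arg
            for arg in non_option_args]

# A short/long correspondence like the one identified by "short-long"
# (these particular pairs are real flags of `cut`):
SHORT_LONG = {"-d": "--delimiter", "-f": "--fields", "-c": "--characters"}
```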
## How to Annotate a Command
The first step to annotating a command is to identify its default class: `stateless`, `pure`, `non-parallelizable`, or `side-effectful`. How does the command behave without any inputs?
The next step is to identify the set of inputs and their order.
PaSh has recently shifted away from ahead-of-time compilation and towards just-in-time compilation intermixed with the execution of a script.
This shift brings many benefits, allowing PaSh to correctly handle expansion and other important details -- but it complicates the clear exposition of the two phases.
A high-level diagram of PaSh's end-to-end operation is shown below:

PaSh pre-processes a sequential script to insert calls to `pash_runtime.py`.
It then invokes the script, switching between evaluation, execution, and parallelization at runtime:
(i) it first parses the script, creating an abstract syntax tree (AST);
(ii) it then expands the nodes of the AST, often calling the shell to perform that expansion;
(iii) it compiles dataflow regions, i.e., parts of the AST that are potentially parallelizable, through an iterative optimization procedure applied over a dataflow graph (DFG); and
(iv) it finally emits the parallel script by translating the DFG to an AST and unparsing that AST back to a shell script.
The compilation takes into account information about individual commands through [annotations](../annotations), and the emitted parallel script uses additional constructs provided by PaSh's [runtime library](../runtime).
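The four runtime steps can be sketched as a toy driver; every helper below is an illustrative stand-in, not PaSh's actual API:

```Python
# Toy sketch of steps (i)-(iv); all helpers are illustrative stand-ins.

def parse(src):
    # (i) parse the script into a (toy) AST node
    return {"kind": "command", "text": src}

def expand(ast):
    # (ii) expand words in the AST (a no-op placeholder here)
    return ast

def compile_to_dfg(ast):
    # (iii) build and optimize a (toy) dataflow graph
    return {"nodes": [ast["text"]], "edges": []}

def emit(dfg):
    # (iv) translate the DFG back to a shell-script string
    return " | ".join(dfg["nodes"])

def jit_compile_region(region_source):
    """Run one region through all four steps."""
    return emit(compile_to_dfg(expand(parse(region_source))))
```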
A correspondence between blocks in the diagram and Python modules is shown below:
- Preprocessing: [pash.py](./pash.py)
- Expansion and compilation: [ast_to_ir.py](./ast_to_ir.py)
- Dealing with annotations: [annotations.py](./annotations.py), [command_categories.py](./command_categories.py)
First, there is the parser in [compiler/parser](../compiler/parser), a port of [libdash](https://github.com/mgree/) -- the dash parser extended with OCaml bindings -- further extended with `ocaml2json` and `json2ocaml` code to interface with PaSh.
## Compiler Overview
Now let's get to the compiler.
Its entry point is [pash.py](./pash.py), which parses a script and replaces potentially parallelizable regions with calls to [pash_runtime.sh](./pash_runtime.sh).
It then executes the script.
This allows the compiler to be invoked at runtime, when the values of environment variables are known.
The [pash_runtime.sh](./pash_runtime.sh) script simply invokes the [pash.py](./pash.py) compiler:
if compilation succeeds, it executes the optimized script; otherwise, it executes the original script.
The compiler has several stages:
1. It expands words in the AST and then it turns it into our dataflow model (guided by [annotations](../annotations))
   - The expansion and translation happen in [ast_to_ir.py](./ast_to_ir.py)
   - The dataflow model is defined mostly in [ir.py](./ir.py)
   - The annotations are processed in [annotations.py](./annotations.py) and [command_categories.py](./command_categories.py)
2. It performs transformations on the dataflow graph to expose parallelism (guided by annotations)
   - Translations happen in [pash_runtime.py](./pash_runtime.py)
3. It then translates the dataflow graph back to a shell script to execute it with bash
   - The `dfg2shell` translation happens in [ir_to_ast.py](./ir_to_ast.py)
[//]: # (TODO: the parsing/unparsing components need update)
## Zooming into Fragments
A few interesting fragments are outlined below.
The [ast_to_ir.py](./ast_to_ir.py) file contains a case statement that essentially pattern-matches on constructs of the shell script's AST and then compiles them accordingly.
```Python
compile_cases = {
    "Pipe": (lambda fileIdGen, config:
    # ... more code ...
```
The following function from [ir.py](./ir.py) is responsible for parallelizing a single node (_i.e._, a command) in the dataflow graph.
Look at the schematic in the comments starting [on line 637](./ir.py#L637) that gives the high-level overview of what this function does (not shown below).
[//]: # (TODO: Add schematic here)
```Python
# See comment on line 637
# ... more code ...
```
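As a rough intuition for what that parallelization step does, the toy sketch below replaces one stateless node with a split, n parallel copies of the command, and a merge; the names and the graph representation are illustrative only, not PaSh's actual data structures:

```Python
def parallelize_node(node_cmd, n):
    """Toy sketch of node parallelization: a split node feeds n parallel
    copies of the command, whose outputs are combined by a merge node.
    All node names and fields are illustrative."""
    split = {"cmd": "split", "outputs": n}
    workers = [{"cmd": node_cmd, "id": i} for i in range(n)]
    merge = {"cmd": "merge", "inputs": n}
    return [split] + workers + [merge]
```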
Another interesting fragment is in [ir_to_ast.py](./ir_to_ast.py), which translates the parallel dataflow graph back to an AST.
```Python
def ir2ast(ir, args):
    # ... more code ...
```
This AST is then unparsed back into a (parallel) shell script.
## Earlier Versions
The compiler is outlined in the [EuroSys paper](https://arxiv.org/pdf/2007.09436.pdf), but has evolved considerably since then:
* PaSh originally did not have a preprocessing component and did not handle variable expansion. It now does both, significantly improving its practical applicability, since it can be used on scripts where environment variables are modified throughout the script.
* PaSh originally used the code in [parser](./parser) -- a port of [libdash](https://github.com/mgree/), the `dash` parser extended with OCaml bindings -- and specifically the `ocaml2json` and `json2ocaml` binaries to interface with PaSh. PaSh now uses a custom parser written in Python, avoiding any dependency on OCaml and simplifying dependency management.
The following resources offer overviews of important PaSh components.
* Short tutorial: [introduction](./tutorial.md#introduction), [installation](./tutorial.md#installation), [execution](./tutorial.md#running-scripts), and [next steps](./tutorial.md#what-next)
_Most benchmark sets in the evaluation infrastructure include a `input/setup.sh` script for fetching inputs and setting up the experiment appropriately._
See [Running other scripts]() later.
#### Common Unix one-liners
The one-liner scripts are included in [evaluation/microbenchmarks](../evaluation/microbenchmarks).
The list of scripts (and their correspondence to the names in the paper) is shown below:
Note that `-m` supersedes `-s`, but `-l` does not supersede either of the two.
Also note that if you run a script partially, it might save partial results, and therefore show 0 speedups at some points in the plots.
#### Unix50 from Bell Labs
All of the Unix50 pipelines are in [evaluation/unix50/unix50.sh](../evaluation/unix50/unix50.sh).
The inputs of the pipelines are in [evaluation/unix50/](../evaluation/unix50/).
These differences are due to the evolution of PaSh and the refinement of its annotations.
The issue with these splits is that they do not manage to split the file (since there is only one line), leaving the rest of the script to run sequentially.
#### NOAA Weather Analysis
Note that the input files needed by this script are `curl`ed from a server on the local network, and therefore this program cannot be run from elsewhere.
The speedup observed here is actually higher than what is reported in the paper, since PaSh doesn't have to write the intermediate files (between preprocessing and processing) to disk.
#### Wikipedia Web Indexing
Note that the input files needed by this script (the complete Wikipedia) are saved locally on the server, and therefore this program cannot be run from elsewhere.