From 2deca48e63e9dcef73ce8e10fdd21095ec2de31b Mon Sep 17 00:00:00 2001
From: Josh van Leeuwen
Date: Fri, 26 Jul 2024 22:14:20 +0100
Subject: [PATCH] Updates API ref for jobs. Adds doc on Kubernetes Scheduler
 persistent (#4256)

* Updates API ref for jobs. Adds doc on Kubernetes Scheduler persistent
volume

Signed-off-by: joshvanl

* Apply suggestions from code review

Co-authored-by: Hannah Hunter <94493363+hhunter-ms@users.noreply.github.com>
Signed-off-by: Josh van Leeuwen

* Adds scheduler persistent volume docs for selfhosted

Signed-off-by: joshvanl

* Updates scheduler volume docs based on https://github.com/dapr/cli/pull/1423

Signed-off-by: joshvanl

* Apply suggestions from code review

Co-authored-by: Hannah Hunter <94493363+hhunter-ms@users.noreply.github.com>
Signed-off-by: Josh van Leeuwen

* Adds reference for file system location of scheduler local volume

Signed-off-by: joshvanl

* Update daprdocs/content/en/reference/api/jobs_api.md

Co-authored-by: Cassie Coyle
Signed-off-by: Josh van Leeuwen

* Update daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md

Signed-off-by: Mark Fussell

* Apply suggestions from code review

Co-authored-by: Mark Fussell
Signed-off-by: Josh van Leeuwen

* Adds default volume name for scheduler dapr init

Signed-off-by: joshvanl

* Adds directions for getting scheduler volume from Docker Desktop

Signed-off-by: joshvanl

* Update daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md

Signed-off-by: Mark Fussell

---------

Signed-off-by: joshvanl
Signed-off-by: Josh van Leeuwen
Signed-off-by: Mark Fussell
Co-authored-by: Hannah Hunter <94493363+hhunter-ms@users.noreply.github.com>
Co-authored-by: Cassie Coyle
Co-authored-by: Mark Fussell
---
 .../jobs/howto-schedule-jobs.md               | 19 ++--
 .../building-blocks/jobs/jobs-overview.md     | 19 ++--
 .../kubernetes-persisting-scheduler.md        | 55 ++++++++++++
 .../self-hosted-persisting-scheduler.md       | 27 ++++++
 daprdocs/content/en/reference/api/jobs_api.md | 86 ++++++++++++++-----
 .../content/en/reference/cli/dapr-init.md     |  5 +-
 6 files changed, 172 insertions(+), 39 deletions(-)
 create mode 100644 daprdocs/content/en/operations/hosting/kubernetes/kubernetes-persisting-scheduler.md
 create mode 100644 daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md

diff --git a/daprdocs/content/en/developing-applications/building-blocks/jobs/howto-schedule-jobs.md b/daprdocs/content/en/developing-applications/building-blocks/jobs/howto-schedule-jobs.md
index 166530635c8..fb4263ef184 100644
--- a/daprdocs/content/en/developing-applications/building-blocks/jobs/howto-schedule-jobs.md
+++ b/daprdocs/content/en/developing-applications/building-blocks/jobs/howto-schedule-jobs.md
@@ -8,19 +8,22 @@ description: "Learn how to use the jobs API to schedule jobs"

Now that you've learned what the [jobs building block]({{< ref jobs-overview.md >}}) provides, let's look at an example of how to use the API. The code example below describes an application that schedules jobs for a **TBD** application.

## Set up the Scheduler service

-When you run `dapr init` in either self-hosted mode or on Kubernetes, the Dapr scheduler service is started.
+{{% alert title="Warning" color="warning" %}}
+By default, job data is not resilient to [Scheduler]({{< ref scheduler.md >}}) service restarts.
+A persistent volume must be provided to Scheduler to ensure job data is not lost in either [Kubernetes]({{< ref kubernetes-persisting-scheduler.md >}}) or [Self-Hosted]({{< ref self-hosted-persisting-scheduler.md >}}) mode.
+{{% /alert %}}
+
+When you run `dapr init` in either self-hosted mode or on Kubernetes, the Dapr scheduler service is started.

-## Run the Dapr sidecar
+## Run the Dapr sidecar

-Run the Dapr sidecar alongside your application.
+Run the Dapr sidecar alongside your application.

```bash
dapr run --app-id=jobs --app-port 50070 --app-protocol grpc --log-level debug -- go run main.go
```
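Once the sidecar is up, you can exercise the jobs API over HTTP as a quick smoke test, using the request shape from the [jobs API reference]({{< ref jobs_api.md >}}). The sketch below is illustrative only: the job name `hello-job` and its payload are placeholders, and the sidecar is assumed to be listening on HTTP port 3500 (configurable with `--dapr-http-port`).

```bash
# Illustrative smoke test: schedule a one-shot job against the sidecar.
# Assumes the sidecar's HTTP endpoint is on port 3500; the job name
# "hello-job" and its payload are placeholders.
curl -X POST \
  http://localhost:3500/v1.0-alpha1/jobs/hello-job \
  -H "Content-Type: application/json" \
  -d '{
        "job": {
          "data": {
            "@type": "type.googleapis.com/google.protobuf.StringValue",
            "value": "hello"
          },
          "dueTime": "30s"
        }
      }'
```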
@@ -29,4 +32,4 @@ dapr run --app-id=jobs --app-port 50070 --app-protocol grpc --log-level debug --
 ## Next steps

- [Learn more about the Scheduler control plane service]({{< ref "concepts/dapr-services/scheduler.md" >}})
-- [Jobs API reference]({{< ref jobs_api.md >}})
\ No newline at end of file
+- [Jobs API reference]({{< ref jobs_api.md >}})
diff --git a/daprdocs/content/en/developing-applications/building-blocks/jobs/jobs-overview.md b/daprdocs/content/en/developing-applications/building-blocks/jobs/jobs-overview.md
index ffea762f266..a7838088d59 100644
--- a/daprdocs/content/en/developing-applications/building-blocks/jobs/jobs-overview.md
+++ b/daprdocs/content/en/developing-applications/building-blocks/jobs/jobs-overview.md
@@ -8,14 +8,19 @@ description: "Overview of the jobs API building block"

Many applications require job scheduling, or the need to take an action in the future. The jobs API is an orchestrator for scheduling these future jobs, either at a specific time or for a specific interval.

-Not only does the jobs API help you with scheduling jobs, but internally, Dapr uses the scheduler service to schedule actor reminders.
+Not only does the jobs API help you with scheduling jobs, but internally, Dapr uses the scheduler service to schedule actor reminders.

Jobs in Dapr consist of:
-- The jobs API building block
+- [The jobs API building block]({{< ref jobs_api.md >}})
- [The Scheduler control plane service]({{< ref "concepts/dapr-services/scheduler.md" >}})

[See example scenarios.]({{< ref "#scenarios" >}})

+{{% alert title="Warning" color="warning" %}}
+By default, job data is not resilient to [Scheduler]({{< ref scheduler.md >}}) service restarts.
+A persistent volume must be provided to Scheduler to ensure job data is not lost in either [Kubernetes]({{< ref kubernetes-persisting-scheduler.md >}}) or [Self-hosted]({{< ref self-hosted-persisting-scheduler.md >}}) mode.
+{{% /alert %}}
+
Diagram showing the Scheduler control plane service and the jobs API

## How it works
@@ -34,19 +39,19 @@ You can use jobs to:

Job scheduling can prove helpful in the following scenarios (a sketch of the first scenario expressed as a job follows this list):

-- **Automated Database Backups**:
+- **Automated Database Backups**:
  Ensure a database is backed up daily to prevent data loss. Schedule a backup script to run every night at 2 AM, which will create a backup of the database and store it in a secure location.
-- **Regular Data Processing and ETL (Extract, Transform, Load)**:
+- **Regular Data Processing and ETL (Extract, Transform, Load)**:
  Process and transform raw data from various sources and load it into a data warehouse. Schedule ETL jobs to run at specific times (for example: hourly, daily) to fetch new data, process it, and update the data warehouse with the latest information.
-- **Email Notifications and Reports**:
+- **Email Notifications and Reports**:
  Receive daily sales reports and weekly performance summaries via email.
  Schedule a job that generates the required reports and sends them via email at 6 a.m. every day for daily reports and 8 a.m. every Monday for weekly summaries.
-- **Maintenance Tasks and System Updates**:
+- **Maintenance Tasks and System Updates**:
  Perform regular maintenance tasks such as clearing temporary files, updating software, and checking system health. Schedule various maintenance scripts to run at off-peak hours, such as weekends or late nights, to minimize disruption to users.
-- **Batch Processing for Financial Transactions**:
+- **Batch Processing for Financial Transactions**:
  Process a large number of transactions that need to be batched and settled at the end of each business day. Schedule batch processing jobs to run at 5 PM every business day, aggregating the day’s transactions and performing necessary settlements and reconciliations.

Dapr's jobs API ensures the tasks represented in these scenarios are performed consistently and reliably without manual intervention, improving efficiency and reducing the risk of errors.
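To make the first scenario concrete, the nightly 2 AM backup can be expressed directly as a job schedule using the systemd timer-style cron format described in the [jobs API reference]({{< ref jobs_api.md >}}). The job name and payload in this sketch are placeholders:

```bash
# Sketch of the "automated database backups" scenario: fire every day at
# 02:00. Cron fields are: seconds minutes hours day-of-month month day-of-week.
# The job name and payload are placeholders.
curl -X POST \
  http://localhost:3500/v1.0-alpha1/jobs/nightly-db-backup \
  -H "Content-Type: application/json" \
  -d '{
        "job": {
          "data": {
            "@type": "type.googleapis.com/google.protobuf.StringValue",
            "value": "backup target: orders-db"
          },
          "schedule": "0 0 2 * * *"
        }
      }'
```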
diff --git a/daprdocs/content/en/operations/hosting/kubernetes/kubernetes-persisting-scheduler.md b/daprdocs/content/en/operations/hosting/kubernetes/kubernetes-persisting-scheduler.md
new file mode 100644
index 00000000000..130122192d6
--- /dev/null
+++ b/daprdocs/content/en/operations/hosting/kubernetes/kubernetes-persisting-scheduler.md
@@ -0,0 +1,55 @@
+---
+type: docs
+title: "How-to: Persist Scheduler Jobs"
+linkTitle: "How-to: Persist Scheduler Jobs"
+weight: 50000
+description: "Configure Scheduler to persist its database to make it resilient to restarts"
+---
+
+The [Scheduler]({{< ref scheduler.md >}}) service is responsible for writing jobs to its embedded database and scheduling them for execution.
+By default, the Scheduler service database writes this data to an in-memory ephemeral tmpfs volume, meaning that **this data is not persisted across restarts**. Job data will be lost during these events.
+
+To make the Scheduler data resilient to restarts, a persistent volume must be mounted to the Scheduler `StatefulSet`.
+This persistent volume is backed by a real disk provided by the hosting cloud provider or Kubernetes infrastructure platform.
+Disk size is determined by how many jobs are expected to be persisted at once; however, 64GB should be more than sufficient for most use cases.
+Some Kubernetes providers recommend using a [CSI driver](https://kubernetes.io/docs/concepts/storage/volumes/#csi) to provision the underlying disks.
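For example, a minimal CSI-backed `StorageClass` might look like the following sketch. The `pd.csi.storage.gke.io` provisioner and `pd-ssd` disk type are GKE-specific and shown for illustration only; substitute the provisioner and parameters for your platform.

```bash
# Illustration only: define a CSI-backed StorageClass named my-storage-class.
# The provisioner shown is GKE's persistent disk CSI driver; other platforms
# use their own provisioner names and parameters.
kubectl apply -f - <<EOF
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: my-storage-class
provisioner: pd.csi.storage.gke.io
parameters:
  type: pd-ssd
volumeBindingMode: WaitForFirstConsumer
EOF
```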
+Below is a list of useful links to the relevant documentation for creating a persistent disk for the major cloud providers:
+- [Google Cloud Persistent Disk](https://cloud.google.com/compute/docs/disks)
+- [Amazon EBS Volumes](https://aws.amazon.com/blogs/storage/persistent-storage-for-kubernetes/)
+- [Azure AKS Storage Options](https://learn.microsoft.com/azure/aks/concepts-storage)
+- [DigitalOcean Block Storage](https://www.digitalocean.com/docs/kubernetes/how-to/add-volumes/)
+- [VMware vSphere Storage](https://docs.vmware.com/VMware-vSphere/7.0/vmware-vsphere-with-tanzu/GUID-A19F6480-40DC-4343-A5A9-A5D3BFC0742E.html)
+- [OpenShift Persistent Storage](https://docs.openshift.com/container-platform/4.6/storage/persistent_storage/persistent-storage-aws-efs.html)
+- [Alibaba Cloud Disk Storage](https://www.alibabacloud.com/help/ack/ack-managed-and-ack-dedicated/user-guide/create-a-pvc)
+
+Once the storage class is available, you can install Dapr using the following command, with Scheduler configured to use the storage class (replace `my-storage-class` with its name):
+
+{{% alert title="Note" color="primary" %}}
+If Dapr is already installed, the control plane needs to be completely [uninstalled]({{< ref dapr-uninstall.md >}}) in order for the Scheduler `StatefulSet` to be recreated with the new persistent volume.
+{{% /alert %}}
+
+{{< tabs "Dapr CLI" "Helm" >}}
+
+{{% codetab %}}
+
+```bash
+dapr init -k --set dapr_scheduler.cluster.storageClassName=my-storage-class
+```
+
+{{% /codetab %}}
+
+{{% codetab %}}
+
+```bash
+helm upgrade --install dapr dapr/dapr \
+--version={{% dapr-latest-version short="true" %}} \
+--namespace dapr-system \
+--create-namespace \
+--set dapr_scheduler.cluster.storageClassName=my-storage-class \
+--wait
+```
+
+{{% /codetab %}}
+{{< /tabs >}}
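After installation, you can check that the Scheduler `StatefulSet` came up with a bound `PersistentVolumeClaim`. This sketch assumes the default `dapr-system` namespace; the exact StatefulSet and PVC names can vary between Dapr versions.

```bash
# Verify the Scheduler StatefulSet and its PersistentVolumeClaims are bound.
# Assumes Dapr was installed into the dapr-system namespace.
kubectl get statefulset,pvc --namespace dapr-system
```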
diff --git a/daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md b/daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md
new file mode 100644
index 00000000000..5bcc8745f20
--- /dev/null
+++ b/daprdocs/content/en/operations/hosting/self-hosted/self-hosted-persisting-scheduler.md
@@ -0,0 +1,27 @@
+---
+type: docs
+title: "How-to: Persist Scheduler Jobs"
+linkTitle: "How-to: Persist Scheduler Jobs"
+weight: 50000
+description: "Configure Scheduler to persist its database to make it resilient to restarts"
+---
+
+The [Scheduler]({{< ref scheduler.md >}}) service is responsible for writing jobs to its embedded database and scheduling them for execution.
+By default, the Scheduler service database writes this data to the local volume `dapr_scheduler`, meaning that **this data is persisted across restarts**.
+
+On the host, this volume is typically located at either `/var/lib/docker/volumes/dapr_scheduler/_data` or `~/.local/share/containers/storage/volumes/dapr_scheduler/_data`, depending on your container runtime.
+Note that if you are using Docker Desktop, the volume lives in the Docker Desktop VM's filesystem, which can be accessed using:
+
+```bash
+docker run -it --privileged --pid=host debian nsenter -t 1 -m -u -n -i sh
+```
+
+The default Scheduler volume can be replaced with a custom volume, either one that already exists or one that Dapr creates for you:
+
+{{% alert title="Note" color="primary" %}}
+By default, `dapr init` creates a local persistent volume on your drive called `dapr_scheduler`. If Dapr is already installed, the control plane needs to be completely [uninstalled]({{< ref dapr-uninstall.md >}}) in order for the Scheduler container to be recreated with the new persistent volume.
+{{% /alert %}}
+
+```bash
+dapr init --scheduler-volume my-scheduler-volume
+```
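To confirm where job data is being written, you can inspect the volume with your container runtime. The sketch below uses Docker and assumes the default volume name `dapr_scheduler`; substitute your custom name if you passed one to `dapr init`.

```bash
# Show the configuration of the Scheduler's local volume, including its
# mountpoint on the host. Assumes the default volume name dapr_scheduler.
docker volume inspect dapr_scheduler
```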
diff --git a/daprdocs/content/en/reference/api/jobs_api.md b/daprdocs/content/en/reference/api/jobs_api.md
index de9c33b0d75..37332b848c4 100644
--- a/daprdocs/content/en/reference/api/jobs_api.md
+++ b/daprdocs/content/en/reference/api/jobs_api.md
@@ -10,8 +10,16 @@ weight: 1300
 The jobs API is currently in alpha.
 {{% /alert %}}

+{{% alert title="Warning" color="warning" %}}
+By default, job data is not resilient to [Scheduler]({{< ref scheduler.md >}}) service restarts.
+A persistent volume must be provided to Scheduler to ensure job data is not lost in either [Kubernetes]({{< ref kubernetes-persisting-scheduler.md >}}) or [Self-Hosted]({{< ref self-hosted-persisting-scheduler.md >}}) mode.
+{{% /alert %}}
+
 With the jobs API, you can schedule jobs and tasks in the future.

+> The HTTP APIs are intended for development and testing only. For production scenarios, using the SDKs is strongly
+> recommended, as they implement the gRPC APIs, which provide higher performance and more capabilities than the HTTP APIs.
+
 ## Schedule a job

Schedule a job with a name.

```
POST http://localhost:3500/v1.0-alpha1/jobs/<name>
```

### URL parameters

+{{% alert title="Note" color="primary" %}}
+At least one of `schedule` or `dueTime` must be provided, but they can also be provided together.
+{{% /alert %}}
+
Parameter | Description
--------- | -----------
`name` | Name of the job you're scheduling
-`data` | A string value and can be any related content. Content is returned when the reminder expires. For example, this may be useful for returning a URL or anything related to the content.
-`dueTime` | Specifies the time after which this job is invoked. Its format should be [time.ParseDuration](https://pkg.go.dev/time#ParseDuration)
+`data` | A protobuf message `@type`/`value` pair. `@type` must be of a [well-known type](https://protobuf.dev/reference/protobuf/google.protobuf). `value` is the serialized data.
+`schedule` | An optional schedule at which the job is to be run. Details of the format are below.
+`dueTime` | An optional time at which the job should be active, or the "one shot" time, if other scheduling type fields are not provided. Accepts a "point in time" string in the format of RFC3339, a Go duration string (calculated from creation time), or a non-repeating ISO8601 string.
+`repeats` | An optional number of times the job should be triggered. If not set, the job runs indefinitely or until expiration.
+`ttl` | An optional time to live or expiration of the job. Accepts a "point in time" string in the format of RFC3339, a Go duration string (calculated from job creation time), or a non-repeating ISO8601 string.
+
+#### schedule
+`schedule` accepts both systemd timer-style cron expressions and human-readable '@' prefixed period strings, as defined below.
+
+Systemd timer-style cron accepts 6 fields:
+
+seconds | minutes | hours | day of month | month | day of week
+------- | ------- | ----- | ------------ | ----- | -----------
+0-59 | 0-59 | 0-23 | 1-31 | 1-12/jan-dec | 0-7/sun-sat
+
+"0 30 * * * *" - every hour on the half hour
+"0 15 3 * * *" - every day at 03:15
+
+Period string expressions:
+
+Entry | Description | Equivalent To
+----- | ----------- | -------------
+@every `<duration>` | Run every `<duration>` (e.g. '@every 1h30m') | N/A
+@yearly (or @annually) | Run once a year, midnight, Jan. 1st | 0 0 0 1 1 *
+@monthly | Run once a month, midnight, first of month | 0 0 0 1 * *
+@weekly | Run once a week, midnight on Sunday | 0 0 0 * * 0
+@daily (or @midnight) | Run once a day, midnight | 0 0 0 * * *
+@hourly | Run once an hour, beginning of hour | 0 0 * * * *
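As a sketch of how these fields combine, the following hypothetical request fires on the half hour, caps execution at 10 triggers with `repeats`, and expires the job 24 hours after creation with `ttl`. The job name and payload are placeholders.

```bash
# Sketch: fire every hour on the half hour, at most 10 times, expiring
# 24h after creation. The job name and payload are placeholders.
curl -X POST \
  http://localhost:3500/v1.0-alpha1/jobs/half-hourly-report \
  -H "Content-Type: application/json" \
  -d '{
        "job": {
          "data": {
            "@type": "type.googleapis.com/google.protobuf.StringValue",
            "value": "report"
          },
          "schedule": "0 30 * * * *",
          "repeats": 10,
          "ttl": "24h"
        }
      }'
```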
### Request body

```json
{
    "job": {
-        "data": {
-            "@type": "type.googleapis.com/google.type.Expr",
-            "expression": "<expression>"
-        },
-        "dueTime": "30s"
+        "data": {
+            "@type": "type.googleapis.com/google.protobuf.StringValue",
+            "value": "\"someData\""
+        },
+        "dueTime": "30s"
    }
}
```

### Response codes

Code | Description
---- | -----------
-`202` | Accepted
+`204` | No Content (the job was successfully scheduled)
`400` | Request was malformed
`500` | Request formatted correctly, error in dapr code or Scheduler control plane service

### Response content

The following example curl command creates a job, naming the job `jobforjabba` and specifying the `schedule`, `repeats`, and the `data`.

```bash
$ curl -X POST \
    http://localhost:3500/v1.0-alpha1/jobs/jobforjabba \
-    -H "Content-Type: application/json"
+    -H "Content-Type: application/json" \
+    -d '{
        "job": {
            "data": {
-                "HanSolo": "Running spice"
+                "@type": "type.googleapis.com/google.protobuf.StringValue",
+                "value": "Running spice"
            },
-            "dueTime": "30s"
+            "schedule": "@every 1m",
+            "repeats": 5
        }
    }'
```
@@ -87,33 +125,35 @@ Parameter | Description

### Response codes

Code | Description
---- | -----------
-`202` | Accepted
+`200` | OK
`400` | Request was malformed
-`500` | Request formatted correctly, error in dapr code or Scheduler control plane service
+`500` | Request formatted correctly; the job doesn't exist, or there was an error in dapr code or the Scheduler control plane service

### Response content

After running the following example curl command, the returned response is JSON containing the `name` of the job, its `schedule`, `repeats`, and the `data`.

```bash
-$ curl -X GET http://localhost:3500/v1.0-alpha1/jobs/jobforjabba -H "Content-Type: application/json"
+$ curl -X GET http://localhost:3500/v1.0-alpha1/jobs/jobforjabba -H "Content-Type: application/json"
```

```json
{
-    "name": "test1",
-    "dueTime": "30s",
+    "name": "jobforjabba",
+    "schedule": "@every 1m",
+    "repeats": 5,
    "data": {
-        "HanSolo": "Running spice"
-    }
-}
+        "@type": "type.googleapis.com/google.protobuf.StringValue",
+        "value": "Running spice"
+    }
+}
```

## Delete a job

Delete a named job.

```
-DELETE http://localhost:3500/v1.0-alpha1/jobs/<name>
+DELETE http://localhost:3500/v1.0-alpha1/jobs/<name>
```

### URL parameters
@@ -126,7 +166,7 @@ Parameter | Description

### Response codes

Code | Description
---- | -----------
-`202` | Accepted
+`204` | No Content (the job was successfully deleted)
`400` | Request was malformed
`500` | Request formatted correctly, error in dapr code or Scheduler control plane service

### Response content

In the following example curl command, the job named `jobforjabba` is deleted:

```bash
-$ curl -X DELETE http://localhost:3500/v1.0-alpha1/jobs/jobforjabba -H "Content-Type: application/json"
+$ curl -X DELETE http://localhost:3500/v1.0-alpha1/jobs/jobforjabba -H "Content-Type: application/json"
```

diff --git a/daprdocs/content/en/reference/cli/dapr-init.md b/daprdocs/content/en/reference/cli/dapr-init.md
index b9295a01d03..3df59f323d5 100644
--- a/daprdocs/content/en/reference/cli/dapr-init.md
+++ b/daprdocs/content/en/reference/cli/dapr-init.md
@@ -45,6 +45,7 @@ dapr init [flags]
 | N/A | DAPR_HELM_REPO_PASSWORD | A password for a private Helm chart |The password required to access the private Dapr Helm chart. If it can be accessed publicly, this env variable does not need to be set| |
 | `--container-runtime` | | `docker` | Used to pass in a different container runtime other than Docker. Supported container runtimes are: `docker`, `podman` |
 | `--dev` | | | Creates Redis and Zipkin deployments when run in Kubernetes. |
+| `--scheduler-volume` | | `dapr_scheduler` | Self-hosted only. Optionally, you can specify a volume for the Scheduler service data directory. By default, a local volume named `dapr_scheduler` is created, so job data is persisted across restarts. |

### Examples

@@ -55,7 +56,9 @@ dapr init [flags]

**Install**

-Install Dapr by pulling container images for Placement, Scheduler, Redis, and Zipkin. By default, these images are pulled from Docker Hub.
+Install Dapr by pulling container images for Placement, Scheduler, Redis, and Zipkin. By default, these images are pulled from Docker Hub.
+
+> By default, a `dapr_scheduler` local volume is created for the Scheduler service to be used as the database directory. The host file location for this volume is typically `/var/lib/docker/volumes/dapr_scheduler/_data` or `~/.local/share/containers/storage/volumes/dapr_scheduler/_data`, depending on your container runtime.

```bash
dapr init