Unreleased (latest)
- Add support of official CWL IANA types to allow Process deployment with the relevant ``Content-Type`` header for the submitted payload (see common-workflow-language/common-workflow-language#421 (comment), relates to opengeospatial/NamingAuthority#169, resolves #434).
- Support Process deployment using only CWL content provided it contains an ``id`` field representing the target Process ID as per recommendation in OGC Best Practice for Earth Observation Application Package, CWL Document (resolves #434).
- Support Process deployment with a payload using ``YAML`` content instead of ``JSON``. This ``YAML`` content MUST be submitted in the request with a ``Content-Type`` header either equal to ``application/x-yaml`` or ``application/ogcapppkg+yaml`` for the OGC Application Package schema, or using ``application/cwl+yaml`` for a CWL-only definition. The definition will be loaded and converted to ``JSON`` for schema validation. Otherwise, ``JSON`` content is assumed to be directly provided in the request payload for validation as previously accomplished.
- Add partial support of CWL with ``$graph`` representation for the special case where the graph is composed of a list of exactly one Application Package. Multi/nested-CWL definitions are NOT supported (relates to #56).
- Add ``weaver.cwl_processes_dir`` configuration setting for preloading, registering or updating a set of known Process definitions from CWL files stored in a nested directory structure. This allows a service provider that uses Weaver to offer their Processes to directly maintain their definitions from the set of CWL files and upload changes in the web application at startup without need to manually undeploy and redeploy each Process.
- Add ``weaver.cwl_processes_register_error`` to fail fast on any Process registration error from CWL when loading files at startup.
- Fix Process deployment using a WPS-1/2 URL reference defining a ``GetCapabilities`` request to resolve the corresponding ``DescribeProcess`` request if the Process ID can be inferred from other known locations (relates to #11).
- Move ``WpsPackage`` properties to instance level to avoid potential referencing of attributes across the same class used by distinct running Processes.
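The ``Content-Type`` rules described for YAML deployment payloads can be sketched as follows. This is only an illustration under stated assumptions: the helper name ``classify_deploy_payload`` and its return labels are hypothetical and are not part of Weaver; only the three YAML Media-Types come from the entries above.

```python
# Media-Types named in the changelog entry for YAML deployment payloads.
YAML_OGCAPPPKG_TYPES = {"application/x-yaml", "application/ogcapppkg+yaml"}
YAML_CWL_TYPE = "application/cwl+yaml"


def classify_deploy_payload(content_type):
    """Return an (encoding, schema) pair for a deployment Content-Type.

    Hypothetical helper for illustration only; not Weaver's implementation.
    """
    # Drop parameters such as "; charset=utf-8" and normalize the case.
    media_type = content_type.split(";", 1)[0].strip().lower()
    if media_type in YAML_OGCAPPPKG_TYPES:
        return ("yaml", "ogcapppkg")
    if media_type == YAML_CWL_TYPE:
        return ("yaml", "cwl")
    # Anything else is treated as JSON provided directly, as before.
    # The schema cannot be inferred from the header alone in this sketch.
    return ("json", "unknown")
```

A caller would load YAML content and convert it to JSON before schema validation only when the first element of the returned pair is ``"yaml"``.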
4.18.0 (2022-06-09)
- Add CLI Authentication Handler parameters and corresponding ``auth`` argument of instantiated classes for ``WeaverClient`` methods that allows inline request authentication and authorization resolution to access a protected service. Any Authentication Handler implementation can be used to fulfill required server functionalities.
- Add CLI handling of uncaught exceptions to gracefully report message and error instead of exception traceback.
- Replaced CLI option ``-t`` by ``-T`` (Docker token) during ``deploy`` operation to match naming convention of other options (resolves #400).
- Replaced CLI option ``-H`` by ``nH`` (``--no-headers``) and ``wH`` (``--with-headers``) to respectively disable or (explicitly) enable return of headers from response of the executed operation.
- Replaced CLI option ``-L`` by ``nL`` (``--no-links``) and ``wL`` (``--with-links``) to respectively (explicitly) disable or enable return of links from response of the executed operation.
- Replaced previously defined ``-H`` option by new ``-H/--header`` argument allowing insertion of explicitly provided request headers for relevant requests called by the executed operation.
- Add case insensitive support of values for common API, CLI, and ``WeaverClient`` parameter choices.
- Add all missing CLI and ``WeaverClient`` examples in the documentation.
- Fix ``Process.payload`` improperly encoded in case of special characters where allowed such as in CWL definition.
- Fix CLI operations assuming valid JSON response to instead return error response content and status code.
- Fix CLI rendering of various optional arguments and groups when displaying help messages.
- Fix invalid handling of ``Constants`` definitions mixed with ``classproperty`` such as in ``OutputFormat`` causing returned value to be the ``classproperty`` itself instead of the retrieved value from its getter definition.
- Fix minor typing definitions that were incorrect.
4.17.0 (2022-05-30)
- Add statistics collection at the end of Job execution to obtain used memory from ``celery`` process and space used by produced results.
- Add ``/jobs/{jobID}/statistics`` endpoint (and corresponding locations for ``/providers`` and ``/processes``) to report any collected statistics following a Job execution.
- Fix Job ``Location`` header injected twice in ``get_job_submission_response`` causing header to have comma-separated list of URI values failing retrieval by CLI when attempting to perform auto-monitoring of the submitted Job.
- Fix CWL runtime context setup to return monitored maximum RAM used by application under the Process if possible.
- Fix failing Service provider summary response in case of unresponsive (not accessible or parsable) URL endpoint contents due to different errors raised by distinct versions of ``requests`` package.
4.16.1 (2022-05-12)
- Add OpenGIS as a potential namespace resolver for common geospatial Media-Types such as ``image/tiff; subtype=geotiff`` that must be distinguished from generic IANA formats.
- Fix invalid interpretation of stored Process I/O with ``schema`` with Media-Type reference not representing a pre-resolved OpenAPI schema object, but rather an expected URI ``contentSchema`` reference for default format.
- Fix CLI combination of user-provided Process description and inserted Process ID by option argument considering alternative ``OGC``/``OLD`` representations.
- Fix OAS ``format`` field dropped for literal type when resolving ``schema`` provided during Process deployment.
- Fix Media-Type resolution dropping important sub-type parameters to distinguish between specific type context (e.g. ``image/tiff`` vs ``image/tiff; subtype=geotiff``).
4.16.0 (2022-05-11)
- Add support of OpenAPI ``schema`` field for I/O definitions within Process description responses as required by OGC API - Processes specification (resolves #245). Existing and deployed processes using legacy I/O definitions will be parsed for corresponding fields employed in OpenAPI to generate the missing ``schema`` field. Inversely, processes directly deployed with ``schema`` definitions are ported back to legacy I/O representation by padding them with corresponding fields. Conversion between the two representations is unidirectional according to whether ``schema`` is specified or not. Nevertheless, the final I/O definitions can try to make use of both representations simultaneously and in combination with I/O definitions extracted from the CWL Application Package to resolve additional details during I/O merging strategy.
- Add support of ``Accept`` header, ``f`` and ``format`` request queries for ``GET /jobs/{jobID}/logs`` retrieval using ``text``, ``json``, ``yaml`` and ``xml`` (and their corresponding Media-Type definitions) to list Job logs.
- Add partial support of literals with unit of measure (``UoM``) specified during Process deployment using the I/O ``schema`` field (relates to #430).
- Add partial support of bounding box parsing specified during Process deployment using the I/O ``schema`` field (relates to #51).
- Add encoding/decoding of JSON I/O definitions for saving to database in order to support OpenAPI ``schema`` that can contain conflicting key names with MongoDB functionalities (e.g.: ``$ref``).
- Add parsing of CLI inputs with ``@parameter=value`` additional properties to be passed for the Process execution. This can be used for specifying the ``mediaType`` and ``encoding`` of a ``File`` reference input.
- Remove ``deploymentProfileName`` requirement during Process deployment. The corresponding ``deploymentProfile`` property is instead automatically generated from resolved CWL package/reference or remote WPS reference. This further simplifies deployment using the CLI to its bare minimum components as only the CWL or WPS reference needs to be provided along the desired Process ID without any further details.
- Remove ``VaultReference`` from ``ReferenceURL`` schema employed to reference external resources that are not intended to be used with temporary Vault definitions. Only inputs for Process execution will allow Vault references.
- Fix ``LiteralOutput`` creation not removing ``allowed_values`` not available with PyWPS class.
- Fix failing Process deployment caused by ``links`` if explicitly specified in the payload by the user. Additional links that don't conflict with dynamically generated ones are added to the deployed Process definition.
- Fix missing ``deploymentProfile`` property in Process description (resolves #319).
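The entry about encoding JSON I/O definitions before database storage concerns keys such as ``$ref`` that conflict with MongoDB operators. A minimal sketch of one possible round-trip encoding is given below. The ``ESCAPE_PREFIX`` marker and both function names are hypothetical and are not claimed to match what Weaver actually stores.

```python
ESCAPE_PREFIX = "__dollar__"  # hypothetical marker, assumed for illustration


def _map_keys(obj, rename):
    """Recursively apply ``rename`` to every mapping key in a JSON-like object."""
    if isinstance(obj, dict):
        return {rename(key): _map_keys(value, rename) for key, value in obj.items()}
    if isinstance(obj, list):
        return [_map_keys(item, rename) for item in obj]
    return obj


def encode_keys(obj):
    """Replace a leading '$' in keys so they can be saved safely."""
    return _map_keys(
        obj, lambda k: ESCAPE_PREFIX + k[1:] if k.startswith("$") else k
    )


def decode_keys(obj):
    """Restore keys previously rewritten by ``encode_keys``."""
    return _map_keys(
        obj,
        lambda k: "$" + k[len(ESCAPE_PREFIX):] if k.startswith(ESCAPE_PREFIX) else k,
    )
```

One limitation of this sketch is that a genuine key already starting with the marker would be decoded incorrectly, so a real implementation would need a marker that cannot occur in user content.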
4.15.0 (2022-04-20)
In order to support synchronous execution, setting ``RESULT_BACKEND`` MUST be specified in the ``weaver.ini`` configuration file. See Weaver INI Configuration Example in section ``[celery]`` for more details.

With resolution and added support of ``transmissionMode`` handling according to OGC API - Processes specification, requests that were submitted with ``reference`` outputs will produce results in a different format than previously since this parameter was ignored and always returned ``value`` representation.

Due to ``celery>=5.2`` migration, any call to ``celery`` CLI must be updated accordingly by moving the global options before the mode, namely ``worker``, ``inspect`` and so on. Specifically for Weaver, this means the ``weaver-worker`` command line option ``-A`` must be moved before ``worker`` as follows: ``celery -A pyramid_celery.celery_app worker -B -E --ini weaver.ini [...]``
- Support ``Prefer`` header with ``wait`` or ``respond-async`` directives to select ``Job`` execution mode either as synchronous or asynchronous task, according to supported ``jobControlOptions`` of the relevant ``Process`` being executed (resolves #247).
- Increase minor version of all ``builtin`` processes that will now be executable in either (a)synchronous modes.
- Add ``weaver.exec_sync_max_wait`` and ``weaver.quote_sync_max_wait`` settings allowing custom definition for the maximum duration that can be specified to wait for a synchronous response from task workers.
- Add ``-B`` (``celery beat``) option to Docker command of ``weaver-worker`` to run scheduled task in parallel to ``celery worker`` in order to periodically cleanup task results introduced by synchronous execution.
- Add support of ``transmissionMode`` handling as ``reference`` to generate HTTP ``Link`` references for results requested this way (resolves #377).
- Updated every ``Process`` to report that they support ``outputTransmission`` both as ``reference`` and ``value``, since handling of results is accomplished by Weaver itself, regardless of the application being executed.
- Add partial support of ``response=raw`` parameter for execution request submission in order to handle results to be returned accordingly to specified ``outputTransmission`` by ``reference`` or ``value``. Multipart contents for multi-output results are not yet supported (relates to #376).
- Add CLI option ``-R/--ref/--reference`` for ``execute`` operation allowing to request corresponding ``outputs`` by ID to be returned using the ``transmissionMode: reference`` method, producing HTTP ``Link`` headers for those entries rather than inserting values in the response content body.
- Add requested ``outputs`` into response of ``GET /jobs/{jobId}/inputs`` to obtain submitted ``Job`` definitions.
- Add query parameter ``schema`` for ``GET /jobs/{jobId}/inputs`` (and corresponding endpoints under ``/processes`` and ``/providers``) allowing to retrieve submitted input values and requested outputs with either ``OGC``/``OLD`` formats.
- Improve conformance for returned status codes and error messages when requesting results for an unfinished, failed, or dismissed ``Job``.
- Adjust conformance item references to correspond with OGC API - Processes: Part 2 renamed from Transactions to Deploy, Replace, Undeploy.
- Add ``mutable`` field to ``Process`` summary listing and detailed descriptions for conformance (resolves #180).
- Improve ``Process`` undeployment to consider running ``Job`` to block its removal while in use.
- Add ``category`` query parameter to ``/conformance`` endpoint allowing to filter items by ``conf`` (conformance), ``rec`` (recommendation), ``req`` (requirement), ``per`` (permission) or ``all`` references. By default, return the ``conf`` representation which is the expected definitions by OGC API conformance validators.
- Add multiple conformance items related to CWL and OGC Best Practice for Earth Observation Application Package definitions (relates to #56, #103, #105, #294, #399).
- Phase out ``Python 3.6`` support to better resolve package dependencies (could still work, but not explicitly supported nor officially guaranteed to work).
- Fix ``outputs`` permitted to be completely omitted from the execution request (resolves #375).
- Fix ``outputs`` permitted as explicit empty mapping or list as equivalent to omitting them, defining by default that all ``outputs`` should be returned with ``transmissionMode: value`` for ``Job`` execution.
- Fix all instances of ``outputTransmission`` reported as ``reference`` in ``Process`` descriptions, although Weaver behaved with the ``value`` method, which is to return values and file references in content body, instead of HTTP ``Link`` header references.
- Fix WPS 1/2 endpoint not reporting the appropriate instance URL (fixes #83).
- Fix CLI ``deploy`` operation headers incorrectly passed down to the deployment request.
- Fix many linting issues with latest ``pylint`` definitions.
- Fix temporary ``pywps`` patches that have been integrated (relates to #352 addressing issues geopython/pywps#578 and geopython/pywps#623).
- Fix ``celery`` security vulnerability with update to latest recommended version (resolves #386).
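The selection of synchronous or asynchronous execution from the ``Prefer`` header, as described in this release, could look roughly like the sketch below. The function names, the fallback rules and the ``max_wait`` default of 20 seconds are assumptions made for illustration; the actual limit would come from a setting such as ``weaver.exec_sync_max_wait``, and Weaver's real resolution logic may differ.

```python
def parse_prefer(header):
    """Parse a Prefer header into a mapping of lowercase preference names."""
    prefs = {}
    for item in (header or "").split(","):
        # Ignore optional parameters that follow a ';' in a preference.
        token = item.split(";", 1)[0].strip()
        if not token:
            continue
        name, _, value = token.partition("=")
        prefs[name.strip().lower()] = value.strip().strip('"')
    return prefs


def select_execution_mode(header, job_control_options, max_wait=20):
    """Return ("sync" or "async", wait seconds or None).

    Hypothetical resolution rules; ``max_wait`` is an assumed default.
    """
    prefs = parse_prefer(header)
    can_sync = "sync-execute" in job_control_options
    can_async = "async-execute" in job_control_options
    wait = prefs.get("wait")
    wait = int(wait) if wait and wait.isdigit() else None
    # An explicit asynchronous preference wins when the Process allows it.
    if "respond-async" in prefs and can_async:
        return ("async", None)
    # Honor a wait request only if it fits within the configured maximum.
    if can_sync and wait is not None and wait <= max_wait:
        return ("sync", wait)
    # A Process limited to synchronous execution has no other choice.
    if can_sync and not can_async:
        return ("sync", max_wait)
    return ("async", None)
```

For example, ``select_execution_mode("wait=10", ["sync-execute", "async-execute"])`` resolves to synchronous execution with a 10 second wait under these assumed rules, while a ``wait`` value above the maximum falls back to asynchronous execution.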
4.14.0 (2022-03-14)
- Add CLI option ``-L/--no-links`` that drops the ``links`` section of any response to make the printed result more concise and specific to relevant details of the called operation.
- Add CLI option ``-F/--format`` that allows output of contents in an alternative format. Available formatters include JSON, YAML and XML representations, with either pretty indentation and newlines or not. This allows CLI calls to return contents in the format preferred by a caller that might need to parse the relevant details. This is an alternative until the API itself can return similar formatted responses (relates to #125).
- Add CLI option ``-H/--headers`` that allows output of response headers as well as the response contents. This can be useful for endpoints that can return critical information, such as ``Location`` header for the Job status endpoint of an OGC compliant service, or the ``Preference-Applied`` header for services that support multiple execution modes (i.e.: ``wait`` for ``sync-execute`` or ``respond-async`` for ``async-execute`` control options).
- Add CLI operation ``jobs`` to obtain listing with some options similar to the corresponding API endpoint queries.
- No change.
4.13.0 (2022-03-09)
- Add ``schema`` query parameter to ``GET /jobs/{jobID}/outputs`` request allowing to select between ``OGC``, ``OLD``, ``OGC+strict`` and ``OLD+strict`` representations (case insensitive), each with different combinations of ``format.mimeType``, ``format.mediaType`` and/or directly ``type`` field to provide the Content-Type of an output with ``href`` file. By default, both the ``format`` (i.e.: ``OLD`` schema) and the ``type`` (i.e.: ``OGC`` schema) are simultaneously reported for backward and forward compatibility, and for OGC compliance, to return the IANA Media-Type of the associated file reference (relates to #401).
- Add support of ``type`` as alias to the Media-Type under the ``format`` for file references when submitted for ``Job`` execution inputs, in accordance to the reported inputs/outputs endpoints, and for OGC compliance (resolves #401).
- Drop ``type`` field for ``metadata`` items in process description that correspond to a ``value`` with a ``role``.
- Enforce pattern validation of ``type`` as IANA Content-Type for ``metadata`` items in process description that correspond to a ``Link`` with ``href``. Invalid ``type`` are now rejected to adhere to OGC requirement classes.
- Clarify schema employed by Weaver to use naming that is as close as possible to OGC schemas to facilitate their comprehension and external references.
- Fix ``GET /jobs/{jobID}/inputs`` endpoint failing to return submitted ``inputs`` for ``Job`` execution when they were specified using the mapping representation (i.e.: ``OGC`` schema) instead of the listing representation (i.e.: ``OLD`` schema).
- Fix Media-Type provided as ``Job`` file reference input not forwarded to underlying WPS execution for validation against supported formats for corresponding inputs. Specified format handles both the ``OLD`` definition with ``format`` field (and nested ``mimeType`` or ``mediaType``), and the more recent ``OGC`` format with ``type`` field.
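The two ways of declaring the Media-Type of a file reference mentioned in this release (``type`` for the ``OGC`` schema, nested ``format.mediaType`` or ``format.mimeType`` for the ``OLD`` schema) can be resolved with a small lookup. The helper below is a hypothetical sketch, and the precedence order it applies is an assumption rather than documented Weaver behaviour.

```python
def get_media_type(file_reference):
    """Return the Media-Type declared on a file reference, or None.

    Hypothetical helper: checks the OGC-style ``type`` first, then the
    OLD-style nested ``format`` fields. The precedence is an assumption.
    """
    fmt = file_reference.get("format") or {}
    return (
        file_reference.get("type")
        or fmt.get("mediaType")
        or fmt.get("mimeType")
    )
```

With such a lookup, an input such as ``{"href": "https://example.com/a.tif", "type": "image/tiff"}`` and its ``OLD`` equivalent using ``format.mimeType`` would both yield ``image/tiff`` for validation against the supported formats.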
4.12.0 (2022-02-28)
- Updates related to OGC API - Processes: Quotation Extension.
- Move estimator portion of the quoting operation into separate files and bind them with Celery task to allow the same kind of dispatched processing as normal Process execution.
- Update Quote data type to contain status similarly to Job considering dispatched ``async`` processing.
- Define ``LocalizedDateTimeProperty`` for reuse by data types avoiding issues about handling datetime localization.
- Update OpenAPI schemas regarding Quote (partial/complete) and other datetime related fields.
- Add parsing of ``Prefer`` header allowing ``sync`` processing (relates to #247). This is not yet integrated for Jobs execution themselves on ``processes/{id}/execution`` endpoint.
- No change.
4.11.0 (2022-02-24)
- Support Process deployment using OGC schema (i.e.: Process metadata can be provided directly under ``processDescription`` instead of being nested under ``processDescription.process``). This aligns the deployment schema with reference OGC API - Processes: Deploy, Replace, Undeploy extension (see OGC Application Package schema). The previous schema for deployment with nested ``process`` field remains supported for backward compatibility.
- Fix resolution of the ``default`` field specifier under a list of supported ``formats`` during deployment. For various combinations such as when ``default: True`` format is omitted, or when the default is not ordered first, resolved ``default`` specifically for ``outputs`` definitions would be incorrect.
4.10.0 (2022-02-22)
- Refactor all constants of similar concept into classes to facilitate reuse and avoid omitting entries when iterating over all members of a corresponding constant group (fixes #33).
- Fix resolution of common IANA Media-Types (e.g.: ``text/plain``, ``image/jpeg``, etc.) that technically do not provide an explicit entry when accessing the namespace (i.e.: ``{IANA_NAMESPACE_URL}/{mediaType}``), but are known in IANA registry through various RFC specifications. The missing endpoints caused many recurring and unnecessary HTTP 404 that needed a second validation against EDAM namespace each time. These common Media-Types, along with new definitions in ``weaver.formats``, will immediately return IANA/EDAM references without explicit validation on their registries.
4.9.1 (2022-02-21)
- Add encryption of stored Vault file contents until retrieved for usage by the executed ``Process`` application.
- Fix auto-resolution of Vault file ``Content-Type`` when not explicitly provided.
4.9.0 (2022-02-17)
- Add Vault endpoints providing a secured self-hosted file storage to upload local files for execution input.
- Add ``upload`` CLI operation for uploading local files to Vault.
- Add CLI automatic detection of local files during ``execute`` call to upload to Vault and retrieve them from it on the remote Weaver instance.
- Add ``-S``/``--schema`` option to CLI ``describe`` operation.
- Add more documentation examples and references related to CLI and ``WeaverClient`` usage.
- Improve Media-Type/Content-Type guesses based on known local definitions and extensions in ``weaver.formats``.
- Extend ``PyWPS`` ``WPSRequest`` to support more authorization header forwarding for inputs that could need it.
- Fix rendering of CLI required arguments under the appropriate argument group section when those arguments can be specified using prefixed ``-`` and ``--`` optional arguments format.
- Fix CLI ``url`` parameter to be provided using ``-u`` or ``--url`` without specific argument position needed.
- Fix CLI parsing of ``File`` inputs for ``execute`` operation when provided with quotes to capture full paths.
- Fix rendering of OpenAPI variable names (``additionalParameters``) employed to represent for example ``{input-id}`` as the key within the mapping representation of inputs/outputs. The previous notation employed was incorrectly interpreted as HTML tags, making them partially hidden in Swagger UI.
- Fix reload of ``DockerAuthentication`` reference from database failing due to mismatched parameter names.
- Fix invalid generation and interpretation of timezone-aware datetime between local objects and loaded from database. Jobs created or reported without any timezone UTC offset were assumed as UTC+00:00 although corresponding datetimes were generated based on the local machine timezone information. Once reloaded from database, the missing timezone awareness made datetime stored in ISO-8601 format to be interpreted as already localized datetime.
- Fix invalid setup of generic CLI options headers for other operations than ``dismiss``.
- Fix ``weaver.request-options`` handling that always ignored ``timeout`` and ``verify`` entries from the configuration file by overriding them with default values.
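The timezone issue fixed in this release comes from mixing naive and timezone-aware datetimes. The sketch below shows one common way to avoid it with the Python standard library: always create aware datetimes, and attach an explicit offset when reloading a stored value that lacks one. The function names are hypothetical, and treating a stored naive value as UTC is an assumption of this sketch, not a statement about how Weaver migrates existing records.

```python
from datetime import datetime, timezone


def now_utc():
    """Create a timezone-aware datetime instead of a naive local one."""
    return datetime.now(timezone.utc)


def load_datetime(text):
    """Parse an ISO-8601 string and guarantee the result is timezone-aware."""
    parsed = datetime.fromisoformat(text)
    if parsed.tzinfo is None:
        # Assumption: values stored without an offset are interpreted as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed
```

Because the offset is serialized by ``isoformat()``, a value created with ``now_utc()`` compares equal to itself after a round-trip through ``load_datetime``, regardless of the timezone of the machine that reloads it.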
4.8.0 (2022-01-11)
- Refactor Workflow operation flow to reuse shared input and output staging operations between implementations. Each new step process implementation now only requires to implement the specific operations related to deployment, execution, monitoring and result retrieval for their process, without need to consider Workflow intermediate staging operations to transfer files between steps.
- Refactor ``Wps1Process`` and ``Wps3Process`` step processes to follow new workflow operation flow.
- Add ``builtin`` process ``file_index_selector`` that allows the selection of a specific file within an array of files.
- Add tests to validate chaining of Workflow steps using different combinations of process types including WPS-1, OGC-API and ``builtin`` implementations.
- Move CWL script examples in documentation to separate package files in order to directly reference them in tests validating their deployment and execution requests.
- Move all ``tests/functional/application-packages`` definitions into distinct directories to facilitate categorization of corresponding deployment, execution and package contents, and better support the various Workflow testing location of those files with backward compatibility.
- Add logs final entry after retrieved internal CWL application logs to help highlight delimitation with following entries from the parent Process.
- Fix handling of CWL Workflow outputs between steps when nested glob output binding are employed (resolves #371).
- Fix resolution of ``builtin`` process Python reference when executed locally within a Workflow step.
- Fix resolution of process type WPS-1 from its package within a Workflow step executed as OGC-API process.
- Fix resolution of ``WPS1Requirement`` directly provided as CWL execution unit within the deployment body.
- Fix deployment body partially dropping invalid ``executionUnit`` sub-fields causing potential misinterpretation of the intended application package.
- Fix resolution of package or WPS-1 reference provided by ``href`` with erroneous ``Content-Type`` reported by the returned response. Attempts auto-resolution of detected CWL (as JSON or YAML) and WPS-1 (as XML) contents.
- Fix resolution of ``format`` reference within CWL I/O record after interpretation of the loaded application package.
- Fix missing WPS endpoint responses in generated OpenAPI for ReadTheDocs documentation.
- Fix reporting of WPS-1 status location as the XML file URL instead of the JSON OGC-API endpoint when Job was originally submitted through the WPS-1 interface.
- Fix and improve multiple typing definitions.
4.7.0 (2021-12-21)
- Add CLI ``--body`` and ``--cwl`` arguments support of literal JSON string for ``deploy`` operation.
- Fix help message of CLI arguments not properly grouped within intended sections.
- Fix handling of mutually exclusive CLI arguments in distinct operation sub-parsers.
- Fix CLI requirement of ``--process`` and ``--job`` arguments.
4.6.0 (2021-12-15)
- Add ``WeaverClient`` and ``weaver`` CLI as new utilities to interact with Weaver instead of using the HTTP API. This provides both shell and Python script interfaces to run operations toward Weaver instances (or any other OGC API - Processes compliant instance except for deployment operations). It also facilitates new Process deployments by helping with the integration of a local CWL file into a full-fledged ``Deploy`` HTTP request, and other recurrent tasks such as ``Execute`` requests followed by Job monitoring and results retrieval once completed successfully (resolves #363, resolves DAC-198, relates to DAC-203).
- Added ``weaver`` command installation to ``setup.py`` script.
- Added auto-documentation utilities for new ``weaver`` CLI (argparse parameter definitions) and provide relevant references in new chapter in Sphinx documentation.
- Added ``cwl2json_input_values`` function to help converting between CWL parameters and OGC API - Processes input value definitions for Job submission.
- Added ``weaver.datatype.AutoBase`` that allows quick definition of data containers with fields accessible both as properties and dictionary keys, simply by detecting predefined class attributes, avoiding a lot of boilerplate code.
- Split multiple file loading, remote validation and resolution procedures into distinct functions in order for the new CLI to make use of the same methodologies as needed.
- Updated documentation with new details relevant to the added CLI and corresponding references.
- Updated some tests utilities to facilitate definitions of new tests for ``WeaverClient`` feature validation.
- Replaced literal string ``"OGC"`` and ``"OLD"`` used for schema selection by properly defined constants.
- Add database revision number for traceability of migration procedures as needed.
- Add first database revision with conversion of UUID-like strings to literal UUID objects.
- Add ``links`` to ``/processes`` and ``/providers/{id}/processes`` listings (resolves #269).
- Add ``limit``, ``page`` and ``sort`` query parameters for ``/processes`` listing (resolves #269).
- Add ``ignore`` parameter to ``/processes`` listing when combined with ``providers=true`` to allow the similar behaviour supported by ``ignore`` on ``/providers`` endpoint, to effectively ignore services that cause parsing errors or failure to retrieve details from the remote reference.
- Add schema validation of contents returned on ``/processes`` endpoint.
- Add more validation of paging applicable index ranges and produce ``HTTPBadRequest [400]`` when values are invalid.
- Fix some typing definitions related to CWL function parameters.
- Fix multiple typing inconsistencies or ambiguities between ``AnyValue`` (as Python typing for any literal value) against the actual class ``AnyValue`` of ``PyWPS``. Typing definitions now all use ``AnyValueType`` instead.
- Fix resolution of ``owsContext`` location in the payload of remote Process provided by ``href`` link in the ``executionUnit`` due to OGC API - Processes (``"OGC"`` schema) not nested under ``process`` key (in contrast to ``"OLD"`` schema).
- Fix resolution of ``outputs`` submitted as mapping (OGC API - Processes schema) during Job execution to provide desired filtered outputs in results and their ``transmissionMode``. Note that filtering and handling of all ``transmissionMode`` variants are themselves not yet supported (relates to #377 and #380).
- Fix resolution of unspecified UUID representation format in MongoDB.
- Fix conformance with error type reporting of missing Job or Process (resolves #320).
- Fix sorting of text fields using alphabetical case-insensitive ordering.
- Fix search with paging reporting invalid ``total`` when out of range.
- Pin ``pymongo<4`` until ``celery>=5`` gets resolved (relates to #386).
4.5.0 (2021-11-25)
- Add support of ``X-Auth-Docker`` request header that can be specified during Process deployment as authentication token that Weaver can use to obtain access and retrieve the Docker image referenced by the Application Package (CWL) located on a private registry.
- Add more documentation details about sample CWL definitions to execute script, Python and Dockerized applications.
- Fix parsing of inputs for OpenSearch parameters lookup that was assuming inputs were always provided as listing definition, not considering possible mapping definition.
- Fix incorrect documentation section ``Package as External Execution Unit Reference`` where content was omitted and incorrectly anchored as following ``ESGF-CWT`` section.
4.4.0 (2021-11-19)
- Add ``map_wps_output_location`` utility function to handle recurrent mapping of ``weaver.wps_output_dir`` back and forth with resolved ``weaver.wps_output_url``.
- Add more detection of map-able WPS output location to avoid fetching files unnecessarily. Common cases are ``Workflow`` running multiple steps on the same server or Application Package ``Process`` that reuses an output produced by a previous execution. Relates to #183.
- Add pre-validation of file accessibility using HTTP HEAD request when a subsequent ``Workflow`` step employs an automatically mapped WPS output location from a previous step to verify that the file would otherwise be downloadable if it could not have been mapped. This is to ensure consistency and security validation of the reference WPS output location, although the unnecessary file download operation can be avoided.
- Add functional ``Workflow`` tests to validate execution without the need of remote Weaver test application (relates to #141, relates to #281).
- Add missing documentation details about Data Source and connect chapters with other relevant documentation details and updated ``Workflow`` tests.
- Add handling of ``Content-Disposition`` header providing preferred ``filename`` or ``filename*`` parameters when fetching file references instead of the last URL fragment employed by default (resolves #364).
- Add more security validation of the obtained file name from HTTP reference, whether generated from URL path fragment or other header specification.
- Fix incorrect resolution of ``Process`` results endpoint to pass contents from one step to another during ``Workflow`` execution (resolves #358).
- Fix logic of remotely and locally executed applications based on CWL requirements when attempting to resolve whether an input file reference should be fetched.
- Fix resolution of WPS I/O provided as mapping instead of listing during deployment in order to properly parse them and merge their metadata with corresponding CWL I/O definitions.
- Fix DataSource and OpenSearch typing definitions to more rapidly detect incorrect data structures during parsing.
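The ``Content-Disposition`` handling and file name validation added in this release can be approximated with the standard library as sketched below. The function name ``resolve_file_name`` and the specific sanitization rules are hypothetical choices for illustration, not Weaver's implementation.

```python
import posixpath
from email.message import Message
from urllib.parse import unquote, urlparse


def resolve_file_name(url, content_disposition=None):
    """Pick a safe local file name for a fetched HTTP file reference.

    Hypothetical helper: prefers the Content-Disposition file name and
    falls back to the last URL path fragment.
    """
    name = None
    if content_disposition:
        # The email parser understands 'filename' header parameters.
        message = Message()
        message["Content-Disposition"] = content_disposition
        name = message.get_filename()
    if not name:
        name = unquote(posixpath.basename(urlparse(url).path))
    # Keep only the final path component to block directory traversal.
    name = posixpath.basename(name.replace("\\", "/"))
    if name in ("", ".", ".."):
        raise ValueError("Cannot resolve a valid file name.")
    return name
```

In this sketch a header value such as ``attachment; filename="../../etc/passwd"`` is reduced to its last path component, so the fetched file cannot be written outside the intended directory.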
4.3.0 (2021-11-16)
- Add support of ``type`` and ``processID`` query parameters for ``Job`` listing (resolves some tasks in #268).
- Add ``type`` field to ``Job`` status information (resolves #351).
- Add OGC API - Processes conformance references regarding supported operations for ``Job`` listing and filtering.
- Add ``minDuration`` and ``maxDuration`` parameters to query ``Job`` listing filtered by specific execution time range (resolves #268). Range duration parameters are limited to single values each (relates to opengeospatial/ogcapi-processes#261).
- Require minimally ``pymongo==3.12.0`` and corresponding MongoDB ``5.0`` instance to process new filtering queries of ``minDuration`` and ``maxDuration``. Please refer to :ref:`database_migration` and MongoDB official documentation for migration methods.
- Refactor ``Job`` search method to facilitate its extension in the event of future filter parameters.
- Support contextual WPS output location using ``X-WPS-Output-Context`` header to store ``Job`` results. When a ``Job`` is executed by providing this header with a sub-directory, the resulting outputs of the ``Job`` will be placed and reported under the corresponding location relative to WPS outputs (path and URL).
- Add ``weaver.wps_output_context`` setting as default contextual WPS output location when header is omitted.
- Replace ``Job.execute_async`` getter/setter by simple property using more generic ``Job.execution_mode`` for storage in database. Provide ``Job.execute_async`` and ``Job.execute_sync`` properties based on stored mode.
- Simplify ``execute_process`` function executed by Celery task into sub-step functions where applicable.
- Simplify forwarding of ``Job`` parameters between ``PyWPS`` service ``WorkerService.execute_job`` method and Celery task instantiating it by reusing the ``Job`` object.
- Provide corresponding ``Job`` log URL along already reported log file path to facilitate retrieval from server side.
- Avoid ``Job.progress`` updates following ``failed`` or ``dismissed`` statuses to keep track of the last real progress percentage that was reached when that status was set.
- Improve typing of database and store getter functions to infer correct types and facilitate code auto-complete.
- Implement ``Job`` dismiss operation ensuring pending or running tasks are removed and output result artifacts are removed from disk.
- Implement HTTP Gone (410) status from already dismissed ``Job`` when requested again or when fetching its artifacts.
- Remove the need for specific configuration to handle public/private output directory settings using
provided
X-WPS-Output-Context
header (fixes #110). - Fix retrieval of Pyramid
Registry
and application settings when available container is WerkzeugRequest
instead of PyramidRequest
, as employed by underlying HTTP requests in PyWPS service. - Allow
group
query parameter to handleJob
category listing withprovider
asservice
alias.
- Fix incorrectly configured API views for batch
Job
dismiss operation withDELETE /jobs
and corresponding endpoints forProcess
andProvider
paths. - Fix invalid
Job
links sometimes containing duplicate/
occurrences. - Fix invalid
Job
link URL foralternate
relationship.
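The contextual output location introduced in this release can be illustrated with a minimal sketch. It is not Weaver's implementation: the function name `resolve_output_location`, its parameters, and the assumption that each job's results are stored under a sub-directory named after the job ID are all hypothetical. It only shows the idea that the `X-WPS-Output-Context` header value (or the `weaver.wps_output_context` default when the header is omitted) is appended to both the WPS output path and URL.

```python
import posixpath
from typing import Optional, Tuple


def resolve_output_location(
    base_dir: str,
    base_url: str,
    job_id: str,
    header_context: Optional[str] = None,
    default_context: Optional[str] = None,
) -> Tuple[str, str]:
    """Return the (directory, URL) pair where a job's outputs would be stored.

    Hypothetical helper: the layout '<base>/<context>/<job_id>' is an assumption.
    """
    # The header value takes precedence; the configured default applies when it is omitted.
    context = (header_context or default_context or "").strip("/")
    if context:
        parts = context.split("/")
        # Reject empty or relative segments so the context cannot escape the base location.
        if any(part in ("", ".", "..") for part in parts):
            raise ValueError(f"invalid output context: {context!r}")
    else:
        parts = []
    segments = parts + [job_id]
    out_dir = posixpath.join(base_dir, *segments)
    out_url = "/".join([base_url.rstrip("/")] + segments)
    return out_dir, out_url
```

With this sketch, a request carrying `X-WPS-Output-Context: public/test` would place results under `<base_dir>/public/test/<job_id>` and report them under the matching URL, while a request without the header falls back to the configured default context, if any.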
4.2.1 (2021-10-20)
- Add more frequent
Job
updates of execution checkpoint pushed to database in order to avoid inconsistent statuses between the parentCelery
task and the underlying Application Package being executed, since both can update the sameJob
entry at different moments. - Add a
Job
log entry as"accepted"
on the API side before calling theCelery
task submission (Job
not yet picked up by a worker) in order to provide more detail between the submission time and initial execution time. This keeps the first log entry from being immediately set to "running", since both "started" and "running" statuses are remapped to "running" within the task to be compliant with OGC status codes.
- Fix an inconsistency between the final
Job
status and the reported "completed" message in logs due to a missing push of a newer state prior to re-fetching the latest Job
from the database.
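The status remapping mentioned in these notes can be sketched as follows. This is only an illustration under stated assumptions: the names `REPORTED_STATUS` and `reported_status` are hypothetical, and the only mapping taken from the notes is that the internal "started" status is reported as "running". Every other status is passed through unchanged here.

```python
# Hypothetical mapping from internally stored job statuses to reported statuses.
# Only the "started" -> "running" remap is taken from the release notes.
REPORTED_STATUS = {"started": "running"}


def reported_status(internal_status: str) -> str:
    """Return the status exposed to clients for an internally stored job status."""
    return REPORTED_STATUS.get(internal_status, internal_status)
```

Keeping the finer-grained status internally while reporting the remapped value is what lets the logs distinguish "accepted", "started" and "running" without exposing a status that is not part of the OGC naming conventions.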
4.2.0 (2021-10-19)
- Add execution endpoint
POST /provider/{id}/process/{id}/execution
corresponding to the OGC-API compliant endpoint for localProcess
definitions. - Add multiple additional relation
links
forProcess
andJob
responses (resolves #234 and #267). - Add convenience
DELETE /jobs
endpoint with input list ofJob
UUIDs in order todismiss
multiple entries simultaneously. This is useful for quickly removing a set of Job
entries returned by a filtered GET /jobs
contents. - Update conformance link list for
dismiss
and relevant relationlinks
definitions (relates to #53 and #267). - Add better support and reporting of
Job
statusdismissed
when operation is called from API on running task. - Use explicit
started
status whenJob
has been picked up by a Celery worker instead of leaving it toaccepted
(same status that indicates theJob
as "pending", although a worker is processing it). Early modification of status is done in case setup operations (send WPS request, prepare files, etc.) take some time, which would leave users under the impression that the Job
is not getting picked up. Report explicitrunning
status inJob
once it has been sent to the remote WPS endpoint. The API will reportrunning
in both cases in order to support OGC API - Processes naming conventions, but internalJob
status will have more detail. - Add
updated
timestamp toJob
response to better track latest milestones saved to database (resolves #249). This avoids users having to compare many fields (created
,started
,finished
) depending on latest status. - Apply stricter
Deploy
body schema validation and employ deserialized result directly. This ensures that preserved fields in the submitted content for deployment contain only known data elements with expected structures for respective schemas. Existing deployment bodies that contain invalid formats could start to fail or might generate inconsistent Process
descriptions if not adjusted. - Add improved reporting of erroneous inputs during
Process
deployment whenever possible to identify the cause. - Add more documentation details about missing features such as
EOImage
inputs handled by OpenSearch requests. - Add
weaver.celery
flag to internal application settings when auto-detecting that current runner iscelery
. This bypasses redundant API-only operations during application setup and startup not needed bycelery
worker.
- Fix OGC-API compliant execution endpoint
POST /process/{id}/execution
not registered in API. - Fix missing status for cancelled
Jobs
in order to properly supportdismiss
operation (resolves #145 and #228). - Fix all known OGC-specific link relationships with URI prefix (resolves #266).
- Fix incorrect rendering of some table cells in the documentation.
4.1.2 (2021-10-13)
- No change.
- Add
celery worker
task events flag (-E
) to Docker command (weaver-worker
) to help detect submitted delayed tasks when requesting job executions.
4.1.1 (2021-10-12)
- No change.
- Fix handling of default format field of WPS input definition incorrectly resolved as default data by
PyWPS
for Process that allows optional (minOccurs=0
) inputs ofComplex
type. Specific case is detected with relevant erroneous data and dropped silently because it should not be present (since omitted in WPS request) and should not generate a WPS input (relates to geopython/pywps#633). - Fix resolution of CWL field
default
value erroneously inserted as"null"
literal string for inputs generated from WPS definition to avoid potential confusion with valid"null"
input or default string. Default behaviour to drop or ignore omitted inputs is handled by "null"
withintype
field in CWL definitions. - Fix
Wps1Process
job runner for dispatched execution of WPS-1 Process assuming all provided inputs contain data or reference. Skip omitted optional inputs that are resolved withNone
value following above fixes. - Resolve execution failure of WPS-1 Process
ncdump
underhummingbird
Provider (fixes issue identified in output logs from notebook in PR pavics-sdi#230).
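The distinction between a "null" entry in the CWL `type` field and a "null" default string can be made concrete with a short sketch. The helper `cwl_input_type` is hypothetical and is not Weaver's conversion code; it assumes `min_occurs` and `max_occurs` are plain integers. It relies only on standard CWL conventions: an optional input is a type union containing "null", and a repeated input is an array type.

```python
from typing import Any, Dict, List, Union

CWLType = Union[str, Dict[str, Any], List[Any]]


def cwl_input_type(base_type: str, min_occurs: int = 1, max_occurs: int = 1) -> CWLType:
    """Build a CWL input type from WPS-style occurrence constraints (illustrative only)."""
    cwl_type: CWLType = base_type
    if max_occurs > 1:
        # Repeated inputs are represented by a CWL array type.
        cwl_type = {"type": "array", "items": base_type}
    if min_occurs == 0:
        # Optional inputs are marked in the type itself, not with a "null" default string.
        cwl_type = ["null", cwl_type]
    return cwl_type
```

For example, an optional `Complex` input (`minOccurs=0`) would map to the CWL type `["null", "File"]` with no `default` field at all, so omitting the input is valid and no literal "null" string is ever passed to the application.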
4.1.0 (2021-09-29)
- Improve reporting of mismatching Weaver configuration for Process and Application Package definitions that always require remote execution. Invalid combinations will raise an error during execution with a detailed description of the problem.
- Forbid Provider and applicable Process definitions to be deployed, executed or queried when corresponding remote execution is not supported according to Weaver instance configuration since Provider must be accessed remotely.
- Refactor endpoint views and utilities referring to Provider operations into appropriate modules.
- Apply
weaver.configuration = HYBRID
by default in example INI configuration since it is the most common use case. Apply same configuration by default in tests. Default resolution still employsDEFAULT
for backward compatibility in case the setting was omitted completely from a custom INI file. - Add query parameter
ignore
toGET /providers
listing in order to obtain full validation of remote providers (including XML contents parsing) to return200
. Invalid definitions will raise and return a[422] Unprocessable Entity
HTTP error. - Add more explicit messages about the problem that produced an error (XML parsing, unreachable WPS, etc.) and which caused request failure when attempting registration of a remote Provider.
- Fix reported
links
by processes nested under a providerService
. Generated URL references were omitting the/providers/{id}
portion. - Fix documentation referring to incorrect setting name in some cases for WPS outputs configuration.
- Fix strict XML parsing failing resolution of some remote WPS providers with invalid characters such as
<
,<=
within process description fields. Although invalid, those easily recoverable errors will be handled by the parser. - Fix resolution and execution of WPS-1 remote Provider and validate it against end-to-end test procedure from scratch Service registration down to results retrieval (fixes #340).
- Fix resolution of applicable Provider listing schema validation when none have been registered (fixes #339).
- Fix incorrect schema definition of Process items for
GET /processes
response that did not report the alternative identifier-only listing whendetail=false
query is employed. - Fix incorrect reporting of documented OpenAPI reference definitions for
query
parameters with same names shared across multiple endpoints. Fix is directly applied on relevant reference repository that generates OpenAPI schemas (see fmigneault/cornice.ext.swagger@70eb702). - Fix
weaver.exception
definitions such that raising them directly will employ the correspondingHTTPException
code (if applicable) to generate the appropriate error response automatically, without further handling. The order of class inheritance previously always resulted in 500
due toWeaverException
definition.
4.0.0 (2021-09-21)
- Apply conformance updates to better align with expected
ProcessDescription
schema from OGC API - Processes v1.0-draft6. The principal change introduced in this case is that process description contents will be directly at the root of the object returned by/processes/{id}
response instead of being nested under"process"
field. Furthermore,inputs
andoutputs
definitions are reported as mapping of{"<id>": {<parameters>}}
as specified by OGC-API instead of old listing format[{"id": "<id-value>", <key:val parameters>}]
. The old nested and listing format can still be obtained using request query parameterschema=OLD
, and will otherwise use OGC-API by default or whenschema=OGC
. Note that some duplicated metadata fields are dropped regardless of selected format in favor of OGC-API names. Some examples areabstract
that becomesdescription
,processVersion
that simply becomesversion
,mimeType
that becomesmediaType
, etc. Some of those changes are also reflected byProcessSummary
during listing of processes, as well as for corresponding provider-related endpoints (relates to #200). - Add backward compatibility support of some metadata fields (
abstract
,mimeType
, etc.) forDeploy
operation of pre-existing processes. When those fields are detected, they are converted inplace in favor of their corresponding new names aligned with OGC-API. - Update
mimeType
tomediaType
as format type representation according to OGC-API (relates to #211). - Add explicit pattern validation (
type/subtype
) of format string definitions withMediaType
schema. - Add sorting capability to generate mapping schemas for API responses using overrides of
properties
_sort_first
and_sort_after
using lists of desired ordered field names. - Improve naming of many ambiguous and repeated words across schema definitions that did not necessarily interact with each other although making use of similar naming convention, making their interpretation and debugging much more complicated. A stricter naming convention has been applied for consistent Deploy/Describe/Execute-related and Input/Output-related references.
- Replace
list_remote_processes
function by methodprocesses
under theService
instance. - Replace
get_capabilities
function by reusing and extending methodsummary
under theService
instance. - Improve generation of metadata and content validation of
Service
provider responses (relates to OGC #200 and #266). - Add query parameter
detail
to providers listing request to allow listing of names instead of their summary (similarly to the processes endpoint query parameter). - Add query parameter
check
to providers listing request to retrieve all registeredService
regardless of their URL endpoint availability at the moment the request is executed (less metadata is retrieved in that case). - Add
weaver.schema_url
configuration parameter andweaver.wps_restapi.utils.get_schema_ref
function to help generate$schema
definition and return reference to expected/provided schema in responses (relates to #157). Only utilities are added; not all routes provide the information yet. - Add validation of
schema
field underFormat
schema (as per opengeospatial/ogcapi-processes schema format.yml) such that only URL formatted strings are allowed, or alternatively an explicit JSON definition. Previous definitions that would indicate an empty string schema are dropped sinceschema
is optional. - Block unknown and
builtin
process types during deployment from the API (fixes #276). Typebuiltin
can only be registered by Weaver itself at startup. Other unknown types that have no indication for mapping to an appropriateProcess
implementation are preemptively validated. - Add parsing and generation of additional
literalDataDomains
for specification of WPS I/O data constraints and provide corresponding definitions in process description responses (fixes #41, #211, #297). - Add additional
maximumMegabyte
metadata detail toformats
of WPS I/O ofcomplex
type whenever available (requires geopython/OWSLib#796, futureOWSLIB==0.26.0
release).
- Revert an incorrectly removed schema deserialization operation during generation of the
ProcessSummary
employed for populating process listing. - Revert an incorrectly modified schema reference that erroneously replaced service provider
ProcessSummary
items during their listing by a singleProcessInputDescriptionSchema
(introduced since3.0.0
). - Fix #203 with explicit validation test of
ProcessSummary
schema for providers response. - Fix failing
minOccurs
andmaxOccurs
generation from a remote providerProcess
to support OGC-API format (relates to #263). - Fix schema references and apply deserialization to providers listing request.
- Fix failing deserialization of
variable
children schema under mapping when this variable element is allowed to be undefined (i.e.: defined withmissing=drop
). Allows support of emptyinputs
mapping of OGC-API representation ofProcessDescription
that permits such processes (constant or random output generator). - Fix some invalid definitions of execution inputs schemas under mapping with
value
sub-schema where key-based input IDs (usingadditionalProperties
) where replaced by the variable<input-id>
name instead of their original names in the request body (from #265 since3.4.0
). - Fix parsing error raised from
wps_processes.yml
configuration file when it can be found but contains neither aprocesses
norproviders
section. Also, apply more validation of specifiedname
values. - Fix parsing of
request_extra
function/setting parameters for specifically zero values corresponding toretries
andbackoff
options that were ignored. - Fix incorrect parsing of
default
field within WPS input whenliteral
data type is present and was assumed ascomplex
(fixes #297). - Fix and test various invalid schema deserialization validation issues, notably regarding
PermissiveMappingSchema
, schema nodesExtendedFloat
,ExtendedInt
and their handling strategies when combined in mappings or keywords. - Fix resolution of similar values that could be implicitly converted between
ExtendedString
,ExtendedFloat
,ExtendedInt
andExtendedBool
schema types to guarantee that explicitly defined original data types are preserved. - Fix
runningSeconds
field reporting to be offloat
type although implicitint
type conversion could occur. - Fix validation of
Execute
inputs schemas to adequately distinguish between optional inputs and incorrect formats. - Fix resolution of
Accept-Language
negotiation forwarded to local or remote WPS process execution. - Fix XML security issue flagged within dependencies to
PyWPS
andOWSLib
by pinning requirements to versionspywps==4.5.0
andowslib==0.25.0
, and apply the same fix in Weaver code (see following for details: geopython/pywps#616, geopython/pywps#618, geopython/pywps#624, CVE-2021-39371).
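The change of process description format in this release can be sketched with a small converter. This is a hedged illustration, not Weaver's code: `rename_fields`, `io_list_to_mapping` and `old_to_ogc` are hypothetical helpers, and `RENAMED_FIELDS` only contains the three renames explicitly listed in the notes above (other renamed fields are not covered).

```python
from typing import Any, Dict, List

# Only the renames named in the release notes; the real list may be longer.
RENAMED_FIELDS = {"abstract": "description", "processVersion": "version", "mimeType": "mediaType"}


def rename_fields(data: Any) -> Any:
    """Recursively rename old metadata keys to their OGC-API names (returns a copy)."""
    if isinstance(data, dict):
        return {RENAMED_FIELDS.get(key, key): rename_fields(value) for key, value in data.items()}
    if isinstance(data, list):
        return [rename_fields(item) for item in data]
    return data


def io_list_to_mapping(items: List[Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Convert [{"id": "<id>", ...}] into {"<id>": {...}}."""
    mapping = {}
    for item in items:
        params = dict(item)  # shallow copy so the input item keeps its "id"
        io_id = params.pop("id")
        mapping[io_id] = params
    return mapping


def old_to_ogc(old_description: Dict[str, Any]) -> Dict[str, Any]:
    """Un-nest the "process" field and convert I/O listings to mappings."""
    process = rename_fields(old_description["process"])
    for key in ("inputs", "outputs"):
        process[key] = io_list_to_mapping(process.get(key, []))
    return process
```

A client written against the old format could apply such a conversion on its side, or simply keep requesting `schema=OLD` until it is migrated.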
3.5.0 (2021-08-19)
- No change.
- Fix
weaver.datatype
objects auto-resolution of fields using either attributes (accessed asdict
) or properties (accessed asclass
) to ensure correct handling of additional operations on them. - Fix
DuplicateKeyError
that could sporadically arise during initialprocesses
storage creation whenbuiltin
processes get inserted/updated on launch by parallel worker/threads running the application. Operation is relaxed only for defaultbuiltin
to allow equivalent process replacement (upsert
) instead of only explicit inserts, as they should be pre-validated for duplicate entries, and only new definitions should be registered during this operation (fixes #246).
3.4.0 (2021-08-11)
- Add missing processID detail in job status info response (relates to #270).
- Add support for inputs under mapping for inline values and arrays in process execution (relates to #265).
- Fix copy of headers when generating the WPS clients created for listing providers capabilities and processes.
3.3.0 (2021-07-16)
- Add support for array type as job inputs (relates to #233).
- Remove automatic conversion of falsy/truthy
string
andinteger
type definitions toboolean
type to align with OpenAPIboolean
type definitions. Non-explicit boolean
values will not be automatically converted tobool
anymore. They will require explicitfalse|true
values.
- Fix
minOccurs
andmaxOccurs
representation according to OGC-API (fixes #263). - Fix the format of the output file URL. When the prefix
/
was not present, URL was incorrectly handled by not prepending the required base URL location.
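The stricter boolean handling described in this release can be illustrated with a minimal sketch. The function `require_boolean` is hypothetical and does not reproduce Weaver's schema validation; it only shows the behaviour implied by the note, namely that truthy or falsy strings and integers are no longer accepted in place of explicit JSON `true` or `false` values.

```python
import json
from typing import Any


def require_boolean(value: Any) -> bool:
    """Accept only explicit booleans; reject truthy/falsy strings and integers."""
    # isinstance(1, bool) is False, so integers are rejected even though bool subclasses int.
    if isinstance(value, bool):
        return value
    raise TypeError(f"expected explicit true/false, got {value!r}")


# JSON 'true' decodes to a Python bool, while 1 and "true" do not.
payload = json.loads('{"flag": true, "count": 1, "text": "true"}')
```

Here `require_boolean(payload["flag"])` succeeds, whereas `payload["count"]` and `payload["text"]` raise an error instead of being silently converted.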
3.2.1 (2021-06-08)
- No change.
- Fix backward compatibility of pre-deployed processes that did not define
jobControlOptions
that is now required. Missing definitions are substituted in place by the default ["execute-async"]
mode.
3.2.0 (2021-06-08)
- Add reference link to ReadTheDocs URL of Weaver in API landing page.
- Add references to OGC-API Processes requirements and recommendations for eventual conformance listing (relates to #231).
- Add
datetime
query parameter for job search queries (relates to #236). - Add
limit
query parameter validation and integration for job retrieval queries (relates to #237).
- Pin
pywps==4.4.3
and fix incompatibility introduced by its refactor of I/O base classes in #602 (specifically commit 343d825), which broke theComplexInput
work-around to avoid useless download of file URLs (see issue #526). - Fix default execution mode specification in process job control options (fixes #182).
- Fix old OGC-API WPS REST bindings link in landing page for the more recent OGC-API Processes specification.
- Fix invalid deserialization of schemas using
not
keyword that would result in all fields returned instead of limiting them to the expected fields from the schema definitions forLiteralInputType
in process description. - Adjust
InputType
andOutputType
schemas to useallOf
instead ofanyOf
definition since all sub-schemas that define them must be combined, with their respectively required or optional fields.
3.1.0 (2021-04-23)
- Add caching of remote WPS requests according to
request-options.yml
and request headerCache-Control
to allow reduced query of pre-fetched WPS client definition. - Add
POST /processes/{}/execution
endpoint that mimics its jobs counterpart to respect OGC-API Processes updates (see issue opengeospatial/ogcapi-processes#124 and PR opengeospatial/ogcapi-processes#159, resolves #235). - Add OpenAPI schema examples for some of the most common responses.
- Add missing schema definitions for WPS XML requests and responses.
- Improve schema self-validation with their specified default values.
- Add explicit options usage and expected parsing results for all test variations of OpenAPI schemas generation and
colander
object arguments for future reference intests.wps_restapi.test_colander_extras
.
- Fix erroneous tags in job inputs schemas.
- Fix handling of deeply nested schema validator raising for invalid format within optional parent schema.
- Fix retrieval of database connection from registry reference.
- Fix test mock according to installed
pyramid
version to avoid error with modified mixin implementations.
3.0.0 (2021-03-16)
- Provide HTTP links to corresponding items of job in JSON body of status, inputs and outputs routes (#58, #86).
- Provide
Job.started
datetime and calculateJob.duration
from it to indicate the duration of the process execution instead of counting from the time the job was submitted (i.e.:Job.created
). - Provide OGC compliant
<job-uri>/results
response schema as well as some expectedcode
/description
fields in case where the request fails. - Add
<job-uri>/outputs
providing thedata
/href
formatted job results as well as<job-uri>/inputs
to retrieve the inputs that were provided during job submission (#86). - Deprecate
<job-uri>/result
paths (indicated in OpenAPI schemas and UI) in favor of<job-uri>/outputs
which provides the same structure with additionallinks
references (#58). Result path requests are redirected automatically to outputs. - Add more reference/documentation links to WPS-1/2 and update conformance references (#53).
- Add some minimal caching support of routes.
- Adjust job creation route to return
201
(created) as it is now correctly defined by the OGC API specification (#14). - Add
Job.link
method that auto-generates all applicable links (inputs, outputs, logs, etc.). - Add
image/jpeg
,image/png
,image/tiff
formats to supportedweaver.formats
(relates to #100). - Handle additional trailing slash resulting in
HTTPNotFound [404]
to automatically resolve to corresponding valid route without the slash when applicable. - Provide basic conda environment setup through
Makefile
for Windows bash-like shell (i.e.: MINGW
/MINGW64
). - Update documentation for minimal adjustments needed to run under Windows.
- Update OpenAPI template to not render the useless version selector since we only provide the current version.
- Update Swagger definitions to reflect changes and better reuse existing schemas.
- Update Swagger UI to provide the ReadTheDocs URL.
- Add crim-ca/cwltool@docker-gpu as
cwltool
requirement to allow processing of GPU-enabled dockers with nvidia-docker. - Add fmigneault/cornice.ext.swagger@openapi-3
as
cornice_swagger
requirement to allow OpenAPI-3 definitions support of schema generation and deserialization validation of JSON payloads. - Disable default auto-generation of
request-options.yml
andwps_processes.yml
configuration files from a copy of their respective.example
files as these contain many demo (and invalid) values that fail real execution of tests when no actual file was provided. - Add per-request caching support when using
request_extra
function, and caching control according to request headers andrequest-options.yml
configuration.
- Fix
weaver.config.get_weaver_config_file
called with empty path to be resolved just as requesting the default file path explicitly instead of returning an invalid directory. - Fix CWL package path resolution under Windows that incorrectly parsed the drive partition as a URL protocol.
- Fix
AttributeError
ofpywps.inout.formats.Format
equality check compared tonull
object (using getter patch onnull
since fix #507 not released at this point). - Fix potential invalid database state that could have saved an invalid process although the following
ProcessSummary
schema validation would fail and returnHTTPBadRequest [400]
. The process is now saved only after complete and successful schema validation.
2.2.0 (2021-03-03)
- Add
weaver.wps.utils.get_wps_client
function to handle the creation ofowslib.wps.WebProcessingService
client with appropriate request options configuration from application settings.
- Fix job percent progress reported in logs to be more consistent with actual execution of the process (fixes #90).
- Fix Job duration that kept incrementing after its execution failed due to a raised error (fixes #222).
- Improve race condition handling of
builtin
process registration at application startup.
2.1.0 (2021-02-26)
- Ensure that configuration file definitions specified in
processes
andproviders
will override older database definitions respectively matched byid
andname
when starting Weaver if other parameters were modified. - Support dynamic instantiation of WPS-1/2 processes from remote WPS providers to accomplish job execution.
- Remove previously flagged duplicate code to handle
OWSLib
processes conversion toJSON
for OGC-API. - Replace
GET
HTTP request byHEAD
for MIME-type check againstIANA
definitions (speed up). - Improve handling of CWL input generation in combination with
minOccurs
,maxOccurs
,allowedValues
anddefault
empty ("null"
) value from WPS process from remote provider (fix #17). - Add
HYBRID
mode that allows Weaver to simultaneously run local Application Packages and remote WPS providers. - Rename
ows2json_output
toows2json_output_data
to emphasise its usage for parsing job result data rather than simple output definition as accomplished byows2json_io
. - Remove function duplicating operations accomplished by
ows2json_io
(previously marked with FIXME). - Improve typing definitions for CWL elements to help identify invalid parsing methods during development.
- Improve listing speed of remote providers that require data fetch when some of them might have become unreachable.
- Avoid failing WPS-1/2 processes conversion to corresponding OGC-API process if metadata fields are omitted.
- Fix invalid function employed for
GET /providers/{prov}/processes/{proc}
route (some error handling was bypassed).
2.0.0 (2021-02-22)
- Add support of YAML format for loading
weaver.data_sources
definition. - Pre-install
Docker
CLI inworker
image to avoid bad practice of mounting it from the host. - Adjust WPS request dispatching such that process jobs get executed by
Celery
worker as intended (see #21 and #126). - Move WPS XML endpoint functions under separate
weaver.wps.utils
andweaver.wps.views
to remove the need to constantly handle circular import issues due to processing-related operations that share some code. - Move core processing of job operation by
Celery
worker underweaver.processes.execution
in order to separate those components from functions specific for producing WPS-REST API responses. - Handle WPS-1/2 requests submitted by GET KVP or POST XML request with
application/json
inAccept
header to return the same body content as if directly calling their corresponding WPS-REST endpoints. - Remove
request
parameter of every database store method since it was neither used nor provided most of the time. - Change all forbidden access responses related to visibility status to return
403
instead of401
. - Add more tests for Docker applications and test suite execution with GitHub Actions.
- Add more details in sample configurations and provide an example
docker-compose.yml
configuration that defines a typical Weaver API / Worker combination withdocker-proxy
for sibling container execution. - Add captured
stdout
andstderr
details in job log following CWL execution error when retrievable. - Document the WPS KVP/XML endpoint within the generated OpenAPI specification.
- Disable auto-generation of
request_options.yml
file from corresponding empty example file and allow application to start if no such configuration was provided. - Remove every Python 2 backward compatibility references and operations.
- Drop Python 2 and Python 3.5 support.
- Target
PyWPS-4.4
to resolve multiple invalid dependency requirements breaking installed packages over builtin Python packages and other compatibility fixes (see geopython/pywps #568). - Fix retrieval of database connection to avoid warning of
MongoClient
opened before fork of processes. - Fix indirect dependency
oauthlib
missing fromesgf-compute-api
(cwt
) package. - Fix inconsistent
python
reference resolution ofbuiltin
applications when executed locally and in tests (using virtual/conda environment) compared to within Weaver Docker image (using OS python). - Fix many typing definitions.
1.14.0 (2021-01-11)
- Add
data
input support for CWL Workflow step referring to WPS-3 Process. - Add documentation example references to Application Package and Process
Deploy
/Execute
repositories. - Add parsing of
providers
inwps_processes.yml
to directly register remote WPS providers that will dynamically fetch underlying WPS processes, instead of static per-service processes stored locally. - Add field
visible
towps_processes.yml
entries to allow directly defining the registered processes visibility. - Adjust response of remote provider processes to return the same format as local processes.
- Fix
stdout
/stderr
log file not permitted directly within CWL Workflow (must be inside intermediate steps). - Fix missing S3 bucket location constraint within unittests.
1.13.1 (2020-07-17)
- No change.
- Create an
stdout.log
orstderr.log
file in casecwltool
hasn't created it.
1.13.0 (2020-07-15)
- Add AWS S3 bucket support for process input reference files.
- Add
weaver.wps_output_s3_bucket
setting to upload results to AWS S3 bucket instead of local directory. - Add
weaver.wps_output_s3_region
setting to allow override parameter extracted from AWS profile otherwise. - Add more documentation about supported file reference schemes.
- Add documentation references to ESGF-CWT Compute API.
- Add conditional input file reference fetching (depending on ADES/EMS, process type from CWL
hints
) to take advantage of request-options and all scheme formats supported by Weaver, instead of relying on PyWPS and/or CWL regardless of how far downstream the URL reference was reaching.
- Adjust some docstrings to better indicate raised errors.
- Adjust
weaver.processes.wps_package.WpsPackage
to use its internal logger when running the process in order to preserve log entries under its job execution. They were otherwise lost over time across all process executions.
1.12.0 (2020-07-03)
- Add multiple CWL ESGF processes and workflows, namely
SubsetNASAESGF
,SubsetNASAESGF
and many more. - Add tests for ESGF processes and workflows.
- Add documentation for
ESGF-CWTRequirement
processes. - Add
file2string_array
andmetalink2netcdf
builtins. - Add
esgf_process
Wps1Process
extension, to handleESGF-CWTRequirement
processes and workflows.
- Reset
MongoDatabase
connection when we are in a forked process.
1.11.0 (2020-07-02)
- Generate Weaver OpenAPI specification for ReadTheDocs publication.
- Add some sections for documentation (#61).
- Add support of documentation RST file redirection to generated HTML for reference resolution in both GitHub source and ReadTheDocs served pages.
- Improve documentation links, ReadTheDocs format and TOC references.
- Avoid logging
stdout/stderr
in workflows. - Add tests to make sure processes
stdout/stderr
are logged. - Remove Python 2.7 version as not officially supported.
- Move and update WPS status location and status check functions into
weaver.wps
module.
- Fix reported WPS status location to handle when starting with
/
although not representing an absolute path.
1.10.1 (2020-06-03)
- No change.
- Pin
celery==4.4.2
to avoid import error on missingfutures.utils
called internally in following versions.
1.10.0 (2020-06-03)
- Add support of value-typed metadata fields for process description.
- Enforce
rel
field when specifying anhref
JSON link to match corresponding XML requirement.
- Add more examples of supported WPS endpoint metadata (fixes #84).
1.9.0 (2020-06-01)
- Add
weaver.wps_workdir
configuration setting to define the location where the underlyingcwltool
application should be executed under. This can allow more control over the scope of the mounted volumes for Application Package running a docker image. - Add mapping of WPS results from the
Job
's UUID to generated PyWPS UUID for outputs, status and log locations. - Add experimental configuration settings
weaver.cwl_euid
andweaver.cwl_egid
to provide effective user/group identifiers to employ when running the CWL Application Package. Using these requires good control of the directory and process I/O locations as invalid permissions could break a previously working job execution. - Add more logging configuration and apply them to
cwltool
before execution of Application Package. - Enforce
no_match_user=False
andno_read_only=False
ofcwltool
'sRuntimeContext
to ensure that docker application is executed with same user asweaver
and that process input files are not modified inplace (readonly) where potentially inaccessible (according to settings). Definition of CWL package will need to add InitialWorkDirRequirement as per defined by reference specification to stage those files if they need to be accessed with write permissions (see: example). Addresses some issues listed in #155. - Enforce removal of some invalid CWL hints/requirements that would break the behaviour offered by
Weaver
. - Use
weaver.request_options
for WPS GetCapabilities and WPS Check Status requests under the running job. - Change default
DOCKER_REPO
value defined inMakefile
to point to reference mentioned inREADME.md
and considered as official deployment location. - Add
application/x-cwl
MIME-type supported with updatedEDAM 1.24
ontology. - Add
application/x-yaml
MIME-type to known formats. - Add
application/x-tar
andapplication/tar+gzip
MIME-type (not official) but resolved as synonymapplication/gzip
(official) to preserve compressed file support during CWL format validation.
- Set get_cwl_file_format default argument must_exist=True instead of False to retrieve original default behaviour of the function. Since CWL usually doesn't need to add File.format field when no corresponding reference actually exists, this default also makes more sense.
1.8.1 (2020-05-22)
- Add Travis-CI smoke test of built docker images for early detection of invalid setup or breaking code to boot them.
- Add Travis-CI checks for imports. This check was not validated previously although available.
- Adjust weaver.ini.example to reflect working demo server configuration (employed by smoke test).
- Move weaver web application to weaver.app to reduce chances of breaking setup.py installation from import errors due to weaver dependencies not yet installed. Redirect to new location makes this change transparent when loaded with the usual weaver.ini configuration.
- Fix base docker image to install Python 3 development dependencies in order to compile requirements with expected environment Python version. Package python-dev for Python 2 was being installed instead.
- Fix failing docker image boot due to incorrectly placed yaml import during setup installation.
- Fix imports according to Makefile targets check-imports and fix-imports.
- Fix parsing of PyWPS metadata to correctly employ values provided by weaver.ini.
1.8.0 (2020-05-21)
- Modify weaver.utils.request_retry to weaver.utils.request_extra to include more requests functionality and reuse it across the whole code base.
- Add requests_extra SSL verification option using specific URL regex(es) matches from configuration settings.
- Add file:// transport scheme support directly to utility requests_extra to handle local file paths.
- Add file weaver.request_options INI configuration setting to specify per-request method/URL options.
- Add requests_extra support of Retry-After response header (if any available on 429 status) which indicates how long to wait until next request to avoid automatically defined response right after.
- Add weaver.wps_workdir configuration setting which allows setting the corresponding pywps.workdir directory.
- Modify Dockerfile-manager to run web application using pserve as gunicorn doesn't correctly handle worker options anymore when loaded from weaver.ini with --paste argument. Also simplifies the command which already required multiple patches such as reapplying the host/port binding from INI file.
- Fix handling of Literal Data I/O type when retrieved from OWSLib.wps object with remote WPS XML body.
- Adjust make start target to use new make install-run target which installs the dependencies and package in editable mode so that configuration files present locally can be employed for running the application. Previously, one would have to move their configurations to the site-packages install location of the active Python.
- Fix celery>4.2 not found because of application path modification.
- Fix invalid handling of wps_processes.yml reference in weaver.ini when specified as relative path to configuration directory.
- Fix handling of WPS<->CWL I/O merge of data_format field against supported_formats with pywps>=4.2.4.
- Fix installation of yaml-related packages for Python 2 backward compatibility.
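A minimal sketch of the Retry-After handling mentioned in this release, assuming the header carries either a number of seconds or an HTTP date, as allowed by the HTTP specification. The function name retry_after_seconds and its arguments are hypothetical and are not taken from the Weaver code base.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def retry_after_seconds(status_code, headers, now=None):
    """Return the delay (in seconds) requested by a 429 response, or None.

    Hypothetical helper: 'headers' is a plain dict, so the lookup below is
    case-sensitive (real HTTP clients usually offer case-insensitive headers).
    """
    if status_code != 429:
        return None
    value = headers.get("Retry-After")
    if value is None:
        return None
    value = value.strip()
    # First form: delay expressed directly as a number of seconds.
    if value.isdigit():
        return float(value)
    # Second form: an HTTP date after which the request can be retried.
    try:
        when = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        # Unparsable header value: let the caller apply its own backoff.
        return None
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    # Never return a negative delay if the date is already in the past.
    return max(0.0, (when - now).total_seconds())
```

A caller would typically sleep for the returned delay before the next attempt and fall back to its own backoff interval when None is returned.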
1.7.0 (2020-05-15)
- Add additional status log for EOImage input modification with OpenSearch during process execution.
- Add captured stderr/stdout logging of underlying CWL application being executed to resulting Job logs (addresses first step of #131).
- Use weaver.utils.request_retry in even more places and extend convenience arguments offered by it to adapt it to specific use cases.
- Fix handling of WPS-REST output matching a JSON file for multiple-output format specified with a relative local path as specified by job output location. Only remote HTTP references were correctly parsed. Also avoid failing the job if the reference JSON parsing fails. It will simply return the original reference URL in this case without expanded data (relates to #25).
- Fix CWL job logs to be timezone aware, just like most other logs that will report UTC time.
- Fix JSON response parsing of remote provider processes.
- Fix ordered parsing of CWL I/O when specified as shorthand "<id>": "<type>" directly under the inputs or outputs dictionary instead of extended JSON object variant such as {"input": {"type": "<type>", "format": [...]}} (fixes #137).
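The two CWL I/O notations named in the last fix can be illustrated with a short sketch that normalizes both into one ordered list. The function normalize_cwl_io is a hypothetical illustration, not the parser used by Weaver, and it relies on Python 3.7+ dictionaries preserving insertion order.

```python
def normalize_cwl_io(io_section):
    """Convert a CWL inputs/outputs section into an ordered list of I/O objects.

    Hypothetical helper accepting either a list of objects with an 'id',
    or a mapping of '<id>' to a shorthand type or an extended object.
    """
    if isinstance(io_section, list):
        # Already in list form: copy each item to avoid mutating the source.
        return [dict(item) for item in io_section]
    normalized = []
    for io_id, definition in io_section.items():
        if isinstance(definition, dict):
            # Extended variant: {"<id>": {"type": "<type>", "format": [...]}}
            item = dict(definition)
        else:
            # Shorthand variant: {"<id>": "<type>"}
            item = {"type": definition}
        item["id"] = io_id
        normalized.append(item)
    return normalized


inputs = {
    "message": "string",
    "image": {"type": "File", "format": ["iana:image/tiff"]},
}
# The order of the original mapping is kept in the resulting list.
print([item["id"] for item in normalize_cwl_io(inputs)])
```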
1.6.0 (2020-05-07)
- Reuse weaver.utils.request_retry function across a few locations that were essentially reimplementing the core functionality.
- Add even more failure-permissive request attempts when validating a MIME-type against IANA website.
- Add auto-resolution of common extensions known under PyWPS as well as employing their specific encoding.
- Add geotiff format type support via PyWPS (#100).
- Make WPS status check more resilient to failing WPS outputs location not found in case the directory path can be resolved to a valid local file representing the XML status (i.e.: don't depend as much on the HTTP WPS output route).
- Ensure backward support of generic/default text/plain I/O when extracted from a referenced WPS-1/2 XML remote process which provides insufficient format details. For CWL output generated from it, replace the glob pattern to match anything (<id>.*) instead of <id>.txt extracted from text/plain to simulate MIME-type as */*. Issue log warning message for future use cases.
- Fix invalid AllowedValue parsing when using LiteralData inputs that resulted in AnyValue being parsed as a "None" string. This was transparent in case of string inputs and breaking for other types like integer when they attempted conversion.
- Fix erroneous Metadata keywords passed down to owslib.wps.Metadata objects in case of more verbose details not allowed by this implementation.
- Fix parsing of explicitly-typed optional array CWL I/O notation that was not considered (i.e.: using type as list with additional "null" instead of type: "<type>?" shorthand).
- Fix parsing of MIME-type from format field to exclude additional parameters (e.g.: ; charset=UTF-8) for remote IANA validation.
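The optional-type notations mentioned above (a type list containing "null" versus the "<type>?" shorthand) can be sketched as follows. The function parse_cwl_type is a hypothetical illustration of the equivalence between the two forms, not Weaver's actual implementation.

```python
def parse_cwl_type(cwl_type):
    """Return a (base_type, is_optional) tuple for a CWL type definition.

    Hypothetical helper covering the string shorthand, the explicit list
    with "null", and nested object definitions such as arrays.
    """
    if isinstance(cwl_type, str):
        # Shorthand: a trailing "?" marks the type as optional.
        if cwl_type.endswith("?"):
            return cwl_type[:-1], True
        return cwl_type, False
    if isinstance(cwl_type, list):
        # Explicit form: "null" listed among the allowed types.
        non_null = [item for item in cwl_type if item != "null"]
        is_optional = len(non_null) != len(cwl_type)
        base_type = non_null[0] if len(non_null) == 1 else non_null
        return base_type, is_optional
    # Object form, e.g. {"type": "array", "items": "string"}: required as-is.
    return cwl_type, False


# Both notations describe an optional array of strings.
print(parse_cwl_type("string[]?"))
print(parse_cwl_type(["null", {"type": "array", "items": "string"}]))
```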
1.5.1 (2020-03-26)
- Add unittest of utility function fetch_file.
- Split some unittest utility functions to allow more reuse.
- Fix invalid retry parameter not handled automatically by request.
1.5.0 (2020-03-25)
- Adjust incorrectly parsed href file reference as WPS complex input which resulted in failing location retrieval.
- Partially address unnecessary fetch of file that has to be passed down to CWL, which will in turn request the file as required. Need update from PyWPS to resolve completely (#91, geopython/pywps#526).
- Adjust WPS output results to use relative HTTP path in order to recompose the output URL if server settings change.
- Support WPS output results as value (WPS literal data). Everything was considered an href file beforehand.
- Add additional timeout and retry during fetching of remote file for process jsonarray2netcdf to avoid unnecessary failures during edge case connection problems.
- Add support of title and version field of builtin processes.
- Patch builtin process execution failing since cwltool 2.x update.
- Avoid long fetch operation using streamed request that defaulted to chunk size of 1. Now, we use an appropriate size according to available memory.
1.4.0 (2020-03-18)
- Update owslib to 0.19.2
- Drop support for python 3.5
1.3.0 (2020-03-10)
- Provide a way to override the external URL reported by WPS-1/2 and WPS-REST via configuration settings allowing for more advanced server-side results in response bodies.
1.2.0 (2020-03-06)
- Add WPS languages for other WPS request types: DescribeProcess and GetCapabilities.
- Fix a bug where the validation of OneOf items was casting the value to the first valid possibility.
1.1.0 (2020-02-17)
- Simplify docker image generation and make base/manager/worker variants all available under the same docker repo docker-registry.crim.ca/ogc/weaver with different tags (#5).
- Add planned future support of Accept-Language header for WPS-1/2 (geopython/OWSLib 0.20.0) (#74).
- Improved job logs update with message and progress to allow better tracking of internal operations and/or problems.
- Allow WPS builtin process jsonarray2netcdf to fetch a remote file.
- Change doc to point to DockerHub pavics/weaver images.
- Adjust CI rule long-lasting failures until it gets patched by original reference (gitleaks-actions#3).
- Fix readthedocs documentation generation.
- Fix .travis docker image build condition.
- Fix geopython/OWSLib>=0.19.1 requirement for Python 3.8 support (#62).
- Fix job update filling due to status location incorrectly resolved according to configured PyWPS output path.
1.0.0 (2020-01-28)
- Add notification_email field to Job datatype that stores an encrypted email (according to settings) when provided in the job submission body (#44).
- Add ability to filter jobs with notification_email query parameter (#44).
- Add jobs statistics grouping by specific fields using comma-separated list groups query parameter (#46).
- Add some tests to evaluate new job search methods / grouping results and responses (#44, #46).
- Add handling of multiple CWL field format for File type.
- Add missing ontology reference support for CWL field format by defaulting to IANA namespace.
- Add support for I/O array of enum (i.e.: multiple values of AllowedValues for a given input) (#30).
- Add support of label synonym as title for inputs and process description (CWL specifying a label will set it in WPS process) (#31).
- Add support of input minOccurs and maxOccurs as int while maintaining str support (#14).
- Add conformance route with implementation links (#53).
- Add additional landing page link details (#54).
- Add weaver.wps_restapi.colander_extras.DropableNoneSchema to auto-handle some schema JSON deserialization.
- Add weaver.wps_restapi.colander_extras.VariableMappingSchema to auto-handle some schema JSON deserialization.
- Add more functional tests (#11, #17).
- Use bump2version and move all config under setup.cfg.
- Remove enforced text/plain for CWL File when missing format field.
field. - Replace bubbling up of too verbose unhandled exceptions (500 Internal Server Error) by summary message and additional internal logging for debugging the cause using an utility exception log decorator.
- Use the same exception log decorator to simplify function definitions when HTTP exceptions are already handled.
- Make null reference a singleton so that multiple instantiation calls all refer to the same instance and produce the expected behaviour of <x> is null instead of hard-to-identify errors because of English syntax.
- Remove unused function weaver.utils.replace_caps_url and corresponding tests.
- Remove weaver.processes.utils.jsonify_value duplicated by weaver.processes.wps_package.complex2json.
- Use more JSON body schema validation using API schema definitions deserialization defined by weaver.datatype.
- Enforce builtin processes registration on startup to receive applicable updates.
- Provide 2 separate docker images for Weaver manager and worker, corresponding to the EMS/ADES API and the celery job runner respectively.
- Update Apache license.
- Adjust some typing definitions incorrectly specified.
- Fix some failing functionality tests (#11, #17).
- Fix I/O field ordering preserved as specified in payload or loaded reference file.
- Fix setting minOccurs=0 when a default is specified in the corresponding CWL I/O (#17, #25).
- Fix incorrectly overridden maxOccurs="unbounded" by maxOccurs="1" when a partial array input definition is specified without explicit maxOccurs in WPS payload (#17, #25).
- Fix case where omitted format[s] in both CWL and WPS deploy bodies generated a process description with complex I/O (file) without required formats field. Default text/plain format is now automatically added.
- Fix case where format[s] lists between CWL and WPS were incorrectly merged.
- Fix metadata field within a WPS I/O incorrectly parsed when provided by a WPS-1/2 XML process definition.
- Fix invalid JSON response formatting on failing schema validation of process deployment body.
- Fix docker images to support pserve when using gunicorn>=20.x dropping support of --paste config feature.
- Fix multiple Python 2/3 compatibility issues.
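The singleton behaviour described for the null reference in this release can be sketched as below. The class name NullType and its methods are assumptions made for illustration and may differ from the definition found in Weaver.

```python
class NullType(object):
    """Singleton marker: every instantiation returns the same object."""

    _instance = None

    def __new__(cls):
        # Create the unique instance on first call, then always reuse it.
        if cls._instance is None:
            cls._instance = super(NullType, cls).__new__(cls)
        return cls._instance

    def __repr__(self):
        return "<null>"

    def __bool__(self):
        # Assumption for this sketch: the marker evaluates as false.
        return False


null = NullType()

# Identity checks are reliable because there is only one instance.
value = NullType()
print(value is null)
```

With a single shared instance, a check written as "x is null" behaves the same whether x came from the module-level name or from a new instantiation.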
0.2.2 (2019-05-31)
- Support notification email subject template.
0.2.1 (2019-05-29)
- Add per-process email notification template.
0.2.0 (2019-03-26)
- Fixes to handle invalid key characters "$" and "." during CWL package read/write operations to database.
- Fixes some invalid CWL package generation from WPS-1 references.
- More cases handled for WPS-1 to CWL WPS1Requirement conversion (AllowedValues, Default, SupportedFormats, minOccurs, maxOccurs).
- Add file format validation to generated CWL package from WPS-1 MIME-types.
- Allow auto-deployment of WPS-REST processes from WPS-1 references specified by configuration.
- Add many deployment and execution validation tests for WPS1Requirement.
- Add builtin application packages support for common operations.
0.1.3 (2019-03-07)
- Add useful Makefile targets for deployment.
- Add badges indications in README.rst for tracking from repo landing page.
- Fix security issue of PyYAML requirement.
- Fix some execution issues for Wps1Process.
- Fix some API schema erroneous definitions.
- Additional logging of unhandled errors.
- Improve some typing definitions.
0.1.2 (2019-03-05)
- Introduce WPS1Requirement and corresponding Wps1Process to run a WPS-1 process under CWL.
- Remove mongodb requirement, assume it is running on an external service or docker image.
- Add some typing definitions.
- Fix some problematic imports.
- Fix some PEP8 issues and PyCharm warnings.
0.1.1 (2019-03-04)
- Modify Dockerfile to use lighter debian:latest instead of birdhouse/bird-base:latest.
- Modify Dockerfile to reduce build time by reusing built image layers (requirements installation mostly).
- Make some buildout dependencies optional to also reduce build time and image size.
- Some additional stripping of deprecated or invalid items from Twitcher.
0.1.0 (2019-02-26)
- Initial Release. Based off Twitcher tag ogc-0.4.7.