diff --git a/docs/source/index.rst b/docs/source/index.rst index 73116958e..ee0c240b8 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -2,7 +2,7 @@ Welcome to Aryn! ================ Aryn is an LLM-powered data preparation, processing, and analytics system for complex, unstructured documents like PDFs, HTML, presentations, and more. With Aryn, you can prepare data for GenAI and RAG applications, power high-quality document processing workflows, and run analytics on large document collections with natural language. It includes two components: The Aryn Partitioning Service and Sycamore. -The Aryn Partitioning Service (APS) is a serverless, GPU-powered API for segmenting and labeling PDF documents, doing OCR, and extracting tables and images. It returns the output in JSON. APS runs the Aryn Partitioner and its `state-of-the-art, open source deep learning AI model `_ trained on 80k+ enterprise documents. You can use it to partition documents and extract information directly in your code, or use with Sycamore for additional processing. `Sign-up here for free `_ to get an API Key and use the `Aryn Playground `_ to visually see how it segments and processes your own documents. Or, watch the Aryn Partitioning Service in action in `this video `_. +The Aryn Partitioning Service (APS) is a serverless, GPU-powered API for segmenting and labeling PDF documents, doing OCR, and extracting tables and images. It returns the output in JSON. APS runs the Aryn Partitioner and its `state-of-the-art, open source deep learning AI model `_ trained on 80k+ enterprise documents. You can use it to partition documents and extract information directly in your code, or use it with Sycamore for additional processing. `Sign-up here for free `_ to get an API Key and use the `Aryn Playground `_ to visually see how it segments and processes your own documents. Or, watch the Aryn Partitioning Service in action in `this video `_. Sycamore is a document processing engine covered under the Apache v2.0 license. It's built for complex unstructured data, such as documents, presentations, transcripts, embedded tables, and internal knowledge repositories. Sycamore provides a declarative dataflow abstraction called a DocSet to make manipulating unstructured documents easy and scalable. It’s similar in style to Apache Spark and Pandas, but for collections of unstructured documents. DocSets can be used not only for extracting, enriching, summarizing, and cleaning unstructured data, but also for running powerful analytics on these datasets. @@ -57,7 +57,7 @@ Next, you can: .. -You can specify additional options (e.g. table extraction), and a list of these options is :doc: `here ` +You can specify additional options (e.g. table extraction), and a list of these options is :doc:`here `. |