Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a new data connector for Notion #260

Closed
ajjimeno opened this issue Feb 23, 2023 · 7 comments
Closed

Create a new data connector for Notion #260

ajjimeno opened this issue Feb 23, 2023 · 7 comments
Labels
enhancement New feature or request

Comments

@ajjimeno
Copy link
Contributor

ajjimeno commented Feb 23, 2023

Create a data connector that:

  • fetches data from Notion.
  • stores the content locally (at least temporarily for processing), and runs them through unstructured.partition.auto.
  • inspiration for processing is available here

See Adding Data Connectors for details on how to get started. Make sure to include a link to this issue when submitting a PR.

Definition of Done

  • The checklist has been completed.
  • The connector is able to process entries from Notion using the API.
  • The connector is able to process a single entry.
  • The connector is able to process several entries, with an option to process them recursively.
  • The connector can accept any credentials, if necessary.
  • The connector should be able of processing documents through unstructured.partition.auto.
@ajjimeno ajjimeno added the enhancement New feature or request label Feb 23, 2023
@HAKSOAT
Copy link
Contributor

HAKSOAT commented Feb 26, 2023

Hi @ajjimeno I'm looking to pick this up.

What does an entry refer to here? Does it mean a Notion page?

@ajjimeno
Copy link
Contributor Author

Hi @HAKSOAT, thank you for the interest. Yes, the idea to being able of recovering a Notion page and, if there are nested ones recover them as well. I think the term processing might be confusing but the idea is to connect to Notion, recover the page, nested pages and possible additional attached content. The content recovered should be processed by unstructured.partition.auto.

@HAKSOAT
Copy link
Contributor

HAKSOAT commented Feb 27, 2023

Thanks for clarifying. I picked up the Google Drive connector enhancement. I hope to come back to do this when that is done.

@HAKSOAT
Copy link
Contributor

HAKSOAT commented Mar 5, 2023

Just putting an update here that I am now taking a look at this task.

@ajjimeno
Copy link
Contributor Author

ajjimeno commented Mar 5, 2023 via email

@HAKSOAT
Copy link
Contributor

HAKSOAT commented Mar 10, 2023

Thought to share my findings after exploring the Notion API here. I didn't proceed with this because it felt a bit complex to work with Notion's API, especially in a short timeframe.

They use the concept of "blocks". Hence, multiple "blocks" per page with the possibility of nested blocks (say in the case of bullet points or toggles). I believe an approach that builds an actual page first (maybe a markdown file as the result) before passing through will be needed to make this work.

I think the following links will be good places to start from when working on this:

https://developers.notion.com/reference/block
https://developers.notion.com/reference/get-block-children
https://developers.notion.com/reference/property-value-object

@ajjimeno
Copy link
Contributor Author

ajjimeno commented Mar 11, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants