1 Star 0 Fork 280

lovezhou1990 / SeaTunnel

forked from Apache SeaTunnel / SeaTunnel 
加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
Notion.md 7.21 KB
一键复制 编辑 原始数据 按行查看 历史

Notion

Notion source connector

Description

Used to read data from Notion.

Key features

Options

name type required default value
url String Yes -
password String Yes -
version String Yes -
method String No get
schema.fields Config No -
format String No json
params Map No -
body String No -
json_field Config No -
content_json String No -
poll_interval_millis int No -
retry int No -
retry_backoff_multiplier_ms int No 100
retry_backoff_max_ms int No 10000
enable_multi_lines boolean No false
common-options config No -

url [String]

http request url

password [String]

API key for login, you can get more detail at this link:

https://developers.notion.com/docs/authorization

version [String]

The Notion API is versioned. API versions are named for the date the version is released

method [String]

http request method, only supports GET, POST method

params [Map]

http params

body [String]

http body

poll_interval_millis [int]

request http api interval(millis) in stream mode

retry [int]

The max retry times if request http return to IOException

retry_backoff_multiplier_ms [int]

The retry-backoff times(millis) multiplier if request http failed

retry_backoff_max_ms [int]

The maximum retry-backoff times(millis) if request http failed

format [String]

the format of upstream data, now only support json text, default json.

when you assign format is json, you should also assign schema option, for example:

upstream data is the following:

{
  "code": 200,
  "data": "get success",
  "success": true
}

you should assign schema as the following:


schema {
    fields {
        code = int
        data = string
        success = boolean
    }
}

connector will generate data as the following:

code data success
200 get success true

when you assign format is text, connector will do nothing for upstream data, for example:

upstream data is the following:

{
  "code": 200,
  "data": "get success",
  "success": true
}

connector will generate data as the following:

content
{"code": 200, "data": "get success", "success": true}

schema [Config]

fields [Config]

the schema fields of upstream data

content_json [String]

This parameter can get some json data.If you only need the data in the 'book' section, configure content_field = "$.store.book.*".

If your return data looks something like this.

{
  "store": {
    "book": [
      {
        "category": "reference",
        "author": "Nigel Rees",
        "title": "Sayings of the Century",
        "price": 8.95
      },
      {
        "category": "fiction",
        "author": "Evelyn Waugh",
        "title": "Sword of Honour",
        "price": 12.99
      }
    ],
    "bicycle": {
      "color": "red",
      "price": 19.95
    }
  },
  "expensive": 10
}

You can configure content_field = "$.store.book.*" and the result returned looks like this:

[
  {
    "category": "reference",
    "author": "Nigel Rees",
    "title": "Sayings of the Century",
    "price": 8.95
  },
  {
    "category": "fiction",
    "author": "Evelyn Waugh",
    "title": "Sword of Honour",
    "price": 12.99
  }
]

Then you can get the desired result with a simpler schema,like

Http {
  url = "http://mockserver:1080/contentjson/mock"
  method = "GET"
  format = "json"
  content_field = "$.store.book.*"
  schema = {
    fields {
      category = string
      author = string
      title = string
      price = string
    }
  }
}

Here is an example:

json_field [Config]

This parameter helps you configure the schema,so this parameter must be used with schema.

If your data looks something like this:

{
  "store": {
    "book": [
      {
        "category": "reference",
        "author": "Nigel Rees",
        "title": "Sayings of the Century",
        "price": 8.95
      },
      {
        "category": "fiction",
        "author": "Evelyn Waugh",
        "title": "Sword of Honour",
        "price": 12.99
      }
    ],
    "bicycle": {
      "color": "red",
      "price": 19.95
    }
  },
  "expensive": 10
}

You can get the contents of 'book' by configuring the task as follows:

source {
  Http {
    url = "http://mockserver:1080/jsonpath/mock"
    method = "GET"
    format = "json"
    json_field = {
      category = "$.store.book[*].category"
      author = "$.store.book[*].author"
      title = "$.store.book[*].title"
      price = "$.store.book[*].price"
    }
    schema = {
      fields {
        category = string
        author = string
        title = string
        price = string
      }
    }
  }
}

common options

Source plugin common parameters, please refer to Source Common Options for details

Example

Notion {
    url = "https://api.notion.com/v1/users"
    password = "SeaTunnel-test"
    version = "2022-06-28"
    content_field = "$.results.*"
    schema = {
       fields {
          object = string
          id = string
          type = string
          person = {
              email = string
          }
          avatar_url = string
       }
    }
}

Changelog

next version

  • Add Notion Source Connector
Java
1
https://gitee.com/ljzhou/SeaTunnel.git
git@gitee.com:ljzhou/SeaTunnel.git
ljzhou
SeaTunnel
SeaTunnel
dev

搜索帮助