Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: which input fields are actually required for the model? #5

Open
andrei-volkau opened this issue Oct 20, 2020 · 1 comment

Comments

@andrei-volkau
Copy link

I am sending an example.

I trained a model. I would like to try it on my own data. Could you please clarify which fields are actually required for the model? It might be possible that I don't need to provide all of them while shaping my own data as per the format below.

{
  "text3": "Twins",
  "text1": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
  "text2": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
  "annotation3": {
    "raw_text": "Twins",
    "toks": "Twins",
    "POSs": "NNS",
    "positions": "0-0-5",
    "NERs": "O"
  },
  "id": "56de8c374396321400ee2a11",
  "annotation2": {
    "raw_text": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
    "toks": "What 's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988 ?",
    "POSs": "WP VBZ DT NN IN DT NN NN NNP VBD IN IN NNP NNP IN CD .",
    "positions": "0-0-4 1-4-6 2-7-10 3-11-16 4-17-19 5-20-23 6-24-30 7-31-36 8-37-51 9-52-59 10-60-62 11-63-67 12-68-73 13-74-80 14-81-83 15-84-88 16-88-89",
    "NERs": "O O O O O O O O PERSON O O O PERSON PERSON O DATE O"
  },
  "annotation1": {
    "raw_text": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "toks": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "POSs": "NNS JJ CD NN , DT NN IN JJ NN , RB VBD JJ .",
    "positions": "0-0-5 1-6-11 2-12-16 3-17-22 4-23-24 5-25-26 6-27-33 7-34-38 8-39-44 9-45-51 10-52-53 11-54-58 12-59-65 13-66-76 14-77-78",
    "NERs": "O O DATE O O O O O O O O O O O O",
    "graph": {
      "g_features": [
        "twins",
        "-lrb-",
        "1988",
        "-rrb-",
        ",",
        "a",
        "comedy",
        "with",
        "danny",
        "devito",
        ",",
        "also",
        "proved",
        "successful",
        "."
      ],
      "g_adj": {
        "0": [
          {
            "node": 1,
            "edge": "punct"
          },
          {
            "node": 2,
            "edge": "appos"
          },
          {
            "node": 3,
            "edge": "punct"
          },
          {
            "node": 4,
            "edge": "punct"
          },
          {
            "node": 6,
            "edge": "appos"
          },
          {
            "node": 10,
            "edge": "punct"
          }
        ],
        "6": [
          {
            "node": 5,
            "edge": "det"
          },
          {
            "node": 7,
            "edge": "prep"
          }
        ],
        "7": [
          {
            "node": 9,
            "edge": "pobj"
          }
        ],
        "9": [
          {
            "node": 8,
            "edge": "amod"
          }
        ],
        "12": [
          {
            "node": 0,
            "edge": "nsubj"
          },
          {
            "node": 11,
            "edge": "advmod"
          },
          {
            "node": 13,
            "edge": "oprd"
          },
          {
            "node": 14,
            "edge": "punct"
          }
        ]
      },
      "num_edges": 14
    }
  }
}
@hugochan
Copy link
Owner

hugochan commented Mar 2, 2021

I am sending an example.

I trained a model. I would like to try it on my own data. Could you please clarify which fields are actually required for the model? It might be possible that I don't need to provide all of them while shaping my own data as per the format below.

{
  "text3": "Twins",
  "text1": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
  "text2": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
  "annotation3": {
    "raw_text": "Twins",
    "toks": "Twins",
    "POSs": "NNS",
    "positions": "0-0-5",
    "NERs": "O"
  },
  "id": "56de8c374396321400ee2a11",
  "annotation2": {
    "raw_text": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
    "toks": "What 's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988 ?",
    "POSs": "WP VBZ DT NN IN DT NN NN NNP VBD IN IN NNP NNP IN CD .",
    "positions": "0-0-4 1-4-6 2-7-10 3-11-16 4-17-19 5-20-23 6-24-30 7-31-36 8-37-51 9-52-59 10-60-62 11-63-67 12-68-73 13-74-80 14-81-83 15-84-88 16-88-89",
    "NERs": "O O O O O O O O PERSON O O O PERSON PERSON O DATE O"
  },
  "annotation1": {
    "raw_text": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "toks": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "POSs": "NNS JJ CD NN , DT NN IN JJ NN , RB VBD JJ .",
    "positions": "0-0-5 1-6-11 2-12-16 3-17-22 4-23-24 5-25-26 6-27-33 7-34-38 8-39-44 9-45-51 10-52-53 11-54-58 12-59-65 13-66-76 14-77-78",
    "NERs": "O O DATE O O O O O O O O O O O O",
    "graph": {
      "g_features": [
        "twins",
        "-lrb-",
        "1988",
        "-rrb-",
        ",",
        "a",
        "comedy",
        "with",
        "danny",
        "devito",
        ",",
        "also",
        "proved",
        "successful",
        "."
      ],
      "g_adj": {
        "0": [
          {
            "node": 1,
            "edge": "punct"
          },
          {
            "node": 2,
            "edge": "appos"
          },
          {
            "node": 3,
            "edge": "punct"
          },
          {
            "node": 4,
            "edge": "punct"
          },
          {
            "node": 6,
            "edge": "appos"
          },
          {
            "node": 10,
            "edge": "punct"
          }
        ],
        "6": [
          {
            "node": 5,
            "edge": "det"
          },
          {
            "node": 7,
            "edge": "prep"
          }
        ],
        "7": [
          {
            "node": 9,
            "edge": "pobj"
          }
        ],
        "9": [
          {
            "node": 8,
            "edge": "amod"
          }
        ],
        "12": [
          {
            "node": 0,
            "edge": "nsubj"
          },
          {
            "node": 11,
            "edge": "advmod"
          },
          {
            "node": 13,
            "edge": "oprd"
          },
          {
            "node": 14,
            "edge": "punct"
          }
        ]
      },
      "num_edges": 14
    }
  }
}

@andrei-volkau sorry for the late reply! Actually all the fields except for "positions" are needed. If you do not want to use the linguistic features like POS and NER, these two fields can be omitted as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants