Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to extract all the skill names into a list? as well as if it's a hard, soft skill or certification? #70

Open
dataelvisliang opened this issue Jul 4, 2023 · 1 comment

Comments

@dataelvisliang
Copy link

Is it possible to extract all the skill names into a list? as well as if it's a hard, soft skill or certification?

annotations = skill_extractor.annotate(job_description)
skill_extractor.describe(annotations)

these 2 lines of codes will annotate the texts visually, and I tried to use the followings to extract all the skills into a list:
doc_node_values = list(set([entry['doc_node_value'] for entry in annotations['results']['ngram_scored']]))
print(doc_node_values)

is there a way to also extract the skill label? (e.g., soft, hard, certification etc.)?

thank you

@steinathan
Copy link

Asking this too

i saw that theres a

SKILL_TO_COLOR = {
    'Hard Skill': '#818CF8',
    'Soft Skill': '#F472B6',
    'Certification': "#552448"
}

but dont know how to unwrap the skill_id eg. KS1282M68WL9T4YH3SLF to output the skill name and other info

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants