Signature extraction #1854

Open
baderdean opened this issue Nov 7, 2024 · 0 comments

baderdean commented Nov 7, 2024

  1. Fetch only the non-HTML part of each email and send it to a queue
  2. Read emails from the Redis Stream
  3. Detect the signature in the non-HTML part of each email, looking at the last 20 lines only
  4. Automatically extract all information from the signature using regular expressions and NER (named-entity recognition), including:
  • fullname
  • avatar
  • title
  • company
  • address
  • phone
  • social networks: at least LinkedIn and X
  • other URLs
  5. Insert them into a temporary table
  6. Group them by email to build the most complete and most recent data for every contact
  7. Insert the results to
  8. The Leadminer endpoint will verify the task-id and the token, insert them into the database, then notify the user that their data has been enriched
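
Steps 3, 4 and 6 above could be sketched roughly as follows. This is only an illustration under assumptions, not the planned implementation: the function names (`detect_signature`, `extract_fields`, `merge_by_email`), the sign-off phrases, the `-- ` delimiter heuristic, the regular expressions and the record shape (`email`, `date`) are all hypothetical. Name, title, company and address are left out because they would need NER rather than regular expressions.

```python
import re
from typing import Optional

# All patterns below are illustrative assumptions, not rules taken from the issue.
MAX_SIGNATURE_LINES = 20  # step 3: only the last 20 lines are inspected
SIGNATURE_DELIMITER = re.compile(r"^--\s*$")  # conventional "-- " separator
SIGN_OFF = re.compile(
    r"^(best regards|kind regards|regards|cheers|thanks|sincerely|cordialement)\b[,.!]?\s*$",
    re.IGNORECASE,
)
PHONE = re.compile(r"\+?\d[\d ().-]{6,}\d")
URL = re.compile(r"https?://[^\s<>\"']+")


def detect_signature(plain_text: str) -> Optional[str]:
    """Return the candidate signature block from the tail of the email, or None."""
    tail = plain_text.rstrip().splitlines()[-MAX_SIGNATURE_LINES:]
    # Scan upwards so the marker closest to the end of the email wins.
    for index in range(len(tail) - 1, -1, -1):
        line = tail[index].strip()
        if SIGNATURE_DELIMITER.match(line) or SIGN_OFF.match(line):
            block = "\n".join(tail[index + 1:]).strip()
            return block or None
    return None


def extract_fields(signature: str) -> dict:
    """Extract the regex-friendly fields (phone, LinkedIn, X, other URLs)."""
    urls = [u.rstrip(".,;)") for u in URL.findall(signature)]
    # Blank out URLs first so digits inside links are not mistaken for phones.
    phone_match = PHONE.search(URL.sub(" ", signature))
    fields = {
        "phone": phone_match.group(0).strip() if phone_match else None,
        "linkedin": next((u for u in urls if "linkedin.com" in u), None),
        "x": next(
            (u for u in urls if re.search(r"//(www\.)?(x|twitter)\.com/", u)), None
        ),
    }
    fields["other_urls"] = [
        u for u in urls if u not in (fields["linkedin"], fields["x"])
    ]
    return fields


def merge_by_email(records: list[dict]) -> dict[str, dict]:
    """Step 6: per email, keep the newest non-empty value of every field."""
    merged: dict[str, dict] = {}
    for record in sorted(records, key=lambda r: r["date"]):
        contact = merged.setdefault(record["email"], {})
        for key, value in record.items():
            if key not in ("email", "date") and value:
                contact[key] = value  # newer non-empty values overwrite older ones
    return merged
```

Processing records oldest first and overwriting only with non-empty values is one simple way to get data that is both "most complete" and "most recent": an older phone number survives if newer emails carry none, and is replaced as soon as a newer one appears.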

Inspiration:

Other inspiration:
https://github.com/disklosr/spotd
https://github.com/mlightner/contact_parser
https://github.com/inders/signature-in-email
https://github.com/thrustlabs/contact-parser
https://github.com/convosense/email_signature_remover
https://github.com/Statosco/Email-Signature-Extractor-using-BERT-and-Gemini
https://github.com/KimathiNewton/emails-signature-info
https://github.com/ruslyrossi47/email_signature
https://github.com/captamericadevs/ContactBuilder
https://github.com/edogola4/Email-Signature-Extraction-with-LLMs
https://github.com/leomarcel/email_parser
https://github.com/kartiikthakur/Email-Zoning--ML-Project

baderdean changed the title from "Récupération des signatures" to "Signature extraction" on Nov 7, 2024