Skip to content
View josecannete's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Organizations

@dccuchile

Block or report josecannete

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
josecannete/README.md

Hello there, I'm José! 😁

I'm a Expert Machine Learning Engineer at Walmart, passionate about leveraging AI for real-world impact. I hold an MSc. in Computer Science, graduated from Universidad de Chile, and specialize in Language Models.

I pioneered BETO, the first BERT-like model for Spanish, and introduced ALBETO, a suite of light and fast models based in ALBERT for Spanish.

Formerly a founding member of the ReLeLa research group at DCC UChile, I've passionately explored Artificial Intelligence and optimizing systems for real-world applications.

Outside of work, music and podcasts are my go-to sources for relaxation and learning.

Feel free to reach out – you'll find my contact details in the adjacent bio.

Pinned Loading

  1. dccuchile/beto dccuchile/beto Public

    BETO - Spanish version of the BERT model

    492 63

  2. dccuchile/lightweight-spanish-language-models dccuchile/lightweight-spanish-language-models Public

    ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.

    Python 30 2

  3. dccuchile/speedy-gonzales dccuchile/speedy-gonzales Public

    Code for "Speedy Gonzales: A Collection of Fast Task-Specific Models for Spanish"

    HTML 7

  4. dccuchile/spanish-word-embeddings dccuchile/spanish-word-embeddings Public

    Spanish word embeddings computed with different methods and from different corpora

    356 82

  5. spanish-corpora spanish-corpora Public

    Unannotated Spanish 3 Billion Words Corpora

    Python 92 10

  6. BotCenter/spanishWordEmbeddings BotCenter/spanishWordEmbeddings Public

    Spanish Word Embeddings computed from large corpora and different sizes using fastText.

    9 1