Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to submit my model to the Leaderboard? #27

Open
Waneila opened this issue Jul 25, 2024 · 15 comments
Open

How to submit my model to the Leaderboard? #27

Waneila opened this issue Jul 25, 2024 · 15 comments

Comments

@Waneila
Copy link

Waneila commented Jul 25, 2024

No description provided.

@Psycoy
Copy link
Owner

Psycoy commented Jul 26, 2024

Hi, Waneila, you can run the test on MixEval-Hard and MixEval and give us the screenshot here
Please make sure to adhere to the instructions in the repo

thanks
Jinjie

@Psycoy Psycoy closed this as completed Aug 9, 2024
@Psycoy Psycoy reopened this Aug 9, 2024
@Waneila
Copy link
Author

Waneila commented Sep 3, 2024 via email

@Waneila
Copy link
Author

Waneila commented Sep 5, 2024

I have already replied to the issue via email and submitted our results. Please confirm if you have received them.

thanks
Waneila

@Psycoy
Copy link
Owner

Psycoy commented Sep 5, 2024

Hi @Waneila ,

I cannot see the results here, was it attached as an image?

@Waneila
Copy link
Author

Waneila commented Sep 5, 2024

Hi Jinjie,

I'm sorry, the image was sent via email, so it might not display here. Let me resend the screenshot.
Here are the results of our model, Spark4.0, on MixEval-Hard and MixEval for the 20240601 version. Please include our model name in the leaderboard.
mixeval_hard
mixeval

Thank you, and I wish you a pleasant day.

Best regards,
Waneila

@Psycoy
Copy link
Owner

Psycoy commented Sep 5, 2024

Hi @Waneila ,

Is there any technical report / paper for your models?
We will only include models that are known to the public to the leaderboard.
If there is one, would you kindly give us a pointer? We will look into it.
If not, maybe you can first indicate the results in the paper and contact us to add to the leaderboard as soon as it's released.

Have a nice day!

@Waneila
Copy link
Author

Waneila commented Sep 6, 2024

Hi Jinjie,

This is the access address for our model: https://xinghuo.xfyun.cn/spark. You are welcome to visit this interface.

thanks
Waneila

@Waneila
Copy link
Author

Waneila commented Sep 10, 2024

Hi Jinjie,

We have already provided our model's homepage. When can it be added to the leaderboard approximately? If there are any issues, please contact me promptly.

thanks
Waneila

@Psycoy
Copy link
Owner

Psycoy commented Sep 10, 2024

Hi @Waneila ,

It's alr on the leaderboard. Please check if you could see it.

@Waneila
Copy link
Author

Waneila commented Sep 10, 2024

Hi Jinjie,

We have already seen our model on the leaderboard, thanks for your support.

@Waneila
Copy link
Author

Waneila commented Oct 11, 2024

Hi Jinjie,

Our model, Spark 4.0, has been updated to Spark 4.5. We are pleased to announce that compared to the previous version, our latest model has achieved significant improvements in MixEval-Hard and MixEval tasks. Here are the results of our latest model, Spark 4.5, on MixEval-Hard and MixEval for the 20240601 version. Please update our model in the leaderboard.
mixeval-hard
mixeval

Thank you, and I wish you a nice day!
Waneila

@Waneila
Copy link
Author

Waneila commented Oct 17, 2024

Hi Jinjie,

I'm sorry, our latest model is named Spark 4.0-2024-10-14, not Spark 4.5. Please take note.

Thanks,
Waneila

@Psycoy
Copy link
Owner

Psycoy commented Nov 5, 2024

Hi Waneila,

Noted, we will update the results soon

@Waneila
Copy link
Author

Waneila commented Nov 5, 2024

Hi Jinjie,

When can the update be completed?

@Psycoy
Copy link
Owner

Psycoy commented Nov 6, 2024

Hi Jinjie,

When can the update be completed?

Hi @Waneila ,

We are currently reviewing some models to be added to the leaderboard.
We will update the results once it's done.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants