zhenye234

🍉

YE Zhen zhenye234

🍉

Speech synthesis, Audio generation, Speech LLM

126 followers · 50 following

Hong Kong University of Science and Technology
Hong Kong
@zhenye234
https://huggingface.co/ZhenYe234

Achievements

Stars

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 4,084 397 Updated Feb 16, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,995 429 Updated Nov 21, 2024

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 316 15 Updated Feb 13, 2025

qiuqiangkong / audio_understanding

Python 84 3 Updated Feb 6, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 20,278 1,745 Updated Feb 17, 2025

huggingface / dataspeech

Python 345 53 Updated Sep 3, 2024

Kevin-naticl / LLaSE

LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement

Python 8 1 Updated Jan 28, 2025

deepseek-ai / DeepSeek-R1

76,839 9,957 Updated Feb 14, 2025

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 327 25 Updated Feb 14, 2025

freddyaboulton / gradio-webrtc

Realtime Video and Audio Streaming with WebRTC and Gradio

Python 205 27 Updated Feb 15, 2025

ScalingIntelligence / large_language_monkeys

Python 82 14 Updated Sep 25, 2024

sarulab-speech / UTMOSv2

UTokyo-SaruLab MOS Prediction System

Python 145 14 Updated Dec 9, 2024

zhenye234 / LLaSA_inference

23 Updated Feb 8, 2025

vivian556123 / NeurIPS2024-CoVoMix

Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

Python 48 4 Updated Jan 16, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,000 97 Updated Jan 16, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 165 18 Updated Feb 13, 2025

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,978 1,517 Updated Jan 13, 2025

Takaaki-Saeki / DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

Python 142 11 Updated Dec 5, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,218 2,332 Updated Feb 12, 2025