Stars
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
End-to-End Object Detection with Transformers
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Code for training & evaluating Contextual Document Embedding models
Robust Speech Recognition via Large-Scale Weak Supervision
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
Use the PS4 camera as a cheap yet powerful 3D depth and RGB camera for use with OpenCV and Python on Linux
Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
Qucs-S is a circuit simulation program with Qt-based GUI
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
The most customizable typing website with a minimalistic design and a ton of features. Test yourself in various modes, track your progress and improve your speed.
Faker is a Python package that generates fake data for you.
Code repository of all OpenGL chapters from the book and its accompanying website https://learnopengl.com
AliBaba Notifier: Real-time Flight Price Monitoring and Telegram Alerts
A modern, interlingual wordnet interface for Python
CCSER using TL and Attention-based Fusion of Wav2vec2 and Prosody