git repositories marketplace on viix

Add Git project

GIT repositories: All

Skip-gram-with-NS.ipynb

Skip-gram with negative sampling

#text processing #machine learning modelling

CBoW.ipynb

CBoW with keras (example of implementation and using)

#text processing

Keras-GAN

Keras implementations of Generative Adversarial Networks.

#machine learning modelling #neural network

sequence-to-sequence translation (by Keras)

Implementation a basic character-level sequence-to-sequence model. Applied to translating short English sentences into short French sentences, character-by-character.

#text processing #keras

LSTM stateful (by Keras)

Example demonstrate how to use a stateful LSTM model, stateful vs stateless LSTM performance comparison

#text processing #machine learning modelling #neural network

pretrained word embeddings (by Keras)

This script loads pre-trained word embeddings (GloVe embeddings) into a frozen Keras Embedding layer, and uses it to train a text classification model on the 20 Newsgroup dataset

#nlp #text processing #keras

LSTM text generation (by Keras)

Example script to generate text from Nietzsche's writings.

#nlp #text processing #machine learning modelling

fastText on IMDB-dataset (by Keras)

This example demonstrates the use of fasttext for text classification

#nlp #text processing #keras

bAbI mem-NN (by Keras)

End-to-End memory network on bAbI-dataset (reading comprehension Question-Answering).

#keras #machine learning modelling #neural network

LSTM-based network on bAbI-dataset (by Keras)

Recurrent neural networks for modeling Facebook’s bAbi dataset, “a mixture of 20 tasks for testing text understanding and reasoning”

#keras #machine learning modelling #neural network

unstructured-text-modelling

Text Analytics (Unsupervised Clustering) and Neural Network Modelling

#text processing #machine learning modelling #neural network

multi-label-text-classification

Holds code for collecting data from arXiv to build a multi-label text classification dataset and a simpler classifier on top of that.

#text processing #machine learning modelling

gpt2-from-scratch

How to create GPT-2, a powerful language model developed by OpenAI from scratch that can generate human-like text by predicting the next word in a sequence.

#llm #machine learning modelling #gpt

Compact Language Detector 2 (c++)

CLD2 probabilistically detects over 80 languages in Unicode UTF-8 text, either plain text or HTML/XML. Legacy encodings must be converted to valid UTF-8 by the caller. For mixed-language input, CLD2 returns the top three languages found and their approximate percentages of the total text bytes

#text processing #c++

counsel-chat (counsel_chat.ipynb)

This repository holds the code for working with data from counselchat.com. The scarped data are from individiuals seeking assistance from licensed therapists and their associated responses.

#llm #nlp #text processing

gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

#llm #gpt #neural network

gpt-2 (openAI) original

This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2. Implementation from OpenAI

#llm #gpt #neural network

MobileLLM

This repository contains the training code of MobileLLM introduced in our work: "MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases", MobileLLM-125M/350M

#llm #ai #neural network

mml-book

Mathematics For Machine Learning. [Marc Peter Deisenroth, A Aldo Faisal, and Cheng Soon Ong]. To be published by Cambridge University Press. 🏆❇️✴️✨🎯

#machine learning modelling

fastText

fastText is a library for efficient learning of word representations and sentence classification. 🖥️🤖📡💻🌐⌨️🖱️🌍⭐👩‍💻📱

#text processing #embeddings

OpenNN

OpenNN is a software library written in C++ for advanced analytics. It implements neural networks, the most successful machine learning method.

#c++ #neural network

openai-finetuning-example

This repository provides an example of fine-tuning OpenAI's GPT-4o-mini model for classifying customer service support tickets. Through fine-tuning, we are able to increase the classification accuracy from 69% to 94%.

#llm #nlp #text processing

transformer-tf

Attention Is All You Need Implementation (TensorFlow 2.x) 🚀 This repository contains the TensorFlow implementation of the paper. This implementation can be used to perform any sequence to sequence task with some minimal code changes.

#tensorflow #transformers

tf-transformers

Imagine auto-regressive generation to be 90x faster. tf-transformers (Tensorflow Transformers) is designed to harness the full power of Tensorflow 2, designed specifically for Transformer based architecture.

#tensorflow #transformers

mgpt

We introduce mGPT, a multilingual variant of GPT-3, pretrained on 61 languages from linguistically diverse 25 language families using Wikipedia and C4 Corpus.

#llm #text processing #transformers

minGPT-TF

A TensorFlow re-implementation of mingpt

#tensorflow #transformers

gpt-mini

Yet another minimalistic Tensorflow (re-)re-implementation of Karpathy's Pytorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer).

#transformers

nanoGPT

Simplest, fastest repository for training/finetuning medium-sized GPTs.

#llm #nlp #ai

ml-cs335

Machine-learning mini-course: numpy examples for beginner-level

#machine learning modelling

gpt3-sandbox

The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python.

gpt-3-simple-tutorial

Generate SQL from Natural Language Sentences using OpenAI's GPT-3 Model

miniGPT

Minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer), both training and inference

#llm #nlp #text processing