GitHub - vection/TextTitan: TextTitan offers an end-to-end set of tools for tackling a wide range of NLP tasks, including classification, regression, generation, and multi-modal tasks.

The TextTitan library offers a comprehensive set of tools for tackling a wide range of NLP tasks, including classification, regression, multi-modal, and generation tasks.

** Still under development.

Introduction

TextTitan is an NLP library that provides end-to-end solutions for a wide range of NLP tasks, so that developers and researchers can run complex NLP tasks efficiently.

With TextTitan, you can tackle classification, regression, multi-modal, and generation tasks. The library is designed to be production-ready, so you can quickly integrate NLP capabilities into your projects.

Whether you're a beginner or an experienced NLP practitioner, TextTitan provides an intuitive and easy-to-use interface that makes NLP accessible to everyone. So why wait? Start using TextTitan today and revolutionize your NLP workflow!

Installation

Coming soon.

Quick Usage

Supported models: BERT, RoBERTa, DeBERTaV3, LSTM, LSTM+CNN

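# Assumed import path, based on nlp_classifier.py in the repository root
from nlp_classifier import NLPClassifier

classifier = NLPClassifier(
    base_model='lstm',
    problem_type='single_label_classification',
    save_path='best_weights',
)
classifier.max_length = 64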
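
The max_length attribute presumably caps the number of tokens per input, as is common for sequence models; the source does not spell out its exact behavior. The sketch below illustrates that general idea with a plain whitespace tokenizer. The names pad_or_truncate and PAD_TOKEN are hypothetical and are not part of TextTitan's API.

```python
from typing import List

PAD_TOKEN = "<pad>"  # hypothetical padding token, for illustration only


def pad_or_truncate(text: str, max_length: int) -> List[str]:
    """Split on whitespace, then cut or pad the tokens to exactly max_length."""
    tokens = text.split()[:max_length]  # drop tokens beyond the limit
    padding = [PAD_TOKEN] * (max_length - len(tokens))
    return tokens + padding  # right-pad short inputs


if __name__ == "__main__":
    print(pad_or_truncate("great service and fast delivery", 4))
    print(pad_or_truncate("ok", 4))
```

A smaller max_length reduces memory use and training time but discards the tail of longer texts, so a value such as 64 is likely a reasonable fit for short inputs like tweets.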

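To train the model, provide the training data in one of these forms:

CustomDataset
Array (for example, Python lists of texts and labels, as in the example below)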

# df is a pandas DataFrame with 'OriginalTweet' and 'Sentiment' columns
text_list = df['OriginalTweet'].tolist()
train_label_list = df['Sentiment'].tolist()
classifier.train(text_list, train_label_list, epochs=15, batch_size=128)

TextTitan uses early stopping and automatically saves the weights that perform best on the validation set.
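
To make the idea concrete, here is a minimal sketch of early stopping with best-checkpoint tracking. It is a generic illustration and not TextTitan's actual implementation: the function run_early_stopping, the patience parameter, and the save_best callback are all hypothetical, and the validation losses are supplied as a list instead of being computed by a real training loop.

```python
from typing import Callable, List, Optional, Tuple


def run_early_stopping(
    val_losses: List[float],
    patience: int = 3,
    save_best: Optional[Callable[[int], None]] = None,
) -> Tuple[int, float]:
    """Walk through per-epoch validation losses and stop once the loss has
    not improved for `patience` consecutive epochs.

    Returns (best_epoch, best_loss). `save_best` stands in for writing a
    checkpoint and is called each time a new best loss is seen.
    """
    best_epoch, best_loss = -1, float("inf")
    epochs_without_improvement = 0

    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            # New best: remember it and "save" the weights.
            best_epoch, best_loss = epoch, loss
            epochs_without_improvement = 0
            if save_best is not None:
                save_best(epoch)
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break  # no improvement for `patience` epochs in a row

    return best_epoch, best_loss


if __name__ == "__main__":
    saved_epochs: List[int] = []
    losses = [0.9, 0.7, 0.6, 0.65, 0.66, 0.67, 0.1]
    print(run_early_stopping(losses, patience=3, save_best=saved_epochs.append))
    print(saved_epochs)
```

In this sketch the loop stops before reaching the final loss value, which shows the trade-off of a small patience: training ends sooner, but a late improvement can be missed.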

For more information, see the Documentation.

Loading & Evaluation

Let's assume the model was trained with the default save name, best_weights.

** For BERT / RoBERTa models, the saved output is a single folder.

** For LSTM / LSTM+CNN models, the weights file is best_weights.pth and the tokenizer is saved as best_weights_tokenizer.

The loading function always takes the name under which the model was saved.

classifier = NLPClassifier.load('best_weights.pth')
classifier.predict(['test1 test test ', 'test2 test test'])

[('Positive', 0.27153775095939636), ('Positive', 0.30906566977500916)]
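
The output above is a list of (label, score) tuples. As a rough sketch of how such output could be evaluated, the helpers below compute accuracy against known labels and flag low-scoring predictions. The functions accuracy and low_confidence, the gold labels, and the 0.3 threshold are hypothetical and for illustration only; they are not part of TextTitan, and the source does not state how the score should be interpreted.

```python
from typing import List, Sequence, Tuple

Prediction = Tuple[str, float]  # (label, score), matching the output format above


def accuracy(predictions: Sequence[Prediction], gold_labels: Sequence[str]) -> float:
    """Fraction of predictions whose label matches the gold label."""
    if len(predictions) != len(gold_labels):
        raise ValueError("predictions and gold_labels must have the same length")
    if not gold_labels:
        return 0.0
    correct = sum(
        1 for (label, _score), gold in zip(predictions, gold_labels) if label == gold
    )
    return correct / len(gold_labels)


def low_confidence(predictions: Sequence[Prediction], threshold: float) -> List[int]:
    """Indices of predictions whose score falls below the threshold."""
    return [i for i, (_label, score) in enumerate(predictions) if score < threshold]


if __name__ == "__main__":
    preds = [("Positive", 0.27153775095939636), ("Positive", 0.30906566977500916)]
    gold = ["Positive", "Negative"]  # made-up gold labels for illustration
    print(accuracy(preds, gold))
    print(low_confidence(preds, threshold=0.3))
```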
