indexing library for Go
Header-only C++ binding for libzmq
mt code
jiant is an nlp toolkit
A VM That is Dynamic and Fast
CodeBERT
LatticeLSTM by jiesutd
Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
Python
1692
Updated: 2 y ago
License: No License (No License)
gpt2-ml by imcaspar
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model.
Python
1682
Updated: 2 y ago
License: Permissive (Apache-2.0)
NLP-Interview-Notes by km1994
This repository mainly collects interview questions for NLP algorithm engineers.
Jupyter Notebook
1678
Updated: 2 y ago
License: No License (No License)
biobert by dmis-lab
Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Python
1651
Updated: 2 y ago
License: Proprietary (Proprietary)
gopy by go-python
gopy generates a CPython extension module from a Go package.
Go
1650
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
FARM by deepset-ai
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Python
1646
Updated: 2 y ago
License: Permissive (Apache-2.0)
perl5 by Perl
🐫 The Perl programming language
Perl
1641
Updated: 2 y ago
License: Proprietary (Proprietary)
nlp-in-python-tutorial by adashofdata
Comparing stand-up comedians using natural language processing
Jupyter Notebook
1640
Updated: 2 y ago
License: No License (No License)
FoolNLTK by rockyzhengwu
A Chinese Natural Language Toolkit
Python
1639
Updated: 2 y ago
License: Permissive (Apache-2.0)
NeuroNER by Franck-Dernoncourt
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
Python
1638
Updated: 2 y ago
License: Permissive (MIT)
TextInfoExp by Roshanson
Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, and more.
Python
1607
Updated: 2 y ago
License: No License (No License)
bilm-tf by allenai
TensorFlow implementation of contextualized word representations from bi-directional language models
Python
1605
Updated: 2 y ago
License: Permissive (Apache-2.0)
Chinese-XLNet by ymcui
Pre-Trained Chinese XLNet
Python
1567
Updated: 2 y ago
License: Permissive (Apache-2.0)
lmql by eth-sri
A programming language for large language models.
Python
1565
Updated: 2 y ago
License: Permissive (Apache-2.0)
Recognizers-Text by microsoft
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Packages available at: https://www.nuget.org/profiles/Recognizers.Text, https://www.npmjs.com/~recognizers.text
C#
1564
Updated: 2 y ago
License: Permissive (MIT)
text-analytics-with-python by dipanjanS
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Jupyter Notebook
1560
Updated: 2 y ago
License: Permissive (Apache-2.0)
NeuralNLP-NeuralClassifier by Tencent
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
Python
1554
Updated: 2 y ago
License: Proprietary (Proprietary)
zoekt by google
Fast trigram-based code search
Go
1552
Updated: 2 y ago
License: Permissive (Apache-2.0)
lm-evaluation-harness by EleutherAI
A framework for few-shot evaluation of autoregressive language models.
Python
1550
Updated: 2 y ago
License: Permissive (MIT)
DeBERTa by microsoft
The implementation of DeBERTa
Python
1542
Updated: 2 y ago
License: Permissive (MIT)
ckiptagger by ckiplab
CKIP Neural Chinese Word Segmentation, POS Tagging, and NER
Python
1540
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
ruby-openai by alexrudall
OpenAI API + Ruby! 🤖❤️ Now with ChatGPT and Whisper...
Ruby
1532
Updated: 2 y ago
License: Permissive (MIT)
Keras-TextClassification by yongzhuo
Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Python
1530
Updated: 2 y ago
License: Permissive (MIT)
nlp-journey by msgi
Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Python
1528
Updated: 2 y ago
License: Permissive (Apache-2.0)
sense2vec by explosion
🦆 Contextually-keyed word vectors
Python
1510
Updated: 2 y ago
License: Permissive (MIT)
spago by nlpodyssey
Self-contained Machine Learning and Natural Language Processing library in Go
Go
1510
Updated: 2 y ago
License: Permissive (BSD-2-Clause)
CDial-GPT by thu-coai
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Python
1506
Updated: 2 y ago
License: Permissive (MIT)
unit by samuelmtimbo
Next Generation Visual Programming System
TypeScript
1506
Updated: 2 y ago
License: Permissive (MIT)
bi-att-flow by allenai
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
Python
1505
Updated: 2 y ago
License: Permissive (Apache-2.0)
Peregrine by peregrine-lang
A blazing fast language for the blazing fast world (WIP)
C++
1486
Updated: 2 y ago
License: Weak Copyleft (MPL-2.0)
anago by Hironsan
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
Python
1465
Updated: 2 y ago
License: Permissive (MIT)
nlpcda by 425776024
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
Python
1450
Updated: 2 y ago
License: Permissive (Apache-2.0)
knowledge-graphs by shaoxiongji
A collection of research on knowledge graphs
JavaScript
1441
Updated: 2 y ago
License: No License (No License)
swift-coreml-transformers by huggingface
Swift Core ML 3 implementations of GPT-2, DistilGPT-2, BERT, and DistilBERT for Question answering. Other Transformers coming soon!
Swift
1438
Updated: 2 y ago
License: Permissive (Apache-2.0)
docquery by impira
An easy way to extract information from documents
Python
1437
Updated: 2 y ago
License: Permissive (MIT)
NeuronBlocks by microsoft
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Python
1433
Updated: 2 y ago
License: Permissive (MIT)
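nlp_xiaojiang by yongzhuo
Natural language processing (NLP): the Xiaojiang robot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNET sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner, bert+bilstm+crf), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, and calls via keras-http-service
Python
1432
Updated: 2 y ago
License: Permissive (MIT)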
nanobind by wjakob
nanobind: tiny and efficient C++/Python bindings
C++
1432
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Rasa_NLU_Chi by crownpku
Turn Chinese natural language into structured data (Chinese natural language understanding)
Python
1430
Updated: 2 y ago
License: Permissive (Apache-2.0)
smartgpt by Cormanz
A program that provides LLMs with the ability to complete complex tasks using plugins.
Rust
1426
Updated: 2 y ago
License: Permissive (MIT)
nlp-lang by NLPchina
This project is a base package that wraps the tools commonly used in most NLP projects.
Java
1415
Updated: 2 y ago
License: Permissive (Apache-2.0)
Agent-LLM by Josh-XT
An Artificial Intelligence automation platform. It manages AI instructions across various providers and has adaptive memory and a versatile plugin system with many commands, including web browsing. It supports many AI providers and models, with support growing every day.
Python
1413
Updated: 2 y ago
License: Permissive (MIT)
pke by boudinfl
Python Keyphrase Extraction module
Python
1408
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
eda_nlp by jasonwei20
Data augmentation for NLP, presented at EMNLP 2019
Python
1403
Updated: 2 y ago
License: No License (No License)