A modern proof language
Produced by 兜哥: an open-source introductory book on NLP
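
AI-Writer by BlinkDL
AI novel writing: generates xuanhuan (fantasy), romance, and other web fiction. A Chinese pretrained generative model built on the author's RWKV model, which is similar to GPT-2. RWKV for Chinese novel generation.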
Python
2190
Updated: 2 y ago
License: Permissive (Apache-2.0)

polyglot by aboSamoor
Multilingual text (NLP) processing toolkit
Python
2166
Updated: 2 y ago
License: Proprietary (Proprietary)

PyTorch-NLP by PetrochukM
Basic Utilities for PyTorch Natural Language Processing (NLP)
Python
2157
Updated: 2 y ago
License: Permissive (BSD-3-Clause)

rebiber by yuchenlin
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Python
2147
Updated: 2 y ago
License: Permissive (MIT)

JioNLP by dongrixinyu
An accurate, efficient, and easy-to-use Chinese NLP preprocessing and parsing package. www.jionlp.com
Python
2142
Updated: 2 y ago
License: Permissive (Apache-2.0)

Kashgari by BrikerMan
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
Python
2137
Updated: 4 y ago
License: Permissive (Apache-2.0)

AntiFraudChatBot by Turing-Project
A simple prompt-chatting AI based on wechaty and a fine-tuned NLP model
Python
2135
Updated: 2 y ago
License: Permissive (Apache-2.0)

mt-dnn by namisan
Multi-Task Deep Neural Networks for Natural Language Understanding
Python
2118
Updated: 2 y ago
License: Permissive (MIT)

Information-Extraction-Chinese by crownpku
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT
Python
2103
Updated: 2 y ago
License: No License (No License)

awesome-sentence-embedding by Separius
A curated list of pretrained sentence and word embedding models
Python
2099
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)

logos by maciejhirsz
Create ridiculously fast Lexers
Rust
2096
Updated: 2 y ago
License: Permissive (Apache-2.0)

kcws by koth
Deep Learning Chinese Word Segmentation
C++
2083
Updated: 2 y ago
License: No License (No License)

scattertext by JasonKessler
Beautiful visualizations of how language differs among document types.
Python
2072
Updated: 2 y ago
License: Permissive (Apache-2.0)

textacy by chartbeat-labs
NLP, before and after spaCy
Python
2071
Updated: 2 y ago
License: Proprietary (Proprietary)

mnemonist by Yomguithereal
Curated collection of data structures for the JavaScript/TypeScript language.
JavaScript
2070
Updated: 2 y ago
License: Permissive (MIT)

text2vec by shibing624
text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements Word2Vec, RankBM25, Sentence-BERT, CoSENT, and other text representation and text similarity models, and works out of the box.
Python
2066
Updated: 2 y ago
License: Permissive (Apache-2.0)

wordvectors by Kyubyong
Pre-trained word vectors of 30+ languages
Python
2013
Updated: 4 y ago
License: Permissive (MIT)

pytextrank by DerwenAI
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Python
2008
Updated: 2 y ago
License: Permissive (MIT)

Programming-Alpha-To-Omega by justjavac
A collected series on learning to program from scratch (from α to Ω)
JavaScript
1983
Updated: 4 y ago
License: Strong Copyleft (GPL-2.0)

The-NLP-Pandect by ivan-bilan
A comprehensive reference for all topics related to Natural Language Processing
Python
1967
Updated: 2 y ago
License: Permissive (CC0-1.0)
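
HarvestText by blmoistawinde
A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, and more) that uses unsupervised or weakly supervised methods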
Python
1954
Updated: 2 y ago
License: Permissive (MIT)

xTuring by stochasticai
Easily build, customize and control your own LLMs
Python
1951
Updated: 2 y ago
License: Permissive (Apache-2.0)

DeepKE by zjunlp
An Open Toolkit for Knowledge Graph Extraction and Construction published at EMNLP2022 System Demonstrations.
Python
1950
Updated: 2 y ago
License: Permissive (MIT)

SentEval by facebookresearch
A Python tool for evaluating the quality of sentence embeddings.
Python
1934
Updated: 2 y ago
License: Proprietary (Proprietary)

sequence_tagging by guillaumegenthial
Named Entity Recognition (LSTM + CRF) - Tensorflow
Python
1928
Updated: 2 y ago
License: Permissive (Apache-2.0)

move by move-language
Rust
1923
Updated: 2 y ago
License: Permissive (Apache-2.0)

sling by google
SLING - A natural language frame semantics parser
C++
1921
Updated: 2 y ago
License: Permissive (Apache-2.0)

spacy-course by ines
👩🏫 Advanced NLP with spaCy: A free online course
Python
1902
Updated: 3 y ago
License: Permissive (MIT)

slim by openacid
Surprisingly space efficient trie in Golang (11 bits/key; 100 ns/get).
Go
1877
Updated: 2 y ago
License: Permissive (MIT)

rust-bert by guillaume-be
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Rust
1876
Updated: 2 y ago
License: Permissive (Apache-2.0)

NCRFpp by jiesutd
NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Python
1865
Updated: 2 y ago
License: Permissive (Apache-2.0)

THULAC-Python by thunlp
An Efficient Lexical Analyzer for Chinese
Python
1832
Updated: 2 y ago
License: Permissive (MIT)

HypeScript by ronami
🐬 A simplified implementation of TypeScript's type system written in TypeScript's type system
TypeScript
1827
Updated: 2 y ago
License: Permissive (MIT)

named_entity_recognition by luopeixiang
Chinese named entity recognition, with concrete implementations of several models: HMM, CRF, BiLSTM, and BiLSTM+CRF
Python
1807
Updated: 2 y ago
License: No License (No License)

llama-hub by emptycrown
A library of data loaders for LLMs made by the community -- to be used with GPT Index and/or LangChain
Python
1807
Updated: 2 y ago
License: Permissive (MIT)

fast-bert by utterworks
Super easy library for BERT based NLP models
Python
1793
Updated: 2 y ago
License: Permissive (Apache-2.0)

ABSA-PyTorch by songyouwei
Aspect Based Sentiment Analysis, PyTorch Implementations.
Python
1782
Updated: 2 y ago
License: Permissive (MIT)

longformer by allenai
Longformer: The Long-Document Transformer
Python
1778
Updated: 2 y ago
License: Permissive (Apache-2.0)

hunspell by hunspell
The most popular spellchecking library.
C++
1771
Updated: 2 y ago
License: Weak Copyleft (LGPL-2.1)

BERT-NER-Pytorch by lonePatient
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
Python
1770
Updated: 2 y ago
License: Permissive (MIT)

promptable by cfortuner
Build LLM apps in Typescript/Javascript. 🧑💻 🧑💻 🧑💻 🚀 🚀 🚀
TypeScript
1743
Updated: 2 y ago
License: Permissive (MIT)

kaggle-CrowdFlower by ChenglongChen
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
C++
1742
Updated: 2 y ago
License: No License (No License)

gpt-2-output-dataset by openai
Dataset of GPT-2 outputs for research in detection, biases, and more
Python
1723
Updated: 2 y ago
License: Permissive (MIT)

Senta by baidu
Baidu's open-source Sentiment Analysis System.
Python
1719
Updated: 2 y ago
License: Permissive (Apache-2.0)

ChineseNLP by didi
Datasets and SOTA results for every field of Chinese NLP
HTML
1710
Updated: 2 y ago
License: No License (No License)

rust-for-node-developers by Mercateo
An introduction to the Rust programming language for Node developers.
Rust
1696
Updated: 2 y ago
License: Permissive (Apache-2.0)

BlingFire by microsoft
A lightning fast Finite State machine and REgular expression manipulation library.
C++
1696
Updated: 2 y ago
License: Permissive (MIT)

scryer-prolog by mthom
A modern Prolog implementation written mostly in Rust.
Rust
1695
Updated: 2 y ago
License: Permissive (BSD-3-Clause)