A modern proof language
By 兜哥 (Dou Ge): an open-source introductory book on NLP
AI-Writer by BlinkDL
AI novel writing: generates fantasy (xuanhuan) and romance web fiction, among other genres. A Chinese pretrained generative model built on the author's RWKV model, which is similar to GPT-2. RWKV for Chinese novel generation.
Python | 2190 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
polyglot by aboSamoor
Multilingual text (NLP) processing toolkit
Python | 2166 | Updated: 12 mo ago | License: Proprietary (Proprietary)
PyTorch-NLP by PetrochukM
Basic Utilities for PyTorch Natural Language Processing (NLP)
Python | 2157 | Updated: 1 y ago | License: Permissive (BSD-3-Clause)
rebiber by yuchenlin
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Python | 2147 | Updated: 11 mo ago | License: Permissive (MIT)
JioNLP by dongrixinyu
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
Python | 2142 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
Kashgari by BrikerMan
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification. It includes Word2Vec, BERT, and GPT2 language embeddings.
Python | 2137 | Updated: 3 y ago | License: Permissive (Apache-2.0)
AntiFraudChatBot by Turing-Project
A simple prompt-chatting AI based on wechaty and a fine-tuned NLP model
Python | 2135 | Updated: 12 mo ago | License: Permissive (Apache-2.0)
mt-dnn by namisan
Multi-Task Deep Neural Networks for Natural Language Understanding
Python | 2118 | Updated: 11 mo ago | License: Permissive (MIT)
Information-Extraction-Chinese by crownpku
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT
Python | 2103 | Updated: 12 mo ago | License: No License (No License)
awesome-sentence-embedding by Separius
A curated list of pretrained sentence and word embedding models
Python | 2099 | Updated: 1 y ago | License: Strong Copyleft (GPL-3.0)
logos by maciejhirsz
Create ridiculously fast Lexers
Rust | 2096 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
kcws by koth
Deep learning Chinese word segmentation
C++ | 2083 | Updated: 1 y ago | License: No License (No License)
scattertext by JasonKessler
Beautiful visualizations of how language differs among document types.
Python | 2072 | Updated: 1 y ago | License: Permissive (Apache-2.0)
textacy by chartbeat-labs
NLP, before and after spaCy
Python | 2071 | Updated: 11 mo ago | License: Proprietary (Proprietary)
mnemonist by Yomguithereal
Curated collection of data structures for the JavaScript/TypeScript language.
JavaScript | 2070 | Updated: 1 y ago | License: Permissive (MIT)
text2vec by shibing624
text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.
Python | 2066 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
wordvectors by Kyubyong
Pre-trained word vectors of 30+ languages
Python | 2013 | Updated: 3 y ago | License: Permissive (MIT)
pytextrank by DerwenAI
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Python | 2008 | Updated: 11 mo ago | License: Permissive (MIT)
Programming-Alpha-To-Omega by justjavac
Learn programming from scratch: a series roundup (from α to Ω)
JavaScript | 1983 | Updated: 3 y ago | License: Strong Copyleft (GPL-2.0)
The-NLP-Pandect by ivan-bilan
A comprehensive reference for all topics related to Natural Language Processing
Python | 1967 | Updated: 1 y ago | License: Permissive (CC0-1.0)
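HarvestText by blmoistawinde
Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods
Python | 1954 | Updated: 11 mo ago | License: Permissive (MIT)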
xTuring by stochasticai
Easily build, customize and control your own LLMs
Python | 1951 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
DeepKE by zjunlp
An Open Toolkit for Knowledge Graph Extraction and Construction, published at EMNLP 2022 System Demonstrations.
Python | 1950 | Updated: 11 mo ago | License: Permissive (MIT)
SentEval by facebookresearch
A Python tool for evaluating the quality of sentence embeddings.
Python | 1934 | Updated: 11 mo ago | License: Proprietary (Proprietary)
sequence_tagging by guillaumegenthial
Named Entity Recognition (LSTM + CRF) - TensorFlow
Python | 1928 | Updated: 1 y ago | License: Permissive (Apache-2.0)
move by move-language
Rust | 1923 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
sling by google
SLING - A natural language frame semantics parser
C++ | 1921 | Updated: 1 y ago | License: Permissive (Apache-2.0)
spacy-course by ines
Advanced NLP with spaCy: A free online course
Python | 1902 | Updated: 2 y ago | License: Permissive (MIT)
slim by openacid
Surprisingly space efficient trie in Golang (11 bits/key; 100 ns/get).
Go | 1877 | Updated: 11 mo ago | License: Permissive (MIT)
rust-bert by guillaume-be
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Rust | 1876 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
NCRFpp by jiesutd
NCRF++, a Neural Sequence Labeling Toolkit. Easy to apply to any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Python | 1865 | Updated: 12 mo ago | License: Permissive (Apache-2.0)
THULAC-Python by thunlp
An Efficient Lexical Analyzer for Chinese
Python | 1832 | Updated: 12 mo ago | License: Permissive (MIT)
HypeScript by ronami
🐬 A simplified implementation of TypeScript's type system written in TypeScript's type system
TypeScript | 1827 | Updated: 1 y ago | License: Permissive (MIT)
named_entity_recognition by luopeixiang
Chinese named entity recognition (with concrete implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)
Python | 1807 | Updated: 12 mo ago | License: No License (No License)
llama-hub by emptycrown
A library of data loaders for LLMs made by the community, to be used with GPT Index and/or LangChain
Python | 1807 | Updated: 11 mo ago | License: Permissive (MIT)
fast-bert by utterworks
Super easy library for BERT based NLP models
Python | 1793 | Updated: 12 mo ago | License: Permissive (Apache-2.0)
ABSA-PyTorch by songyouwei
Aspect Based Sentiment Analysis, PyTorch Implementations.
Python | 1782 | Updated: 11 mo ago | License: Permissive (MIT)
longformer by allenai
Longformer: The Long-Document Transformer
Python | 1778 | Updated: 11 mo ago | License: Permissive (Apache-2.0)
hunspell by hunspell
The most popular spellchecking library.
C++ | 1771 | Updated: 12 mo ago | License: Weak Copyleft (LGPL-2.1)
BERT-NER-Pytorch by lonePatient
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
Python | 1770 | Updated: 11 mo ago | License: Permissive (MIT)
promptable by cfortuner
Build LLM apps in TypeScript/JavaScript.
TypeScript | 1743 | Updated: 11 mo ago | License: Permissive (MIT)
kaggle-CrowdFlower by ChenglongChen
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
C++ | 1742 | Updated: 1 y ago | License: No License (No License)
gpt-2-output-dataset by openai
Dataset of GPT-2 outputs for research in detection, biases, and more
Python | 1723 | Updated: 11 mo ago | License: Permissive (MIT)
Senta by baidu
Baidu's open-source Sentiment Analysis System.
Python | 1719 | Updated: 12 mo ago | License: Permissive (Apache-2.0)
ChineseNLP by didi
Datasets and SOTA results for every field of Chinese NLP
HTML | 1710 | Updated: 12 mo ago | License: No License (No License)
rust-for-node-developers by Mercateo
An introduction to the Rust programming language for Node developers.
Rust | 1696 | Updated: 1 y ago | License: Permissive (Apache-2.0)
BlingFire by microsoft
A lightning fast Finite State machine and REgular expression manipulation library.
C++ | 1696 | Updated: 11 mo ago | License: Permissive (MIT)
scryer-prolog by mthom
A modern Prolog implementation written mostly in Rust.
Rust | 1695 | Updated: 11 mo ago | License: Permissive (BSD-3-Clause)