A text adventure that is quite vague.
Support
Quality
Security
License
Reuse
Bicoastal Datafest Hackathon Entry - Compare bills from different states to find similar ones with common influencers
Support
Quality
Security
License
Reuse
For the task of prediction of author from emails, we used Unigram language model. We started out on the problem by finding out the features that would help model the solution. The features that looked important were: • N-grams of the email • Frequency of each N-gram • Out of Vocabulary words (Spelling mistakes) The combination of first two features describes how the particular author chooses his dictionary set for writing text. Therefore, this feature can be termed as the signature of the author as all writers tend to choose only words from some defined subset of the Vocabulary. Also, the out of vocabulary words, generally the spelling mistakes done by the author, depict the style of the writing text, and therefore, comes to be an important aspect of the solution. The solution, thus, comes to be finding the total probability of each Ngram to be written by the particular author in the email.
Support
Quality
Security
License
Reuse
Utility to create a tagged corpus from gmail messages
Support
Quality
Security
License
Reuse
Trie (prefix tree) with string count, removal, and edit-distance find functionality in python/cython
Support
Quality
Security
License
Reuse
Spell check for russian and english text
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
CoNLL-2000 style evaluation of data using BIO and BEISO representation for mutli-token entities (i.e. chunks).
Support
Quality
Security
License
Reuse
Giza++ dictionary filtering tool and (initial) transliteration dictionary acquisition tool
Support
Quality
Security
License
Reuse
Text classification software based on Hidden Markov Models.
Support
Quality
Security
License
Reuse
Ruby wrapper for Hunspell
Support
Quality
Security
License
Reuse
Malaprop is a project involving transformations of natural text that result in some words being replaced by real-word near neighbours.
Support
Quality
Security
License
Reuse
Convert OpenIE / reverb extraction file to RDF
Support
Quality
Security
License
Reuse
E
Evaluation-of-conformer-generation-toolsby URVnutrigenomica-CTNS
Python 3 Version:Current License: No License (No License)
Repository containing scripts and data for the evaluation of conformer generation tools.
Support
Quality
Security
License
Reuse
Ruby public interface to classify text
Support
Quality
Security
License
Reuse
A trie data structure for arrays
Support
Quality
Security
License
Reuse
AudgenDB radiology report text classification and REST service
Support
Quality
Security
License
Reuse
Convert a text field into multi text
Support
Quality
Security
License
Reuse
Toy programming language
Support
Quality
Security
License
Reuse
Weibo suicide prediction with jieba, FANN and PyQt4
Support
Quality
Security
License
Reuse
Custom-built full text geocoding
Support
Quality
Security
License
Reuse
Spanish stemming
Support
Quality
Security
License
Reuse
conjoiners - multi-platform / multi-language reactive programming library (for Python)
Support
Quality
Security
License
Reuse
An ordered list of the names of the canonical books of the bible, with common abbreviations
Support
Quality
Security
License
Reuse
A packaged client for Clear NLP
Support
Quality
Security
License
Reuse
A small and experimental programming course which takes place at Athlon Sofia.
Support
Quality
Security
License
Reuse
Distribution code for the Inform 7 usability preprocessor
Support
Quality
Security
License
Reuse
Sentence alignment for comparable corpora using python/c++
Support
Quality
Security
License
Reuse
Toolkit for extracting lexical entries from the Apertium (http://www.apertium.org/) dictionaries and adding them to Grammatical Framework (GF, http://www.grammaticalframework.org/). It also implememts an algorithm for extracting GF multi-word expressions from parallel corpora.
Support
Quality
Security
License
Reuse
Library for transliteration
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Extract Web Community's Corpus
Support
Quality
Security
License
Reuse
Token field using https://github.com/thermogl/TITokenField
Support
Quality
Security
License
Reuse
Geoloc is a python package that identifies the places mentioned in a given text.
Support
Quality
Security
License
Reuse
Rudimentary ruby roguelike with ncurses-ruby
Support
Quality
Security
License
Reuse
Python AI/NLP samples
Support
Quality
Security
License
Reuse
Indexing tools to analyze and understand the SSDM
Support
Quality
Security
License
Reuse
Cyrillic to Latin translitter
Support
Quality
Security
License
Reuse
Modernize mecab-ruby
Support
Quality
Security
License
Reuse
g
generator-ibm-service-enablementby ibm-developer
JavaScript 3 Version:Current License: No License (No License)
WARNING: This repository is no longer maintained
Support
Quality
Security
License
Reuse
Interactive word anagram game made with JavaScript
Support
Quality
Security
License
Reuse
Guardian Discovery Week experiment in adding more semantics to news
Support
Quality
Security
License
Reuse
An approach to programming in C# that allows for Homoiconic source-code.
Support
Quality
Security
License
Reuse
A Chinese tweets parser web app, use Jieba for word segmentation. Running on GAE.
Support
Quality
Security
License
Reuse
Sentence generator using Markov chains
Support
Quality
Security
License
Reuse
A reworked Patient Evaluation interface for GNU Health
Support
Quality
Security
License
Reuse
Experimental NER techniques to address common (for me) text analysis problems.
Support
Quality
Security
License
Reuse
Hello World - SVA Fall 2020
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Node package for Arabic NLP tools.
Support
Quality
Security
License
Reuse
n
nebulous-adventureby jdavis
A text adventure that is quite vague.
Python 3Updated: 9 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
lawdiffby dangoldin
Bicoastal Datafest Hackathon Entry - Compare bills from different states to find similar ones with common influencers
Python 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
E
EmailAuthorPredictionby rahularora
For the task of prediction of author from emails, we used Unigram language model. We started out on the problem by finding out the features that would help model the solution. The features that looked important were: • N-grams of the email • Frequency of each N-gram • Out of Vocabulary words (Spelling mistakes) The combination of first two features describes how the particular author chooses his dictionary set for writing text. Therefore, this feature can be termed as the signature of the author as all writers tend to choose only words from some defined subset of the Vocabulary. Also, the out of vocabulary words, generally the spelling mistakes done by the author, depict the style of the writing text, and therefore, comes to be an important aspect of the solution. The solution, thus, comes to be finding the total probability of each Ngram to be written by the particular author in the email.
Python 3Updated: 9 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
gmail-corpusby dlaz
Utility to create a tagged corpus from gmail messages
Python 3Updated: 9 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
counttrieby JoelSjostrand
Trie (prefix tree) with string count, removal, and edit-distance find functionality in python/cython
Python 3Updated: 9 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
p
php-spell-checkerby stepozer
Spell check for russian and english text
PHP 3Updated: 10 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
application_pythonby andreacampi
Ruby 3Updated: 11 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
b
bioevalby savkov
CoNLL-2000 style evaluation of data using BIO and BEISO representation for mutli-token entities (i.e. chunks).
Python 3Updated: 5 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
dict-filteringby pmarcis
Giza++ dictionary filtering tool and (initial) transliteration dictionary acquisition tool
C# 3Updated: 6 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
H
HMM-Theseby hicham321
Text classification software based on Hidden Markov Models.
Java 3Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
j
Support
Quality
Security
License
Reuse
m
malapropby ambimorph
Malaprop is a project involving transformations of natural text that result in some words being replaced by real-word near neighbours.
Python 3Updated: 7 y ago License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
e
ext2rdfby Weissger
Convert OpenIE / reverb extraction file to RDF
Python 3Updated: 8 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
E
Evaluation-of-conformer-generation-toolsby URVnutrigenomica-CTNS
Repository containing scripts and data for the evaluation of conformer generation tools.
Python 3Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
i
ingenia_rubyby ingenia-api
Ruby public interface to classify text
Ruby 3Updated: 10 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
array-trieby mikolalysenko
A trie data structure for arrays
JavaScript 3Updated: 5 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
arrcby chop-dbhi
AudgenDB radiology report text classification and REST service
Python 3Updated: 7 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
j
jquery.multitextby aprimadi
Convert a text field into multi text
JavaScript 3Updated: 8 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
U
UglyLangby EmilHernvall
Toy programming language
Java 3Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
W
Weibo-Suicide-Predictionby LemonChiu
Weibo suicide prediction with jieba, FANN and PyQt4
Python 3Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
mordecaiby johnb30
Custom-built full text geocoding
Python 3Updated: 6 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
e
Support
Quality
Security
License
Reuse
c
conjoiners-pythonby conjoiners
conjoiners - multi-platform / multi-language reactive programming library (for Python)
Python 3Updated: 10 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
b
books-of-the-bibleby TehShrike
An ordered list of the names of the canonical books of the bible, with common abbreviations
JavaScript 3Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
clear-nlp-packagedby cbrew
A packaged client for Clear NLP
Java 3Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
programming-at-athlonby mitio
A small and experimental programming course which takes place at Athlon Sofia.
Ruby 3Updated: 11 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
i
i7upby cbrantley91
Distribution code for the Inform 7 usability preprocessor
Python 3Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
C
CythonAlignerby jrs026
Sentence alignment for comparable corpora using python/c++
Python 3Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
apertiumToFromGFby vitaka
Toolkit for extracting lexical entries from the Apertium (http://www.apertium.org/) dictionaries and adding them to Grammatical Framework (GF, http://www.grammaticalframework.org/). It also implememts an algorithm for extracting GF multi-word expressions from parallel corpora.
Python 3Updated: 10 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
t
translitby romppu75
Library for transliteration
Java 3Updated: 10 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
M
MotifExtractionby THUIR
Java 3Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
C
CorpusMakerby dandelin
Extract Web Community's Corpus
Python 3Updated: 9 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
T
Titanium-Tokenfieldby yec
Token field using https://github.com/thermogl/TITokenField
Python 3Updated: 10 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
g
geolocby GEOLOC
Geoloc is a python package that identifies the places mentioned in a given text.
Python 3Updated: 5 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
r
ruby-roguelikeby wvandyk
Rudimentary ruby roguelike with ncurses-ruby
Ruby 3Updated: 10 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
python-ai-samplesby kudkudak
Python AI/NLP samples
Python 3Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
d
deathsby ftrain
Indexing tools to analyze and understand the SSDM
Python 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
smart_translitterby Mehonoshin
Cyrillic to Latin translitter
Ruby 3Updated: 7 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
Support
Quality
Security
License
Reuse
g
generator-ibm-service-enablementby ibm-developer
WARNING: This repository is no longer maintained
JavaScript 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
letter-runby virginiac32
Interactive word anagram game made with JavaScript
JavaScript 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
semantificationby theefer
Guardian Discovery Week experiment in adding more semantics to news
Ruby 3Updated: 10 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
csVisionby erichosick
An approach to programming in C# that allows for Homoiconic source-code.
C# 3Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
z
zh-tweets-parserby nyanshell
A Chinese tweets parser web app, use Jieba for word segmentation. Running on GAE.
Python 3Updated: 11 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
Markovby pelmers
Sentence generator using Markov chains
Python 3Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
health_encounterby moh-gov-jm
A reworked Patient Evaluation interface for GNU Health
Python 3Updated: 7 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
t
tgniby sujitpal
Experimental NER techniques to address common (for me) text analysis problems.
Java 3Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
hello-worldby areaofeffect
Hello World - SVA Fall 2020
JavaScript 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
learn_to_codeby jimktrains
Python 3Updated: 10 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
arabic-nlpby ielashi
Node package for Arabic NLP tools.
JavaScript 3Updated: 5 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse