treat

Louis Mullie 65dfc01e74 Fix little bug.		2012-07-25 02:42:46 -04:00
bin	Manifest baby!	2012-06-17 01:52:01 -04:00
files	Update files and manifests.	2012-06-29 16:44:37 -04:00
lib	Fix little bug.	2012-07-25 02:42:46 -04:00
models	Manifest baby!	2012-06-17 01:52:01 -04:00
spec	Rename do() to chain.	2012-07-25 02:07:07 -04:00
tmp	Added manifests.	2012-06-16 01:50:26 -04:00
.gitignore	Add .prop to gitignore.	2012-07-18 16:48:39 -04:00
.rspec	Added tests for time extractors and indexers/searchers.	2012-03-11 23:40:15 -04:00
.travis.yml	Re-add installer.	2012-06-29 13:47:39 -04:00
Gemfile	Add sourcify gem dependency.	2012-07-18 16:42:18 -04:00
LICENSE	Bum version number in LICENSe.	2012-07-23 20:36:35 -04:00
README.md	Update links on the Wiki.	2012-07-11 00:51:30 -04:00
RELEASE	Added some release info for v 1.1.0	2012-06-29 16:49:41 -04:00
Rakefile	Cosmetic changes.	2012-06-29 16:49:46 -04:00
treat.gemspec	Add birch and sourcify dependency.	2012-07-18 16:47:10 -04:00

README.md

Treat is a framework for natural language processing and computational linguistics in Ruby. It provides a common API for a number of gems and external libraries for document retrieval, parsing, annotation, and information extraction.

Current features

Text extractors for PDF, HTML, XML, Word, AbiWord, OpenOffice and image formats (Ocropus).
Text retrieval with indexing and full-text search (Ferret).
Text chunkers, sentence segmenters, tokenizers, and parsers for several languages (Stanford & Enju).
Word inflectors, including stemmers, conjugators, declensors, and number inflectors.
Lexical resources (WordNet interface, several POS taggers for English, Stanford taggers for several languages).
Extraction of language, date/time, topic words (LDA) and keywords (TF*IDF).
Serialization of annotated entities to YAML or XML, or to MongoDB.
Visualization in ASCII tree, directed graph (DOT) and tag-bracketed (standoff) formats.
Linguistic resources, including language detection and tag alignments for several treebanks.
Decision tree and multilayer perceptron classification (liblinear support coming soon).
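
To make the keyword (TF*IDF) feature above more concrete, here is a minimal plain-Ruby sketch of the general technique. It does not use Treat and is not Treat's API: the method names `tokenize` and `tf_idf_keywords`, the regex tokenizer, and the sample documents are all assumptions made for illustration.

```ruby
# Plain-Ruby sketch of TF*IDF keyword scoring. This is NOT Treat's API;
# tokenize and tf_idf_keywords are hypothetical names for illustration.

# Lowercase a string and split it into alphabetic word tokens.
def tokenize(text)
  text.downcase.scan(/[a-z]+/)
end

# Return the top `count` [term, score] pairs for documents[index].
def tf_idf_keywords(documents, index, count = 3)
  tokenized = documents.map { |doc| tokenize(doc) }
  target = tokenized[index]
  return [] if target.empty?

  # Document frequency: number of documents that contain each term.
  doc_freq = Hash.new(0)
  tokenized.each { |tokens| tokens.uniq.each { |term| doc_freq[term] += 1 } }

  scores = target.uniq.map do |term|
    tf = target.count(term).to_f / target.size            # term frequency
    idf = Math.log(documents.size.to_f / doc_freq[term])  # inverse document frequency
    [term, tf * idf]
  end

  # Highest score first; ties are broken alphabetically for stable output.
  scores.sort_by { |term, score| [-score, term] }.first(count)
end

docs = [
  "the parser builds a tree for the sentence",
  "the tagger labels each word in the sentence",
  "the stemmer reduces each word to a stem"
]

tf_idf_keywords(docs, 0).each do |term, score|
  puts format("%-8s %.4f", term, score)
end
```

Words that occur in every document (here, "the") get a score of zero, while words unique to one document rank highest. A real extractor would typically also apply stop-word filtering and stemming before scoring.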

Resources

Read the latest documentation.
See how to install Treat.
Learn how to use Treat.
Help out by contributing to the project.
View a list of papers about tools included in this toolkit.
Open an issue.

License

This software is released under the GPL and includes software released under the GPL, Ruby, Apache 2.0 and MIT licenses.