-
Notifications
You must be signed in to change notification settings - Fork 40
Home
JohnDaws edited this page Dec 3, 2018
·
24 revisions
Baleen is a text analytics framework, allowing for the extraction of information from unstructured and semi-structured data. For instance, it could be used to extract entities (e.g. people, organisations, dates) from news reports and find relationships between them; or it could be used to extract metadata from a large corpus of documents in order to look for patterns or features hidden within.
This wiki is intended as a central repository for information on the use and development of Baleen. As of Baleen 2.4, much of the in-built Baleen documentation has also transitioned to being hosted on this wiki, and users are encouraged to contribute and improve the documentation provided.
- Getting Started
- Configuring History
- Configuring Jobs
- Configuring Logging
- Ordering Annotators
- Relationship Extraction
- Examples
- Upgrading Between Versions
- Usage FAQs
- Using Templates
- Using Third Party Components
- Adding a New Database
- Building Javadoc
- Developing Annotators with Resources
- Developing Collection Readers
- Developing Regex Annotators
- Developing Shared Resources
- Developing with Configuration Parameters
- Expanding the Type System
- Embedding Baleen
- Future Direction
- Wishlist