Java NLP

Open-source Java projects categorized as NLP

Top 13 Java NLP Projects

  • CoreNLP

    Stanford CoreNLP: A Java suite of core NLP tools.

    Project mention: One does not simply "create a visualization" from unstructured data! | reddit.com/r/dataengineering | 2023-01-11

    If your looking at spacy have a look at Apache OpenNLP and Core NLP.

  • Smile

    Statistical Machine Intelligence & Learning Engine

    Project mention: Just want to vent a bit | reddit.com/r/ProgrammingLanguages | 2022-12-03

    Although it may be a bit more work, you can do both machine learning and AI in Java. If you are doing deep learning, you can use DeepJavaLibrary (I do work on this one at Amazon). If you are looking for other ML algorithms, I have seen Smile, Tribuo, or some around Spark.

  • Sonar

    Write Clean Java Code. Always.. Sonar helps you commit clean code every time. With over 600 unique rules to find Java bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • Tribuo

    Tribuo - A Java machine learning library

    Project mention: Is deeplearning4j a good choice? | reddit.com/r/java | 2023-03-11

    It seems to have been picked up by Eclipse and there is also Oracle Labs' Tribuo and Deep Java Library. All seem active, but I don't know much about any of them. I agree it's probably best to follow the community and use a more popular tool like PyTorch.

  • DatumBox

    Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

  • zemberek-nlp

    NLP tools for Turkish.

    Project mention: Is there a way to consistently determine whether the final vowel drops in bisyllabic nouns when they get an accusative case? | reddit.com/r/turkish | 2022-07-17

    https://github.com/ahmetaa/zemberek-nlp/blob/a9c0f88210dd6a4a1b6152de88d117054a105879/morphology/src/main/resources/tr/master-dictionary.dict Yaninda LastVowelDrop yazanlar

  • BotLibre

    An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.

    Project mention: [P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions. | reddit.com/r/MachineLearning | 2023-03-03
  • CogCompNLP

    CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • airy

    💬 Open Source App Framework to build streaming apps with real-time data - 💎 Build real-time data pipelines and make real-time data universally accessible - 🤖 Join historical and real-time data in the stream to create smarter ML and AI applications. - ⚡ Standardize complex data ingestion and stream data to apps with pre-built connectors

  • DKPro Core

    Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.

  • hawking

    A Natural Language Date Time Parser that Extract date and time from text with context and parse to the required format

  • turing

    :sparkles: :dna: Turing AI - Semantic Navigation, Chatbot using Search Engine and Many NLP Vendors.

  • txtai.java

    Semantic search and workflows in Java

    Project mention: # Run txtai in native code | dev.to | 2022-07-11

    txtai currently has two main methods of execution: Python or via a HTTP API. There are API bindings for JavaScript, Java, Rust and Go.

  • AgePredictor

    Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-03-11.

Java NLP related posts

Index

What are some of the best open-source NLP projects in Java? This list will help you:

Project Stars
1 CoreNLP 8,911
2 Smile 5,700
3 Tribuo 1,134
4 DatumBox 1,084
5 zemberek-nlp 1,041
6 BotLibre 517
7 CogCompNLP 462
8 airy 327
9 DKPro Core 196
10 hawking 168
11 turing 36
12 txtai.java 25
13 AgePredictor 13
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com