Sourceforge

SourceForge is a web-based service that offers software developers a centralized online location to control and manage free and open-source software projects. It provides a source code repository, bug tracking, mirroring of downloads for load balancing, a wiki for documentation, developer and user mailing lists, user-support forums, user-written reviews and ratings, a news bulletin, micro-blog for publishing project updates, and other features. SourceForge was one of the first to offer this serv... For More Info

Total links:- 716
Total paper mentions:- 1668

First ACL Paper:- 2002
Latest ACL Paper:- 2019

Links

Alongwith its Literature Mentions


http://sourceforge.jp/projects/naist-jdic/
Japanese Predicate Argument Structure Analysis Exploiting Argument Position and Type
A Cross-Lingual ILP Solution to Zero Anaphora Resolution

http://freedict.sourceforge.net/
A Repository of Free Lexical Resources for African Languages: The Project and the Method

http://sourceforge.net/projects/maxent/
Extracting Synonyms from Dictionary Definitions
Word Clustering and Word Selection Based Feature Reduction for MaxEnt Based Hindi NER

http://sourceforge.net/projects/arabicmwes/
Parsing Models for Identifying Multiword Expressions

http://sourceforge.net/projects/ghawwasv4/
Arib@QALB-2015 Shared Task: A Hybrid Cascade Model for Arabic Spelling Error Detection and Correction

http://sourceforge.net/projects/xaira
Layering and Merging Linguistic Annotations
Integrating Linguistic Resources: The American National Corpus Model

https://sourceforge.net/directory/os:windows/?q=Lucon
Tools for Building a Corpus to Study the Historical and Geographical Variation of the Romanian Language

http://sourceforge.net/projects/iso-graf/
Biber Redux: Reconsidering Dimensions of Variation in American English

https://sourceforge.net/projects/mmax2/
Design and Development of the MERLIN Learner Corpus Platform

http://opennlp.sourceforge.net/models-1.5
Named Entity Recognition in Broadcast News Using Similar Written Texts

http://biocreative.sourceforge.net/biocreative
Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing

http://bionlp.sourceforge.net/
Knowtator: A Protégé plug-in for annotated corpus construction

http://emu.sourceforge.net/manual/
Introducing a web application for labeling, visualizing speech and correcting derived speech signals

http://sourceforge.net/projects/carafe/
Improving Personal Name Search in the TIGR System

https://sourceforge.net/projects/
CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
Language Model Based Grammatical Error Correction without Annotated Training Data
A Large Parallel Corpus of Full-Text Scientific Articles
A Bayesian Belief Updating Model of Phonetic Recalibration and Selective Adaptation
Citius at SemEval-2017 Task 2: Cross-Lingual Similarity from Comparable Corpora and Dependency-Based Contexts
Corpus-Based Diacritic Restoration for South Slavic Languages
Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin
Evaluation of language identification methods using 285 languages
A Graph-based Text Similarity Measure That Employs Named Entity Information
The Influence of Spelling Errors on Content Scoring Performance
MaltDiver: A Transition-Based Parser Visualizer
Robust Training under Linguistic Adversity
Leveraging Pre-Trained Embeddings for Welsh Taggers

http://sourceforge.net/projects/lpsolve
Multilingual Dependency-based Syntactic and Semantic Parsing
A Cascaded Syntactic and Semantic Dependency Parsing System
Jointly Combining Implicit Constraints Improves Temporal Ordering
Efficient Parsing for Transducer Grammars
Plurality, Negation, and Quantification:Towards Comprehensive Quantifier Scope Disambiguation

http://imdbpy.sourceforge.net/
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access

http://sourceforge.net/p/jobimtext
Supervised All-Words Lexical Substitution using Delexicalized Features

http://swirl-parser.sourceforge.net
Learning to Rank Answers to Non-Factoid Questions from Web Collections

http://soaplab.sourceforge.net/soaplab2/
Using Language Resources in Humanities research

http://sourceforge.net/projects/regulus
Teaching Dialogue to Interdisciplinary Teams through Toolkits

http://tidy.sourceforge.net/
Building a Web Corpus of Czech
A Distributed Resource Repository for Cloud-Based Machine Translation
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
Less Destructive Cleaning of Web Documents by Using Standoff Annotation

http://wn-similarity.sourceforge.net/
Dimensionality Reduction for Text using Domain Knowledge
A Semantically Enhanced Approach to Determine Textual Similarity
Aligning Verb Senses in Two Italian Lexical Semantic Resources
Lexical Acquisition for Opinion Inference: A Sense-Level Lexicon of Benefactive and Malefactive Events
Automatic Classification of WordNet Morphosemantic Relations
Combining Orthogonal Monolingual and Multilingual Sources of Evidence for All Words WSD
On the Impact of Sentiment and Emotion Based Features in Detecting Online Sexual Predators
+/-EffectWordNet: Sense-level Lexicon Acquisition for Opinion Inference
Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild
Building a BioWordNet Using WordNet Data Structures and WordNet’s Software Infrastructure–A Failure Story
WordNet Based Features for Predicting Brain Activity associated with meanings of nouns
HIT-CIR: An Unsupervised WSD System Based on Domain Most Frequent Sense Estimation
Towards Cross-Language Word Sense Disambiguation for Quechua

http://kaldi.sourceforge
Revisiting Word Neighborhoods for Speech Recognition

https://mosesdecoder.svn.sourceforge
SampleRank Training for Phrase-Based Machine Translation

http://aspell.sourceforge.net/
Automated Proof Reading of Clinical Notes
Learning a Spelling Error Model from Search Query Logs
Development of Slovenian Broadcast News Speech Database

https://sourceforge.net/p/halef/openvxml
Leveraging Multimodal Dialog Technology for the Design of Automated and Interactive Student Agents for Teacher Training

http://sourceforge.net/projects/virastyar/
A Three-Layer Architecture for Automatic Post Editing System Using Rule-Based Paradigm

http://jena.sourceforge
An Evolving eScience Environment for Research Data in Linguistics

https://sourceforge.net/projects/ar-text-mining/files
LIM-LIG at SemEval-2017 Task1: Enhancing the Semantic Similarity for Arabic Sentences with Vectors Weighting

http://wordlist.sourceforge.net/12dicts-readme.html
Bootstrapping Named Entity Extraction for the Creation of Mobile Services

http://mumble.sourceforge.net
The Dynamics of Action Corrections in Situated Interaction

https://sourceforge.net/p/bitextor/wiki/Home/
Findings of the WMT 2016 Bilingual Document Alignment Shared Task

http://sourceforge.net/projects/gate
UA-ZSA: Web Page Clustering on the basis of Name Disambiguation

http://javaxdelta.sourceforge.net/
Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’s Edit History

http://foma.sourceforge.net
Developing an Open-Source FST Grammar for Verb Chain Transfer in a Spanish-Basque MT System
Revisiting Multi-Tape Automata for Semitic Morphological Analysis and Generation

http://inprotk.sourceforge.net
INPRO_iSS: A Component for Just-In-Time Incremental Speech Synthesis

http://lpsolve.sourceforge.net/
Hard Constraints for Grammatical Function Labelling
Towards Strict Sentence Intersection: Decoding and Evaluation Strategies
Grammatical Error Correction Using Integer Linear Programming
Joint Determination of Anaphoricity and Coreference Resolution using Integer Programming
A Constituent-Based Approach to Argument Labeling with Joint Inference in Discourse Parsing
A Minimum Error Weighting Combination Strategy for Chinese Semantic Role Labeling
Learning Logical Structures of Paragraphs in Legal Articles
Joint Inference for Bilingual Semantic Role Labeling
Text Summarization Model based on Redundancy-Constrained Knapsack Problem
Graph-Cut-Based Anaphoricity Determination for Coreference Resolution
Semi-Supervised Semantic Role Labeling via Structural Alignment
Phrase-based Compressive Cross-Language Summarization
Enforcing Transitivity in Coreference Resolution
Optimal and Syntactically-Informed Decoding for Monolingual Phrase-Based Alignment

http://neuroph.sourceforge.net
KnCe2013-CORE:Semantic Text Similarity by use of Knowledge Bases

http://champollion.sourceforge.net
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
Cooperation for Arabic Language Resources and Tools — The MEDAR Project
Mining a Comparable Text Corpus for a Vietnamese-French Statistical Machine Translation System
Champollion: A Robust Parallel Text Sentence Aligner
Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons

http://nltk.sourceforge.net/
Fully Unsupervised Graph-Based Discovery of General-Specific Noun Relationships from Web Corpora Frequency Counts
ProLiV - a Tool for Teaching by Viewing Computational Linguistics
Dynamic Path Prediction and Recommendation in a Museum Environment
Hands-On NLP for an Interdisciplinary Audience
Interactive Predictive Parsing
Statistical Confidence Measures for Probabilistic Parsing
Computational Semantics in the Natural Language Toolkit
NLTK: The Natural Language Toolkit
ASV Toolbox: a Modular Collection of Language Exploration Tools

http://emu.sourceforge.net/):
The MULI Project: Annotation and Analysis of Information Structure in German and English

http://hyperestraier.sourceforge.net/
Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering

http://kaldi.sourceforge.net/
A Machine Learning based Music Retrieval and Recommendation System
Trans-dimensional Random Fields for Language Modeling
Vector-space topic models for detecting Alzheimer’s disease
The WaveSurfer Automatic Speech Recognition Plugin
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish

http://aracomlex.sourceforge.net/
Arabic Spelling Correction using Supervised Learning
Arabic Word Generation and Modelling for Spell Checking

http://multivalent.sourceforge.net/
Annotated Web as corpus

http://riso.sourceforge.net/
J-NERD: Joint Named Entity Recognition and Disambiguation with Rich Linguistic Features

http://biocreative.sourceforge.net/
Evaluating the Impact of Alternative Dependency Graph Encodings on Solving Event Extraction Tasks
Identifying Interaction Sentences from Biological Literature Using Automatically Extracted Patterns
Exploring Two Biomedical Text Genres for Disease Recognition
Customizing an Information Extraction System to a New Domain
How Feasible and Robust is the Automatic Extraction of Gene Regulation Events? A Cross-Method Evaluation under Lab and Real-Life Conditions
Building a Named Entity Recognizer in Three Days: Application to Disease Name Recognition in Bulgarian Epicrises

http://smtmood.sourceforge.net
Mood at work: Ramses versus Pharaoh
MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices
MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

http://trans.sourceforge.net/en/presentation.php
WOZ Acoustic Data Collection for Interactive TV
New functions for a multipurpose multimodal tool for phonetic and linguistic analysis of very large speech corpora
An Advanced Speech Corpus for Norwegian
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction

http://maxent.sourceforge.net
Web-based frequency dictionaries for medium density languages
Fine-Grained Classification of Named Entities by Fusing Multi-Features
Chinese Part-of-Speech Tagging: One-at-a-Time or All-at-Once? Word-Based or Character-Based?
There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
Automatic Part-of-Speech Tagging for Bengali: An Approach for Morphologically Rich Languages in a Poor Resource Scenario
Cross-lingual Adaptation as a Baseline: Adapting Maximum Entropy Models to Bulgarian
A Lucene and Maximum Entropy Model Based Hedge Detection System
Using a morphological analyzer in high precision POS tagging of Hungarian
Bringing Active Learning to Life
Iterative Scaling and Coordinate Descent Methods for Maximum Entropy
Chinese Word Segmentation Based on an Approach of Maximum Entropy Modeling
Semi-Supervised Maximum Entropy Based Approach to Acronym and Abbreviation Normalization in Medical Texts
Two-Phase LMR-RC Tagging for Chinese Word Segmentation
Named Entity Recognition: A Maximum Entropy Approach Using Global Information
Evaluating the Impact of Coder Errors on Active Learning
Creating a Test Corpus of Clinical Notes Manually Tagged for Part-of-Speech Information
A Hybrid Framework for Scalable Opinion Mining in Social Media: Detecting Polarities and Attitude Targets
Assessing the Challenge of Fine-Grained Named Entity Recognition and Classification
Closing the Gap: Learning-Based Information Extraction Rivaling Knowledge-Engineering Methods
Multimodality and Dialogue Act Classification in the RoboHelper Project

http://jocr.sourceforge.net/
Construction and Annotation of a French Folkstale Corpus

http://web-harvest.sourceforge.net
Shedding (a Thousand Points of) Light on Biased Language

http://sourceforge.net/projects/opennlp/
QuickView: NLP-based Tweet Search
The Integration of Syntactic Parsing and Semantic Role Labeling
Recognizing Named Entities in Tweets

http://wordfreak.sourceforge.net
Can You Tag the Modal? You Should.
Chooser: a Multi-Task Annotation Tool
Knowtator: A Protégé plug-in for annotated corpus construction
Efficient Annotation with the Jena ANnotation Environment (JANE)
WordFreak: An Open Tool for Linguistic Annotation
Text Annotation with OpenNLP and UIMA
C-3: Coherence and Coreference Corpus

http://www.sourceforge.net/projects/libassp
Evaluation of Automatic Formant Trackers

http://sourceforge.net/projects/xml-copy-editor/
A Repository of Free Lexical Resources for African Languages: The Project and the Method

http://lsids.sourceforge.net/
Maskkot — An Entity-centric Annotation Platform

http://frenchmozilla.sourceforge.net/
A Framework for Data Management for the Online Volunteer Translators’ Aid System QRLex

http://nltk.sourceforge.net
TextTree Construction for Parser and Treebank Development
GLEU: Automatic Evaluation of Sentence-Level Fluency
Teaching Dialogue to Interdisciplinary Teams through Toolkits
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Automatic Acquisition of Semantics-Extraction Patterns
Automatic Acquisition of Training Data for Statistical Parsers

http://apps.sourceforge.net/
Indian Language Screen Readers and Syllable Based Festival Text-to-Speech Synthesis System

http://nltk.sourceforge.org
Teaching Applied Natural Language Processing: Triumphs and Tribulations

http://amtoolbox.sourceforge.net/pitu/
Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies

http://sourceforge.net/projects/inprotk/
Interpreting Situated Dialogue Utterances: an Update Model that Uses Speech, Gaze, and Gesture Information

http://sourceforge.net/projects/trove4j
Top Accuracy and Fast Dependency Parsing is not a Contradiction

http://openmwe.sourceforge.jp/
Construction of an Idiom Corpus and its Application to Idiom Identification based on WSD Incorporating Idiom-Specific Features

http://banner.sourceforge.net/
CBR-Tagger: a case-based reasoning approach to the gene/protein mention problem
Evaluating the Use of Empirically Constructed Lexical Resources for Named Entity Recognition
Disease Mention Recognition with Specific Features

http://foma.sourceforge.net/dokuwiki/doku.php
Finite State Morphology Tool for Latvian

http://sourceforge.net/projects/arabic-spell/files/arabic-spell
Arabic Word Generation and Modelling for Spell Checking

http://jvnsegmenter.sourceforge.net/
Building and Annotating the Linguistically Diverse NTU-MC (NTU-Multilingual Corpus)

https://sourceforge.net/projects/termevaluator/
Local-Global Vectors to Improve Unigram Terminology Extraction

http://tadm.sourceforge.net/
Unsupervised Parse Selection for HPSG
A Grammar Sparrer for Norwegian
Discriminant Ranking for Efficient Treebanking
Feature Selection for Fluency Ranking
Structural Correspondence Learning for Parse Disambiguation
Adaptability of Lexical Acquisition for Large-scale Grammars
Resolving Speculation: MaxEnt Cue Classification and Dependency-Based Scope Rules
Recognizing Textual Relatedness with Predicate-Argument Structures
Domain Adaptation of Maximum Entropy Language Models

http://corpussearch.sourceforge.net
A Treebank Query System Based on an Extracted Tree Grammar
The MULINCO corpus and corpus platform
A TAG-derived Database for Treebank Search and Parser Analysis

http://crfpp.sourceforge.net/index.html
CYUT-III System at Chinese Grammatical Error Diagnosis Task
Chinese Grammatical Error Diagnosis by Conditional Random Fields

http://sourceforge.net/projects/ebmsumcorpus
Appraising UMLS Coverage for Summarizing Medical Evidence

http://sourceforge.net/p/kaldi/code/
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices

http://sourceforge.net/projects/reck/files/reck_v1.0.0.tar.gz/download
Relational Structures and Models for Coreference Resolution

http://matplotlib.sourceforge
Multidisciplinary Instruction with the Natural Language Toolkit

http://sourceforge.net/
Reproducible Results in Parsing-Based Machine Translation: The JHU Shared Task Submission
Checking a structured pathology report for completeness of content using terminological knowledge
Jointly or Separately: Which is Better for Parsing Heterogeneous Dependencies?
Machine Translation Experiments on PADIC: A Parallel Arabic DIalect Corpus
DirRelCond3: Detecting Textual Entailment Across Languages With Conditions On Directional Text Relatedness Scores
Corpus-amostra Coelho Netto: compilação, anotação e ocorrências em textos literários dos séc. XIX e XX (Corpus-sample Coelho Netto: Compilation, Annotation and Occurrences in Literary Texts from XIX and XX Centuries) [in Portuguese]
A Statistical Semantic Parser that Integrates Syntax and Semantics
Arabic Spelling Correction using Supervised Learning
Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
Learning for Semantic Parsing with Statistical Machine Translation
Incremental Generation of Visually Grounded Language in Situated Dialogue (demonstration system)

http://qallme.sourceforge.net/
QALL-ME needs AIR: a portability study

http://knowtator.sourceforge.net/
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
Overview of the CL-SciSumm 2016 Shared Task
Integrated Tools for Query-driven Development of Light-weight Ontologies and Information Extraction Components
Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project

http://pmd.sourceforge.net
The Excitement Open Platform for Textual Inferences

http://zql.sourceforge.net
Summarizing Source Code using a Neural Attention Model

http://sourceforge.net/projects/uplug/
Aligning Bilingual Literary Works: a Pilot Study
ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts alignment of parallel texts

http://knowtator.sourceforge.net
SPIED: Stanford Pattern based Information Extraction and Diagnostics
Towards a Methodology for Named Entities Annotation

http://parcels.sourceforge.net
Annotated Web as corpus

http://sourceforge.net/projects/gibbslda/
Modeling Term Translation for Document-informed Machine Translation

http://dome.sourceforge.net/
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

http://corpussearch.sourceforge.net/
Converting SynTagRus Dependency Treebank into Penn Treebank Style

http://flexcrfs.sourceforge
Pseudo-Word for Phrase-Based Machine Translation

http://lists.sourceforge.net/
NLTK: The Natural Language Toolkit

http://secondstring.sourceforge.net/
Indexing Methods for Faster and More Effective Person Name Search
AquaLog: An ontology-driven Question Answering System to interface the Semantic Web
A Hybrid Disambiguation Measure for Inaccurate Cultural Heritage Data
Linking Databases using Matched Arabic Names
International Journal of Computational Linguistics & Chinese Language Processing, Volume 19, Number 1, March 2014

http://wnpojo.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://ayaspell.sourceforge.net
TECHLIMED system description for the Shared Task on Automatic Arabic Error Correction
SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers’ Errors
TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System

http://wnwa.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://sourceforge.net/projects/joshua/files
Faster and Smaller N-Gram Language Models

http://maxent.sourceforge.net/
An Exploration of Features for Recognizing Word Emotion
The SoNLP-DP System in the CoNLL-2015 shared Task
Hedge Classification with Syntactic Dependency Features Based on an Ensemble Classifier
Cascaded Classifiers for Confidence-Based Chemical Named Entity Recognition
A Constituent-Based Approach to Argument Labeling with Joint Inference in Discourse Parsing
SRCB-WSD: Supervised Chinese Word Sense Disambiguation with Key Features
Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps
Error Detection for Treebank Validation
Finding Appropriate Subset of Votes Per Classifier Using Multiobjective Optimization: Application to Named Entity Recognition
Projection-based Acquisition of a Temporal Labeller
Reinforcing English Countability Prediction with One Countability per Discourse Property
Joint Arabic Segmentation and Part-Of-Speech Tagging
Hedge Classification in Biomedical Texts with a Weakly Supervised Selection of Keywords
Semantic Role Labeling of NomBank: A Maximum Entropy Approach
High Throughput Modularized NLP System for Clinical Text
A Novel Reordering Model Based on Multi-layer Phrase for Statistical Machine Translation
Data-driven Approaches for Information Structure Identification
Meeting TempEval-2: Shallow Approach for Temporal Tagger
Using Variance as a Stopping Criterion for Active Learning of Frame Assignment
Combining Deep and Shallow Approaches in Parsing German
It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text
Parsing Syntactic and Semantic Dependencies for Multiple Languages with A Pipeline Approach
PNNL: A Supervised Maximum Entropy Approach to Word Sense Disambiguation
Feature Subset Selection Using Genetic Algorithm for Named Entity Recognition
Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali
Representing words as regions in vector space
Supporting inferences in semantic space: representing words as regions
Two Methods to Incorporate ’Local Morphosyntactic’ Features in Hindi Dependency Parsing
Importance of Linguistic Constraints in Statistical Dependency Parsing
A Maximum Entropy Approach to Chinese Word Segmentation
Graded Word Sense Assignment
NLTK: The Natural Language Toolkit
Tagging and Linking Web Forum Posts
Semi-supervised Semantic Role Labeling Using the Latent Words Language Model
SoNLP-DP System for ConLL-2016 English Shallow Discourse Parsing
Enhancing Performance of Lexicalised Grammars
A Puristic Approach for Joint Dependency Parsing and Semantic Role Labeling

http://infomap-nlp.sourceforge.net/
Towards a Unified Approach for Opinion Question Answering and Summarization
A Language Independent Approach for Name Categorization and Discrimination
Combining Different Features of Idiomaticity for the Automatic Classification of Noun+Verb Expressions in Basque
ISA meets Lara: An incremental word space model for cognitively plausible simulations of semantic learning
Latent Semantic Analysis Models on Wikipedia and TASA
Text-to-Text Semantic Similarity for Automatic Short Answer Grading
Learning Sentiment Lexicons in Spanish
Semantic Density Analysis: Comparing Word Meaning across Time and Phonetic Space
Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component

http://sourceforge.net/projects/ar-text-mining/files/Arabic-
Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping

http://opennlp.sourceforge.net/about.html
Exploring Co-Reference Chains for Concept Annotation of Domain Texts

http://max-mstparser.sourceforge.net/
Dependency Parser for Chinese Constituent Parsing

http://multiword.sourceforge.net/PHITE.php?sitesig=
Multiword Expressions Dataset for Indian Languages

http://sourceforge.net/apps/mediawiki/jwordnet/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation
Extending a wordnet framework for simplicity and scalability

http://en.sourceforge.jp/projects/ipadic/
Generating “A for Alpha” When There Are Thousands of Characters

http://snopy.sourceforge.net/user-guide.html
TTT: A Tree Transduction Language for Syntactic and Semantic Processing

http://wntw.sourceforge.net
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database

http://sourceforge.net/projects/mecab/files/mecab-
Extracting Pre-ordering Rules from Predicate-Argument Structures

http://sourceforge.net/projects/touchgraph/
Keyphrase-Driven Document Visualization Tool

http://multiword.sourceforge.net/sharedtask2017
The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

http://sourceforge.net/projects/macvissta/
An Exchange Format for Multimodal Annotations

http://sourceforge.net/p/lemur/wiki/RankLib/;
Towards Constructing Sports News from Live Text Commentary

http://cmusphinx.sourceforge.net
New Dimensions in Testimony Demonstration
SPPAS: a tool for the phonetic segmentation of speech
A Web Application for Automated Dialect Analysis
Evaluating Spoken Dialogue Processing for Time-Offset Interaction
OpenDial: A Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
Exploiting the Human Computational Effort Dedicated to Message Reply Formatting for Training Discursive Email Segmenters
A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing

http://sourceforge.net/project/showfiles.php?group
Three BioNLP Tools Powered by a Biological Lexicon
How to Make the Most of NE Dictionaries in Statistical NER

http://nlpfarm.sourceforge.net
Open Resources for Language Technology

http://transag.sourceforge.net
Future Directions in Technological Support for Language Documentation

http://senseclusters.sourceforge.net
Unsupervised Discrimination and Labeling of Ambiguous Names
Automatic Cluster Stopping with Criterion Functions and the Gap Statistic
SenseClusters - Finding Clusters that Represent Word Senses
Word Sense Discrimination by Clustering Contexts in Vector and Similarity Spaces
Improving Name Discrimination: A Language Salad Approach
Selecting the “Right” Number of Senses Based on Clustering Criterion Functions
Duluth: Word Sense Discrimination in the Service of Lexicography
Duluth-WSI: SenseClusters Applied to the Sense Induction Task of SemEval-2
SemEval-2015 Task 15: A CPA dictionary-entry-building task
Cluster Stopping Rules for Word Sense Discrimination
Duluth : Word Sense Induction Applied to Web Page Clustering
UMND2 : SenseClusters Applied to the Sense Induction Task of Senseval-4
SenseClusters: Unsupervised Clustering and Labeling of Similar Contexts
Whats in a Name? The Automatic Recognition of Metonymical Location Names

http://sourceforge.net/projects/medslt/
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Japanese Speech Understanding using Grammar Specialization
MedSLT: A Limited-Domain Unidirectional Grammar-Based Medical Speech Translator
Evaluating Task Performance for a Unidirectional Controlled Language Medical Speech Translation System

http://jwordnet.sourceforge.net/
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
Multilingual, Efficient and Easy NLP Processing with IXA Pipeline
It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text
Building a BioWordNet Using WordNet Data Structures and WordNet’s Software Infrastructure–A Failure Story

http://sourceforge.net/projects/joshua/files/joshua/1.3/joshua-
Reproducible Results in Parsing-Based Machine Translation: The JHU Shared Task Submission

http://startdict.sourceforge.net/Dictionaries_zh_TW.php
英文技術文獻中一般動詞與其受詞之中文翻譯的語境效用 (Collocational Influences on the Chinese Translations of Non-Technical English Verbs and Their Objects in Technical Documents) [In Chinese]
應用平行語料建構中文斷詞組件 (Applications of Parallel Corpora for Chinese Segmentation) [In Chinese]

http://sourceforge.net/projects/runes
Saxon: an Extensible Multimedia Annotator

http://opennlp.sourceforge.net/models-
ASAP: Automatic Semantic Alignment for Phrases

http://ghalatawi.sourceforge.net/
Arib@QALB-2015 Shared Task: A Hybrid Cascade Model for Arabic Spelling Error Detection and Correction

http://opensmile.sourceforge.net/
An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation

http://multiword.sourceforge.net/download/MWE2008-
E-Connecting Balkan Languages

http://sourceforge.net/projects/ngram/
The Ngram Statistics Package (Text::NSP) : A Flexible Tool for Identifying Ngrams, Collocations, and Word Associations
A Case Study on Interoperability for Language Resources and Applications

http://sourceforge.net/projects/hecl
Interoperability and Knowledge Representation in Distributed Health and Fitness Companion Dialogue System

http://sourceforge.jp/projects/sfnet_crfpp/
An Introduction to BLCU Personal Attributes Extraction System

http://cmusphinx.sourceforge.net/html/cmusphinx.php
A Case-Based Reasoning Approach for Speech Corpus Generation
First Broadcast News Transcription System for Khmer Language
Automatic Scoring of Children’s Read-Aloud Text Passages and Word Lists
Grounded Language Modeling for Automatic Speech Recognition of Sports Video

http://quex.sourceforge.net/
E-magyar – A Digital Language Processing System

https://sourceforge.net/projects/jclal/
TextPro-AL: An Active Learning Platform for Flexible and Efficient Production of Training Data for NLP Tasks

http://sourceforge.net/projects/germandict/
Disambiguation of Period Characters in Clinical Narratives
Unsupervised Abbreviation Detection in Clinical Narratives

http://jung.sourceforge.net
Scientific Authoring Support: A Tool to Navigate in Typed Citation Graphs
V3: Unsupervised Generation of Domain Aspect Terms for Aspect Based Sentiment Analysis

https://sourceforge.net/projects/ar-text-mining/
Semantic Similarity of Arabic Sentences with Word Embeddings

http://audacity.sourceforge.net
A Speech Corpus for Dyslexic Reading Training

http://sourceforge.net/projects/elixir-fm
Dependency Parsing of Modern Standard Arabic with Lexical and Inflectional Features

http://libots.sourceforge.net
A Personal Storytelling about Your Favorite Data
Collaborative Summarization: When Collaborative Filtering Meets Document Summarization

http://gposttl.sourceforge.net/
A Statistical Test for Grammar

http://sourceforge.jp/projects/ipadic/docs/
Semantic Annotation of Japanese Functional Expressions and its Impact on Factuality Analysis

http://tashaphyne.sourceforge.net
Enhancing Root Extractors Using Light Stemmers

https://sourceforge.net/projects/medslt/
A Knowledge-Modeling Approach for Multilingual Regulus Lexica
Grammar Sharing Techniques for Rule-based Multilingual NLP Systems

http://pybsddb.sourceforge.net/bsddb3.html
A Breadth-First Representation for Tree Matching in Large Scale Forest-Based Translation

http://knowtator.sourceforge.net/index.shtml
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora
CAT: the CELCT Annotation Tool

http://sourceforge.net/projects/aligner
Improving English-Russian sentence alignment through POS tagging and Damerau-Levenshtein distance

http://sourceforge.jp/projects/naist-jdic
Splitting Noun Compounds via Monolingual and Bilingual Paraphrasing: A Study on Japanese Katakana Words

http://defoe.sourceforge
TransRead: Designing a Bilingual Reading Experience with Machine Translation Technologies

http://sourceforge.net/projects/mecab/files/
A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation
A Binarized Neural Network Joint Model for Machine Translation
Learning Word Reorderings for Hierarchical Phrase-based Statistical Machine Translation
Learning Hierarchical Translation Spans
Improving Neural Machine Translation through Phrase-based Forced Decoding
Extracting Pre-ordering Rules from Predicate-Argument Structures

http://mstparser.sourceforge.net
Testing the Effect of Morphological Disambiguation in Dependency Parsing of Basque
Integrating Graph-Based and Transition-Based Dependency Parsers
Utilizing Dependency Language Models for Graph-based Dependency Parsing Models
Improving Graph-based Dependency Parsing with Decision History
On WordNet Semantic Classes and Dependency Parsing
Multilingual Dependency Learning: Exploiting Rich Features for Tagging Syntactic and Semantic Dependencies
Semantic Dependency Parsing of NomBank and PropBank: An Efficient Integrated Approach via a Large-scale Feature Selection
Combining Rule-Based and Statistical Syntactic Analyzers
Analyzing and Integrating Dependency Parsers
Benchmarking of Statistical Dependency Parsers for French
Improving Dependency Parsing with Subtrees from Auto-Parsed Data
Evaluation of Dependency Parsers on Unbounded Dependencies
Characterizing the Errors of Data-Driven Dependency Parsing Models

http://bionlp.sourceforge.net/Knowtator/index.shtml
Annotating Arguments: The NOMAD Collaborative Annotation Tool
The SYNC3 Collaborative Annotation Tool
BOEMIE Ontology-Based Text Annotation Tool

http://sourceforge.net/projects/jazzy/
TransChat: Cross-Lingual Instant Messaging for Indian Languages

http://poliqarp.sourceforge.net/
Poliqarp: An open source corpus indexer and search engine with syntactic extensions

http://trans.sourceforge.net/
Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus
Euronews: a multilingual speech corpus for ASR
The SAMMIE Multimodal Dialogue Corpus Meets the Nite XML Toolkit
The Sweet-Home speech and multimodal corpus for home automation interaction
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
Aix Map Task corpus: The French multimodal corpus of task-oriented dialogue
JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
A Real-life, French-accented Corpus of Air Traffic Control Communications
The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
HESITA(te) in Portuguese

http://sourceforge.net/projects/wikiprep
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains

http://jtidy.sourceforge.net/
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles
Kitten: a tool for normalizing HTML and extracting its textual content

http://sourceforge.net/projects/corpussearch/
Surprisal and Satisfaction: Towards an Information-theoretic Characterization of Presuppositions with a Diachronic Application

http://htmlparser.sourceforge.net/
Evaluating automatically generated user-focused multi-document summaries for geo-referenced images
Enhanced Personalized Search using Social Data

http://sourceforge.jp/projects/
Item Development and Scoring for Japanese Oral Proficiency Testing
Evaluating Features for Identifying Japanese-Chinese Bilingual Synonymous Technical Terms from Patent Families
Building and Annotating the Linguistically Diverse NTU-MC (NTU-Multilingual Corpus)

http://sourceforge.net/projects/arabic-named-entity-corpora/
A Hybrid Approach to Features Representation for Fine-grained Arabic Named Entity Recognition

http://sox.sourceforge.net
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
The COST278 Pan-European Broadcast News Database
Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora

http://Infomap-nlp.sourceforge.net/
Learning to Grade Short Answer Questions using Semantic Similarity Measures and Dependency Graph Alignments

http://multiword.sourceforge.net/PHITE.php?
A Generic Framework for Multiword Expressions Treatment: from Acquisition to Applications

http://mmax2.sourceforge.net/
CorA: A web-based annotation tool for historical and other non-standard language data
Annotating modals with GraphAnno, a configurable lightweight tool for multi-level annotation
Annotating Discourse Anaphora
Usability Recommendations for Annotation Tools
Annotating Attribution Relations: Towards an Italian Discourse Treebank
Modality in Text: a Proposal for Corpus Annotation
PotTS: The Potsdam Twitter Sentiment Corpus
Sentence and Expression Level Annotation of Opinions in User-Generated Discourse
Towards a Unified Approach to Modality Annotation in Portuguese
CAT: the CELCT Annotation Tool

https://sourceforge.net/projects/mecab/
Large-Scale Categorization of Japanese Product Titles Using Neural Attention Models
Web-Scale Language-Independent Cataloging of Noisy Product Listings for E-Commerce

http://dylan.sourceforge.net
Training an adaptive dialogue policy for interactive learning of visually grounded word meanings
Interactively Learning Visually Grounded Word Meanings from a Human Tutor

http://sourceforge.net/projects/winglpk/
Extractive Multi-Document Summarization with Integer Linear Programming and Support Vector Regression

http://sourceforge.net/projects/desr
Learning to Rank Answers on Large Online QA Collections
Learning to Rank Answers to Non-Factoid Questions from Web Collections

http://sourceforge.net/projects/maxparser/
Fourth-Order Dependency Parsing
Dependency Parser Adaptation with Subtrees from Auto-Parsed Target Domain Data
High-order Graph-based Neural Dependency Parsing

http://nltk.sourceforge.net/index.php/Book
ProPOSEL: a human-oriented prosody and PoS English lexicon for machine-learning and NLP
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering

http://pywikipediabot.sourceforge.net/
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary

http://sourceforge.net/projects/ontocreation/
Lexico-Syntactic Patterns for Automatic Ontology Building
Pattern-Based Ontology Construction from Selected Wikipedia Pages

http://strigi.sourceforge.net/
Integration of a Multilingual Keyword Extractor in a Document Management System

http://sourceforge.net/projects/jatlas/
An Exchange Format for Multimodal Annotations

http://mecab.sourceforge.net/
Sorting Texts by Readability
Japanese Pronunciation Prediction as Phrasal Statistical Machine Translation
Unsupervised Japanese-Chinese Opinion Word Translation using Dependency Distance and Feature-Opinion Association Weight
Japanese Predicate Argument Structure Analysis Exploiting Argument Position and Type
MSS: Investigating the Effectiveness of Domain Combinations and Topic Features for Word Sense Disambiguation
Zero Pronoun Resolution can Improve the Quality of J-E Translation
Using Collocations and K-means Clustering to Improve the N-pos Model for Japanese IME
Detecting Cyberbullying Entries on Informal School Websites Based on Category Relevance Maximization
Translation of Patent Sentences with a Large Vocabulary of Technical Terms Using Neural Machine Translation
Robust Extraction of Named Entity Including Unfamiliar Word
Co-STAR: A Co-training Style Algorithm for Hyponymy Relation Acquisition from Structured and Unstructured Text
Advanced Corpus Solutions for Humanities Researchers
Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia
Post-ordering by Parsing for Japanese-English Statistical Machine Translation
Textual Entailmaint Recognition using Word Overlap, Mutual Information and Subpath Set
Distortion Model Considering Rich Context for Statistical Machine Translation
Overview of the 3rd Workshop on Asian Translation
Measuring Interlanguage: Native Language Identification with L1-influence Metrics
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
A Compositional Approach toward Dynamic Phrasal Thesaurus
Automatic Labeling of Voiced Consonants for Morphological Analysis of Modern Japanese Literature
Unsupervised Approach for Dialogue Act Classification
Overview of the 1st Workshop on Asian Translation
Overview of the 4th Workshop on Asian Translation
Automatic Paraphrasing of Japanese Functional Expressions Using a Hierarchically Organized Dictionary
A Pregroup Analysis of Japanese Causatives
Divide and Translate: Improving Long Distance Reordering in Statistical Machine Translation
An Abstract Generation System for Social Scientific Papers
A Hybrid Approach for Functional Expression Identification in a Japanese Reading Assistant
Morphological Analysis for Japanese Noisy Text based on Character-level and Word-level Normalization
Multi-label Text Categorization with Model Combination based on F1-score Maximization
A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields
A Named Entity Recognition Method using Rules Acquired from Unlabeled Data
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Computing Paraphrasability of Syntactic Variants Using Web Snippets
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
Developing Japanese WordNet Affect for Analyzing Emotions
Evaluating Features for Identifying Japanese-Chinese Bilingual Synonymous Technical Terms from Patent Families
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
Mining Revision Log of Language Learning SNS for Automated Japanese Error Correction of Second Language Learners
Detection of Users Suspected of Pretending to Be Other Users in a Community Site by Using Messages Submitted to Non-Target Categories
Language Homogeneity in the Japanese Wikipedia
Returning-Home Analysis in Tokyo Metropolitan Area at the time of the Great East Japan Earthquake using Twitter Data
Enlarging Paraphrase Collections through Generalization and Instantiation
Multi-aspects Review Summarization Based on Identification of Important Opinions and their Similarity
Enriching Dictionaries with Images from the Internet - Targeting Wikipedia and a Japanese Semantic Lexicon: Lexeed -
Applying Sentiment-oriented Sentence Filtering to Multilingual Review Classification
Detection of submitters suspected of pretending to be someone else in a community site
Overview of the 2nd Workshop on Asian Translation
Overview of the 5th Workshop on Asian Translation
Proceedings of the 1st Workshop on Asian Translation (WAT2014)

http://pinyin4j.sourceforge.net/
Professor or Screaming Beast? Detecting Anomalous Words in Chinese
Detecting Word Misuse in Chinese

http://sourceforge.net/projects/jgibblda/
Word Sense Induction Using Lexical Chain based Hypergraph Model

http://sourceforge.net/projects/jwordsplitter/
The University of Illinois submission to the WMT 2015 Shared Translation Task

http://tadm.sourceforge.net
Exploiting Semantic Information for HPSG Parse Selection

http://trans.sourceforge.net
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
The REPERE Corpus : a multimodal corpus for person recognition
Evaluation of multimodal components within CHIL: The evaluation packages and results
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
Standoff Coordination for Multi-Tool Annotation in a Dialogue Corpus
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Corpus Analysis of Spoken Smart-Home Interactions with Older Users
Advances in the Witchcraft Workbench Project

http://sourceforge.net/projects/jazzy
The Influence of Spelling Errors on Content Scoring Performance

http://banner.sourceforge.net
Simple and Effective Semi-Supervised Question Answering
Towards Event Extraction from Full Texts on Infectious Diseases
Leveraging coreference to identify arms in medical abstracts: An experimental study

http://opennlp.sourceforge.net/models-1.5/
Named Entity Recognition in Swedish Health Records with Character-Based Deep Bidirectional LSTMs
Evaluating the Impact of Phrase Recognition on Concept Tagging
IITP: Supervised Machine Learning for Aspect based Sentiment Analysis

http://opennlp.sourceforge.net
A Shortest Path Dependency Kernel for Relation Extraction
Evaluation of Unsupervised Information Extraction
Open Resources for Language Technology
Can Click Patterns across User’s Query Logs Predict Answers to Definition Questions?
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain
Solving Relational Similarity Problems Using the Web as a Corpus
Acquisition of Named-Entity-Related Relations for Searching
Edlin: an Easy to Read Linear Learning Framework
Shedding (a Thousand Points of) Light on Biased Language
Combining Relational and Attributional Similarity for Semantic Relation Classification
Integrating Co-occurrence Statistics with Information Extraction for Robust Retrieval of Protein Interactions from Medline
RePort - Um Sistema de Extração de Informações Aberta para Língua Portuguesa (Report - An Open Information Extraction System for Portuguese Language)
Text Annotation with OpenNLP and UIMA
KUL: Recognition and Normalization of Temporal Expressions
Cross-lingual Adaptation as a Baseline: Adapting Maximum Entropy Models to Bulgarian
Learning to Extract Relations from the Web using Minimal Supervision
A Lucene and Maximum Entropy Model Based Hedge Detection System
Bootstrapping Biomedical Ontologies for Scientific Text using NELL
EM Works for Pronoun Anaphora Resolution
The Tradeoffs Between Open and Traditional Relation Extraction
Extracting STRIPS Representations of Actions and Events
Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies
NUS at the HOO 2011 Pilot Shared Task
A Beam-Search Decoder for Grammatical Error Correction
Evaluating Roget‘s Thesauri
ASAP-II: From the Alignment of Phrases to Textual Similarity
Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates
ASAP: Automatic Semantic Alignment for Phrases
Towards Creation of a Corpus for Argumentation Mining the Biomedical Genetics Research Literature
A Sequencing Model for Situation Entity Classification
Recognising Entailment within Discourse
Assessing the Role of Discourse References in Entailment Inference
ThaiHerbMiner: A Thai Herbal Medicine Mining and Visualizing Tool
Source-Language Entailment Modeling for Translating Unknown Terms
Machine Reading Between the Lines: A Simple Evaluation Framework for Extracted Knowledge Bases
Cross-Caption Coreference Resolution for Automatic Image Understanding
A Unified Kernel Approach for Learning Typed Sentence Rewritings
Scoring Spoken Responses Based on Content Accuracy
Automating Model Building in c-rater
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
UCB: System Description for SemEval Task #4
Semantic Role Labeling of Implicit Arguments for Nominal Predicates
Unsupervised Learning of Narrative Event Chains
Generating Constituent Order in German Clauses
Developing Robust Models for Favourability Analysis
DERE: A Task and Domain-Independent Slot Filling Framework for Declarative Relation Extraction
Timed Annotations — Enhancing MUC7 Metadata by the Time It Takes to Annotate Named Entities
Discovering Relations between Noun Categories
A Survey of Text Mining Architectures and the UIMA Standard
HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions
Tunable Domain-Independent Event Extraction in the MIRA Framework

http://ghalatawi.sourceforge.net
Autocorrection of arabic common errors for large text corpus
SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers’ Errors

http://multiword.sourceforge.net/mwe2013
Proceedings of the 9th Workshop on Multiword Expressions

https://audacity.sourceforge.net
Semi-automatic annotation of the UCU accents speech corpus

http://weka.sourceforge.net/doc.dev/
Bilingual Chronological Classification of Hafez’s Poems

http://wordlist.sourceforge.net/
The Design of a Proofreading Software Service
Improving Web 2.0 Opinion Mining Systems Using Text Normalisation Techniques
Combining Lexical Resources for Contextual Synonym Expansion
A Figure of Merit for the Evaluation of Web-Corpus Randomness
BRAINSUP: Brainstorming Support for Creative Sentence Generation

http://jung.sourceforge
ExB Text Summarizer

http://tashaphyne.sourceforge.net/
Exploring the Effects of Word Roots for Arabic Sentiment Analysis

http://ngramj.sourceforge.net/
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
Integration of a Multilingual Keyword Extractor in a Document Management System

http://jazzy.sourceforge.net/
SWATCS65: Sentiment Classification Using an Ensemble of Class Projects
System Combination for Grammatical Error Correction
Measuring Contextual Fitness Using Error Contexts Extracted from the Wikipedia Revision History
Helping Our Own 2011: UKP Lab System Description
SWATAC: A Sentiment Analyzer using One-Vs-Rest Logistic Regression
TeamX: A Sentiment Analyzer with Enhanced Lexicon Mapping and Weighting Scheme for Unbalanced Data
SWASH: A Naive Bayes Classifier for Tweet Sentiment Identification
Joint English Spelling Error Correction and POS Tagging for Language Learners Writing
A Broad-Coverage Normalization System for Social Media Language
The UI System in the HOO 2012 Shared Task on Error Correction
Insertion, Deletion, or Substitution? Normalizing Text Messages without Pre-categorization nor Supervision

http://sourceforge.net/projects/miacp/
Pattern Dictionary of English Prepositions

http://jgap.sourceforge.net/
Is it Worth Submitting this Run? Assess your RTE System with a Good Sparring Partner

http://sourceforge.net/projects/zgen/
An Empirical Comparison Between N-gram and Syntactic Language Models for Word Ordering

http://opennlp.sourceforge.net/projects.html
Automatic Approaches for Gene-Drug Interaction Extraction from Biomedical Text: Corpus and Comparative Evaluation
Exploiting Category-Specific Information for Multi-Document Summarization
Target-dependent Twitter Sentiment Classification
Extending an interoperable platform to facilitate the creation of multilingual and multimodal NLP applications

https://sourceforge.net/projects/febrl/
A Graph-based Approach for Contextual Text Normalization

http://sourceforge.net/projects/procep/
Towards a Large Parallel Corpus of Cleft Constructions

http://pdftohtml.sourceforge.net/
Understanding Tables in Context Using Standard NLP Toolkits
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building

https://sourceforge.net/p/ccg2ptb/home/Home/
A Machine Learning Approach to Convert CCGbank to Penn Treebank

http://sourceforge.net/projects/dvbtools/
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development

http://snowball.sourceforge.net/
Evaluation of a Vector Space Similarity Measure in a Multilingual Framework

http://julius.sourceforge.jp/en_index.php
Developing further speech recognition resources for Welsh

http://jena.sourceforge.net
A User and NLP-Assisted Strategic Workflow for a Social Semantic OWL 2-Based Knowledge Platform
Towards Metadata Interoperability

http://taga.sourceforge.net/
Development and Alignment of a Domain-Specific Ontology for Question Answering

https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-eval-translator/
Rule-based Machine Translation between Indonesian and Malaysian

http://oprofile.sourceforge.net/
Mixture Pruning and Roughening for Scalable Acoustic Models

http://gargantua.sourceforge.net
Structural and Topical Dimensions in Multi-Task Patent Translation

http://mmax2.sourceforge.net
Agile Corpus Annotation in Practice: An Overview of Manual and Automatic Annotation of CVs
Knowledge-lean projection of coreference chains across languages
C-3: Coherence and Coreference Corpus
Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation
Connective-based Local Coherence Analysis: A Lexicon for Recognizing Causal Relationships
Three Knowledge-Free Methods for Automatic Lexical Chain Extraction
Towards a Methodology for Named Entities Annotation
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research

http://htmlunit.sourceforge.net/
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
VSoLSCSum: Building a Vietnamese Sentence-Comment Dataset for Social Context Summarization
WikiKreator: Improving Wikipedia Stubs Automatically

http://agtk.sourceforge.net
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
Building and Using a Corpus of Shallow Dialogue Annotated Meetings
Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Functional Requirements for an Interlinear Text Editor

http://sourceforge.net/projects/giza/
OWNS: Cross-lingual Word Sense Disambiguation Using Weighted Overlap Counts and Wordnet Based Similarity Measures

http://sourceforge.jp/projects/comedic/
Divide and Translate: Improving Long Distance Reordering in Statistical Machine Translation

http://sourceforge.net/projects/jregex/files/
Pattern-Based Ontology Construction from Selected Wikipedia Pages

http://texrex.sourceforge.net/
CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws

http://sourceforge.net/p/lemur/wiki/RankLib/
Prune-and-Score: Learning for Greedy Coreference Resolution
Utilizing Microblogs for Automatic News Highlights Extraction
QU-IR at SemEval 2016 Task 3: Learning to Rank on Arabic Community Question Answering Forums with Word Embedding

http://kdiff3.sourceforge.net/
EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora

http://sourceforge.net/projects/conexp/
FCA_Retrieval: A Multi-operator Algorithm for Information Retrieval from Binary Concept Lattice

http://khc.sourceforge.net/
Obtaining Japanese Lexical Units for Semantic Frames from Berkeley FrameNet Using a Bilingual Corpus

http://sourceforge.net/projects/mstparser/
A Method for Automatically Constructing Case Frames for English
Exploring Difficulties in Parsing Imperatives and Questions
Event Extraction as Dependency Parsing
Effect of Minimal Semantics on Dependency Parsing
Joint Models for Chinese POS Tagging and Dependency Parsing
Learning Reliability of Parses for Domain Adaptation of Dependency Parsing
Model Combination for Event Extraction in BioNLP 2011
Event Extraction as Dependency Parsing for BioNLP 2011
Parse Reranking Based on Higher-Order Lexical Dependencies
Dynamic Programming for Higher Order Parsing of Gap-Minding Trees
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Error Detection for Treebank Validation
Steps to Excellence: Simple Inference with Refined Scoring of Dependency Trees
Improved Parsing and POS Tagging Using Inter-Sentence Consistency Constraints
A Cross-lingual Annotation Projection-based Self-supervision Approach for Open Information Extraction
Hybrid Constituent and Dependency Parsing with Tsinghua Chinese Treebank
High-order Graph-based Neural Dependency Parsing
Multilingual Dependency Parsing and Domain Adaptation using DeSR
Towards Domain Adaptation for Parsing Web Data
Polynomial Time Joint Structural Inference for Sentence Compression
Sentence-Level Instance-Weighting for Graph-Based and Transition-Based Dependency Parsing
Fast and Accurate Arc Filtering for Dependency Parsing
Punctuation Processing for Projective Dependency Parsing
Hebrew Dependency Parsing: Initial Results
A Simple Approach for HPSG Supertagging Using Dependency Information
Inspecting the Structural Biases of Dependency Parsing Algorithms
Linköping: Cubic-Time Graph Parsing with a Simple Scoring Scheme
Joint prediction in MST-style discourse parsing for argumentation mining
Low-Rank Tensors for Scoring Dependency Structures
An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing
Joint Training of Dependency Parsing Filters through Latent Support Vector Machines
Data point selection for cross-language adaptation of dependency parsers

http://crftagger.sourceforge.net
Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature

https://apertium.svn.sourceforge.net/
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation

http://jbnc.sourceforge.net
A Semantic Scattering Model for the Automatic Interpretation of Genitives
LCC-SRN: LCC’s SRN System for SemEval 2007 Task 4

http://wordlist.sourceforge.net
Classifying English Documents by National Dialect
Pronunciation Modeling in Spelling Correction for Writers of English as a Foreign Language
CoMeT: Integrating different levels of linguistic modeling for meaning assessment

http://linguistico.sourceforge.net/pages/thesaurus
Towards an Empirical Subcategorization of Multiword Expressions

http://owlapi.sourceforge.net
“Hidden semantics”: what can we learn from the names in an ontology?
Expressing OWL axioms by English sentences: dubious in theory, feasible in practice

http://abcplus.sourceforge.net
An alternate approach towards meaningful lyric generation in Tamil

http://docbook.sourceforge.net/release/litprog/
Joint Grammar Development by Linguists and Computer Scientists

http://sourceforge.net/projects/wordlist/files/SCOWL/
Arabic Word Generation and Modelling for Spell Checking

http://sourceforge.net/projects/perstem/
A Morphological Lexicon for the Persian Language

http://sourceforge.net/projects/pocket-crf-1/
Improving Named Entity Recognition in Tweets via Detecting Non-Standard Words
Document Summarization via Guided Sentence Compression
Improving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees

https://matxin.sourceforge.net
Exploiting portability to build an RBMT prototype for a new source language

http://netcat.sourceforge.net/
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development

http://glpk-java.sourceforge.net
Parallel Sentence Compression

http://sourceforge.net/projects/uplug
ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts alignment of parallel texts

https://sourceforge.net/projects/aligner/
Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities

http://weka.sourceforge.net/doc.stable/weka/classifiers/
INSIGHT Galway: Syntactic and Lexical Features for Aspect Based Sentiment Analysis

http://sourceforge.net/projects/nlp-sanchay
Dependency Annotation Scheme for Indian Languages
Insights into Non-projectivity in Hindi

http://www.sourceforge.net/projects/maxent/
Learning Count Classifier Preferences of Malay Nouns

http://wnsqlbuilder.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://palmkit.sourceforge.net
Item Development and Scoring for Japanese Oral Proficiency Testing

http://sourceforge.net/projects/joshua/files/joshua/1.3/wmt2010-
Reproducible Results in Parsing-Based Machine Translation: The JHU Shared Task Submission

https://sourceforge.net/projects/aucopro
Automatic Compound Processing: Compound Splitting and Semantic Analysis for Afrikaans and Dutch

http://sourceforge.net/projects/tashkeela/
Smoothing methods for a morpho-statistical approach of automatic diacritization Arabic texts (Méthodes de lissage d’une approche morpho-statistique pour la voyellation automatique des textes arabes) [in French]
Unsupervised Construction of a Lexicon and a Repository of Variation Patterns for Arabic Modal Multiword Expressions

http://wavesurfer.sourceforge.net/
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish

http://wordlist.sourceforge.net/pos-readme
Robust Sentiment Detection on Twitter from Biased and Noisy Data

http://opennlp.sourceforge.net/models-1.5/en-sent.bin
Chemical-Induced Disease Detection Using Invariance-based Pattern Learning Model

https://sourceforge.net/projects/spare2015/files/457-diseases
Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

http://sourceforge.net/projects/zpar/files/0.3/
A Separately Passive-Aggressive Training Algorithm for Joint POS Tagging and Dependency Parsing

http://jake2.cvs.sourceforge.net/
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment

http://imli.sourceforge.net
Acharya - A Text Editor and Framework for working with Indic Scripts

http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_04_LAW-MWE-CxG_
Deep-BGT at PARSEME Shared Task 2018: Bidirectional LSTM-CRF Model for Verbal Multiword Expression Identification

http://multiword.sourceforge.net/PHITE.php?sitesig=MWE
Using Linguistic Resources to Evaluate the Quality of Annotated Corpora

http://sourceforge.net/projects/saxon
Corpus Development and Publication

http://crfpp.sourceforge.net//
Creating and evaluating a consensus for negated and speculative words in a Swedish clinical corpus

http://pyml.sourceforge.net
Prosodic Correlates of Rhetorical Relations

http://soaplab.sourceforge.net/soaplab2
Interoperability and Technology for a Language Resources Factory
Towards a User-Friendly Platform for Building Language Resources based on Web Services

http://jvntextpro.sourceforge.net/
Dealing with Out-Of-Vocabulary Problem in Sentence Alignment Using Word Similarity
A Model of Vietnamese Person Named Entity Question Answering System
NTU-MC Toolkit: Annotating a Linguistically Diverse Corpus
VnCoreNLP: A Vietnamese Natural Language Processing Toolkit

http://StarDict.sourceforge.net/
Graph Representation of Synonymy and Translation Resources for Crosslinguistic Modelisation of Meaning

https://sourceforge.net/projects
Dual-Type Automatic Speech Recogniser Designs for Spoken Dialogue Systems

http://jena.sourceforge.net/index.html
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources

http://jazzy.sourceforge.net
Unsupervised Context-Sensitive Spelling Correction of Clinical Free-Text with Word and Character N-Gram Embeddings
The Illinois-Columbia System in the CoNLL-2014 Shared Task
Improving Text Normalization via Unsupervised Model and Discriminative Reranking
Improving Text Normalization using Character-Blocks Based Models and System Combination
The Influence of Spelling Errors on Content Scoring Performance

http://sigmakee.cvs.sourceforge
Enhancing the Japanese WordNet

http://annocultor.sourceforge.net/
An Intelligent Authoring Environment for Abstract Semantic Representations of Cultural Object Descriptions

http://tidy.sourceforge.net
Categorizing Web Pages as a Preprocessing Step for Information Extraction
An Unsupervised System for Identifying English Inclusions in German Text
Annotated Web as corpus

http://transonto.sourceforge.net/
An Intelligent Authoring Environment for Abstract Semantic Representations of Cultural Object Descriptions

http://stardict.sourceforge.net/Dictionaries_zh_TW.php
以範例為基礎之英漢TIMSS詴題輔助翻譯 (Example Based Machine Translation of TIMSS Test Items) [In Chinese]

http://emu.sourceforge.net/
Discourse-level Annotation for Investigating Information Structure
Likelihood Ratio-based Forensic Voice Comparison on L2 speakers: A Case of Hong Kong native male production of English vowels
The BAS Speech Data Repository

http://multiword.sourceforge.net/sharedtask2018
Veyn at PARSEME Shared Task 2018: Recurrent Neural Networks for VMWE Identification
Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018)

http://multiword.sourceforge.net/cupt-format
Veyn at PARSEME Shared Task 2018: Recurrent Neural Networks for VMWE Identification
Without lexicons, multiword expression identification will never fly: A position statement

http://hyperestraier.sourceforge.net/index.html
Exploiting Wikipedia as External Knowledge for Named Entity Recognition

http://crfpp.sourceforge.net/
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding
Using Morphological and Syntactic Structures for Chinese Opinion Analysis
Correcting Comma Errors in Learner Essays, and Restoring Commas in Newswire Text
Exploiting Rich Features for Detecting Hedges and their Scope
Information extraction using finite state automata and syllable n-grams in a mobile environment
Term Contributed Boundary Tagging by Conditional Random Fields for SIGHAN 2010 Chinese Word Segmentation Bakeoff
An Empirical Study in Source Word Deletion for Phrase-Based Statistical Machine Translation
Which Performs Better on In-Vocabulary Word Segmentation: Based on Word or Character?
A Generalised Hybrid Architecture for NLP
A Survey of Arabic Named Entity Recognition and Classification
Predicting Chinese Abbreviations with Minimum Semantic Unit and Global Constraints
Building a Web Corpus of Czech
Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition
Which is More Suitable for Chinese Word Segmentation, the Generative Model or the Discriminative One?
Opinion Word Expansion and Target Extraction through Double Propagation
Hedge Classification with Syntactic Dependency Features Based on an Ensemble Classifier
Generation of Compound Words in Statistical Machine Translation into Compounding Languages
Learning Reliability of Parses for Domain Adaptation of Dependency Parsing
A Tree-to-String Phrase-based Model for Statistical Machine Translation
CRF-based Experiments for Cross-Domain Chinese Word Segmentation at CIPS-SIGHAN-2010
Medical Entity Recognition: A Comparaison of Semantic and Statistical Methods
Efficient Inference of CRFs for Large-Scale Natural Language Data
Understanding the Semantic Intent of Natural Language Query
Named Entity Recognition for Telugu
Simplification de phrases pour l’extraction de relations (Sentence Simplification for Relation Extraction) [in French]
Annotating named entities in clinical text by combining pre-annotation and active learning
Constructing Chinese Abbreviation Dictionary: A Stacked Approach
A Chinese Word Segmentation System Based on Structured Support Vector Machine Utilization of Unlabeled Text Corpus
Improving Transliteration Accuracy Using Word-Origin Detection and Lexicon Lookup
Tense and Aspect Error Correction for ESL Learners Using Global Context
Annotating and Recognising Named Entities in Clinical Notes
Everybody loves a rich cousin: An empirical study of transliteration through bridge languages
Accelerated Training of Maximum Margin Markov Models for Sequence Labeling: A Case Study of NP Chunking
Contextual Information Improves OOV Detection in Speech
Comparing Local and Sequential Models for Statistical Incremental Natural Language Understanding
A Minimum Error Weighting Combination Strategy for Chinese Semantic Role Labeling
Recognition of Polish Temporal Expressions
Expert Finding for Microblog Misinformation Identification
Chinese Evaluative Information Analysis
Consistency Maintenance in Prosodic Labeling for Reliable Prediction of Prosodic Breaks
Semantic Role Labeling for News Tweets
Dependency-based Analysis for Tagalog Sentences
Learning Logical Structures of Paragraphs in Legal Articles
Sequential Tagging of Semantic Roles on Chinese FrameNet
Arabic-English Text Translation Leveraging Hybrid NER
Context-based Arabic Morphological Analysis for Machine Translation
A Deep Learning Approach to Machine Transliteration
NE Tagging for Urdu based on Bootstrap POS Learning
Statistical Transliteration for Cross Language Information Retrieval using HMM alignment model and CRF
Predicting Opinion Dependency Relations for Opinion Analysis
Recognizing Named Entities in Tweets
Cascaded Chinese Weibo Segmentation Based on CRFs
A Multi-layer Chinese Word Segmentation System Optimized for Out-of-domain Tasks
The Taming of Reconcile as a Biomedical Coreference Resolver
Joint Annotation of Search Queries
TEXT2TABLE: Medical Text Summarization System Based on Named Entity Recognition and Modality Identification
Cascading Classifiers for Named Entity Recognition in Clinical Notes
Productive Generation of Compound Words in Statistical Machine Translation
From Field Notes towards a Knowledge Base
Factors Affecting Part-of-Speech Tagging for Tagalog
Towards a Balanced Named Entity Corpus for Dutch
Aggregating Machine Learning and Rule Based Heuristics for Named Entity Recognition
ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields
Word Boundary Decision with CRF for Chinese Word Segmentation
Active Learning for Chinese Word Segmentation
A Cascade Method for Detecting Hedges and their Scope in Natural Language Text
A Cross-lingual Annotation Projection-based Self-supervision Approach for Open Information Extraction
Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty
Enhancement of Feature Engineering for Conditional Random Field Learning in Chinese Word Segmentation Using Unlabeled Data
Arabic Named Entity Recognition using Optimized Feature Sets
Multi-Document Summarization using Automatic Key-Phrase Extraction
Learning from Bullying Traces in Social Media
TimeML Events Recognition and Classification: Learning CRF Models with Semantic Roles
Term Contributed Boundary Feature using Conditional Random Fields for Chinese Word Segmentation Task
Fast Decoding and Easy Implementation: Transliteration as Sequential Labeling
Hybrid Citation Extraction from Patents
Knowledge-Poor Approach to Shallow Parsing: Contribution of Unsupervised Part-of-Speech Induction
學術會議資訊之擷取及其應用 (Information Extraction for Academic Conference and It’s Application) [In Chinese]
Practical Very Large Scale CRFs
A Language-Independent Transliteration Schema Using Character Aligned Models at NEWS 2009
Multi-Task Learning in Conditional Random Fields for Chunking in Shallow Semantic Parsing
Structural Opinion Mining for Graph-based Sentiment Representation
An Experiment in Integrating Sentiment Features for Tech Stock Prediction in Twitter
意見持有者辨識之研究 (A Study on Identification of Opinion Holders) [In Chinese]
The infinite HMM for unsupervised PoS tagging
Discovery of Dependency Tree Patterns for Relation Extraction
Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination
BOEMIE Ontology-Based Text Annotation Tool
A Reranking Model for Discourse Segmentation using Subtree Features
Combining Character-Based and Subsequence-Based Tagging for Chinese Word Segmentation
The Character-based CRF Segmenter of MSRA&NEU for the 4th Bakeoff
What's great and what's not: learning to classify the scope of negation for improved sentiment analysis
Improving Chunk-based Semantic Role Labeling with Lexical Features
Machine Transliteration using Target-Language Grapheme and Phoneme: Multi-engine Transliteration Approach
Genetic Algorithm (GA) Implementation for Feature Selection in Manipuri POS Tagging
Forward-backward Machine Transliteration between English and Chinese Based on Combined CRFs
Experiments in Base-NP Chunking and Its Role in Dependency Parsing for Thai
Improved Chinese Parsing Using Named Entity Cue
Verbs are where all the action lies: Experiences of Shallow Parsing of a Morphologically Rich Language
Improving Chinese Word Segmentation and POS Tagging with Semi-supervised Methods Using Large Auto-Analyzed Data
MWU-Aware Part-of-Speech Tagging with a CRF Model and Lexical Resources
Identification of Opinion Holders
Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy
Opinion Identification in Spanish Texts
Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation
Tagging and Linking Web Forum Posts
TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
Named Entity Recognition in Chinese News Comments on the Web
Re-Ranking Models Based-on Small Training Data for Spoken Language Understanding
Mining Fine-grained Opinion Expressions with Shallow Parsing
TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval-2
What Information is Helpful for Dependency Based Semantic Role Labeling
Extracting Relation Descriptors with Conditional Random Fields
A Broad-Coverage Normalization System for Social Media Language
Supervised and Semi-Supervised Sequence Learning for Recognition of Requisite Part and Effectuation Part in Law Sentences
ParsCit: an Open-source CRF Reference String Parsing Package
Iterative Viterbi A* Algorithm for K-Best Sequential Decoding
Arabic-Segmentation Combination Strategies for Statistical Machine Translation
Classifying Dialogue Acts in One-on-One Live Chats
Chunking with Max-Margin Markov Networks
Chinese NER Using CRFs and Logic for the Fourth SIGHAN Bakeoff
Dependency Parser for Chinese Constituent Parsing
Insertion, Deletion, or Substitution? Normalizing Text Messages without Pre-categorization nor Supervision

http://mosesdecoder.sourceforge.net/
Sub-sentencial Paraphrasing by Contextual Pivot Translation

http://sourceforge.net/projects/carafe
Domain-specific vs. Uniform Modeling for Coreference Resolution
Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
Sources of Performance in CRF Transfer Training: a Business Name-tagging Case Study
SpatialML: Annotation Scheme, Corpora, and Tools
A simple feature-copying approach for long-distance dependencies
Classification of Discourse Coherence Relations: An Exploratory Study using Multiple Knowledge Sources

https://sourceforge.net/p/kaldi/code/
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices

http://sourceforge.net/projects/mstparser
A Cascaded Syntactic and Semantic Dependency Parsing System
2D Trie for Fast Parsing
Concise Integer Linear Programming Formulations for Dependency Parsing
Task-oriented Evaluation of Syntactic Parsers and Their Representations
XLike Project Language Analysis Services
HPSG Parsing with Shallow Dependency Constraints
Summarization with a Joint Model for Sentence Extraction and Compression
Multilingual Dependency Parsing Using Global Features
A Joint Syntactic and Semantic Dependency Parsing System based on Maximum Entropy Models
Learning with Probabilistic Features for Improved Pipeline Models
Generalization of Words for Chinese Dependency Parsing
Stacking Dependency Parsers
Web-Scale Features for Full-Scale Parsing
Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application
MAXSIM: A Maximum Similarity Metric for Machine Translation Evaluation
An Iterative Approach for Joint Dependency Parsing and Semantic Role Labeling

http://secondstring.sourceforge.net
A Graph-Search Framework for GeneId Ranking
Cross Linguistic Name Matching in English and Arabic
Robust Similarity Measures for Named Entities Matching
Practical Markov Logic Containing First-Order Quantifiers with Application to Identity Uncertainty

http://olac.wiki.sourceforge.net/
Toward a Global Infrastructure for the Sustainability of Language Resources

http://sourceforge.net/projects/tercpp/
Machine Translation Quality Estimation Across Domains

http://sourceforge.net/projects/zpar/
Influence of Parser Choice on Dependency-Based MT
Syntactic SMT Using a Discriminative Text Generation Model
Fast and Accurate Shift-Reduce Constituent Parsing
Chinese Parsing Exploiting Characters
Transition-based Spinal Parsing
Transition-based Neural Constituent Parsing
Multi-view Chinese Treebanking

http://htmlcleaner.sourceforge.net
Web pages segmentation for document selection in Question Answering (Pré-segmentation de pages web et sélection de documents pertinents en Questions-Réponses) [in French]

http://wnejb.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://mulan.sourceforge
Czech Text Document Corpus v 2.0

http://transag.sourceforge.net/
SPPAS: a tool for the phonetic segmentation of speech

http://viper-toolkit.sourceforge.net
The REPERE Corpus : a multimodal corpus for person recognition

http://getm-project.sourceforge.net/
An Exploration of Mining Gene Expression Mentions and Their Anatomical Locations from Biomedical Text

http://sourceforge.net/projects/kaldi
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices

https://apertium.svn.sourceforge
Enlarging Monolingual Dictionaries for Machine Translation with Active Learning and Non-Expert Users

http://sourceforge.net/apps/mediawiki/opennlp/
EVBCorpus - A Multi-Layer English-Vietnamese Bilingual Corpus for Studying Tasks in Comparative Linguistics
Building English-Vietnamese Named Entity Corpus with Aligned Bilingual News Articles

http://sourceforge.net/projects/randlm/
Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval

https://sourceforge.net/p/lemur/wiki/RankLib/
Par4Sim – Adaptive Paraphrasing for Text Simplification
Scoring Sentence Singletons and Pairs for Abstractive Summarization
Word and Document Embedding with vMF-Mixture Priors on Context Word Vectors

http://minorthird.sourceforge.net
Regular Expression Learning for Information Extraction
A Graph-Search Framework for GeneId Ranking
Non-textual Event Summarization by Applying Machine Learning to Template-based Language Generation
Learning to Classify Email into “Speech Acts”
Named Entity Recognition in Questions: Towards a Golden Collection
Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk
A Joint Model to Identify and Align Bilingual Named Entities

http://sourceforge.net/projects/thot/
Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation

http://sourceforge.net/projects/crf/
Learning Representations for Weakly Supervised Natural Language Processing Tasks
Open-Domain Semantic Role Labeling by Modeling Word Spans
Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling
Exploring Representation-Learning Approaches to Domain Adaptation
Language Models as Representations for Weakly Supervised NLP Tasks

https://sourceforge.net/projects/arabic-wordlist/
POS Tagging for Improving Code-Switching Identification in Arabic

http://jwbf.sourceforge.net/
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary

http://aperture.sourceforge.net/
Extracting Information for Context-aware Meeting Preparation

http://irrlicht.sourceforge.net/
Fusing Eye Gaze with Speech Recognition Hypotheses to Resolve Exophoric References in Situated Dialogue

https://sourceforge.net/project/showfiles.php?group_id=59
Dependency Tree-based SRL with Proper Pruning and Extensive Feature Engineering

http://sourceforge.net/projects/tematres/
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

http://pyke.sourceforge.net/
A Strategic Reasoning Model for Generating Alternative Answers

https://sourceforge.net/projects/tdbbrowser/
METU Turkish Discourse Bank Browser

http://asmalgorithm.sourceforge.net
ASM Kernel: Graph Kernel using Approximate Subgraph Matching for Relation Extraction
Generalizing an Approximate Subgraph Matching-based System to Extract Events in Molecular Biology and Cancer Genetics

http://prdownloads.sourceforge.net/weka/
UMCC_DLSI-(EPS): Paraphrases Detection Based on Semantic Distance

https://sourceforge.net/projects/aucopro/
More Than Only Noun-Noun Compounds: Towards an Annotation Scheme for the Semantic Modelling of Other Noun Compound Types

http://sourceforge.net/projects/runes/
An Approach to Modeling Heterogeneous Resources for Information Extraction

http://chemnomparse.sourceforge.net/
A System for Semantic Analysis of Chemical Compound Names

http://cmusphinx.sourceforge.net/wiki/tutorialadapt
Which ASR should I choose for my dialogue system?

https://sourceforge.net/projects/mstparser/
Multilingual Back-and-Forth Conversion between Content and Function Head for Easy Dependency Parsing

http://djangology.sourceforge.net/
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation

http://crftagger.sourceforge.net/
Joint Annotation of Search Queries
Creating a manually error-tagged and shallow-parsed learner corpus
Reconstructing an Indo-European Family Tree from Non-native English Texts
MorphoLogic‘s Submission for the WMT 2009 Shared Task

http://regulus.cvs.sourceforge.net/regulus/Regulus/doc/
A Knowledge-Modeling Approach for Multilingual Regulus Lexica

http://multiword.sourceforge.net/sharedtaskresults2018
Edition 1.1 of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions
VarIDE at PARSEME Shared Task 2018: Are Variants Really as Alike as Two Peas in a Pod?

http://chasen-legacy.sourceforge.jp
Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT
Coherent Back-Channel Feedback Tagging of In-Car Spoken Dialogue Corpus

http://sourceforge.net/projects/arabic-wordlist/
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction

http://ccash.sourceforge.net
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis

http://sp-tk.sourceforge.net/
Toward Construction of Spoken Dialogue System that Evokes Users’ Spontaneous Backchannels
可變速中文文字轉語音系統 (Variable Speech Rate Mandarin Chinese Text-to-Speech System) [In Chinese]
可變速中文文字轉語音系統 (Variable Speech Rate Mandarin Chinese Text-to-Speech System) [In Chinese]

http://sourceforge.net/projects/
MARS: A Specialized RTE System for Parser Evaluation
Predicting Polarities of Tweets by Composing Word Embeddings with Long Short-Term Memory
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
Farasa: A New Fast and Accurate Arabic Word Segmenter
Learning from Relatives: Unified Dialectal Arabic Segmentation
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Graph-based Dependency Parsing with Bidirectional LSTM
Encoding Semantic Resources in Syntactic Structures for Passage Reranking
Automatically Classifying Edit Categories in Wikipedia Revisions
Language Processing Infrastructure in the XLike Project
WikiNetTK – A Tool Kit for EmbeddingWorld Knowledge in NLP Applications
Online Multitask Learning for Machine Translation Quality Estimation
Checking a structured pathology report for completeness of content using terminological knowledge
DirRelCond3: Detecting Textual Entailment Across Languages With Conditions On Directional Text Relatedness Scores
Recognising the Predicate-argument Structure of Tagalog
ASVUniOfLeipzig: Sentiment Analysis in Twitter using Data-driven Machine Learning Techniques
Improving CoGrOO: the Brazilian Portuguese Grammar Checker
Translating Literary Text between Related Languages using SMT
A Cascaded Classification Approach to Semantic Head Recognition
FBK-UPV-UEdin participation in the WMT14 Quality Estimation shared-task
Orthographic and Morphological Processing for Persian-to-English Statistical Machine Translation
Ensemble Models for Dependency Parsing: Cheap and Good?
Company-Oriented Extractive Summarization of Financial News
Arabic Spelling Correction using Supervised Learning
Does Size Matter? Text and Grammar Revision for Parsing Social Media Data
ERSOM: A Structural Ontology Matching Approach Using Automatically Learned Entity Representation
Developing New Linguistic Resources and Tools for the Galician Language
Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors
Approximating a Deep-Syntactic Metric for MT Evaluation and Tuning
Making Speech Look Like Text in the Regulus Development Environment
Structured and Unstructured Cache Models for SMT Domain Adaptation
Learning Paraphrasing for Multiword Expressions
Detecting Dependency Parse Errors with Minimal Resources
AfriBooms: An Online Treebank for Afrikaans
Importing MASC into the ANNIS linguistic database: A case study of mapping GrAF
Unsupervised Dependency Parsing: Let’s Use Supervised Parsers
Why are You Taking this Stance? Identifying and Classifying Reasons in Ideological Debates
High Quality Dependency Selection from Automatic Parses
Statistical French Dependency Parsing: Treebank Conversion and First Results
The Quaero Evaluation Initiative on Term Extraction
An Effective Neural Network Model for Graph-based Dependency Parsing
Robust Morphological Tagging with Word Representations
Structural and Topical Dimensions in Multi-Task Patent Translation
A Repository of Free Lexical Resources for African Languages: The Project and the Method
How Feasible and Robust is the Automatic Extraction of Gene Regulation Events? A Cross-Method Evaluation under Lab and Real-Life Conditions
Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora
Recognizing Biomedical Named Entities Using Skip-Chain Conditional Random Fields
Which Side are You on? Identifying Perspectives at the Document and Sentence Levels
The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages
Impact of Citing Papers for Summarisation of Clinical Documents
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
Single and multi-objective optimization for feature selection in anaphora resolution
CMUQ@QALB-2014: An SMT-based System for Automatic Arabic Error Correction
An Embodied Dialogue System with Personality and Emotions
Chinese-to-Spanish rule-based machine translation system
NAIST at 2013 CoNLL Grammatical Error Correction Shared Task

http://julius.sourceforge.jp/sapi/
Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface

https://freetts.sourceforge.io/
Voice Builder: A Tool for Building Text-To-Speech Voices

http://desr.sourceforge.net
Semantically Annotated Snapshot of the English Wikipedia

http://sourceforge.net/projects/wn-similarity/
UMCC_DLSI-(EPS): Paraphrases Detection Based on Semantic Distance

https://sourceforge.net/projects/la-strings/
Non-linear Mapping for Improved Identification of 1300+ Languages

http://sourceforge.net/projects/crfpp/
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping

https://sourceforge.net/projects/spare2015/files/symptom-
Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

http://cmu-ebmt.sourceforge.net
Taming Structured Perceptrons on Wild Feature Vectors

https://sourceforge.net/projects/awmysql/
Towards Building A Domain Agnostic Natural Language Interface to Real-World Relational Databases

http://sourceforge.net/projects/hannanum
Korean NLP2RDF Resources

http://sourceforge.net/projects/joshua/files/joshua/1.3/
Reproducible Results in Parsing-Based Machine Translation: The JHU Shared Task Submission

http://infomap-nlp.sourceforge.net
Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application
Automatic Acquisition of the Argument-Predicate Relations from a Frame-Annotated Corpus
Paraphrasing for Automatic Evaluation
Evaluating Hierarchical Discourse Segmentation
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations

https://sourceforge.net/projects/wnlt/
Towards a Welsh Semantic Annotation System

http://wvware.sourceforge.net/
Space characters in Chinese semi-structured texts

http://clipsrules.sourceforge.net/
Translation of TO infinitives in Anusaaraka Platform: an English Hindi MT system

http://cmusphinx.sourceforge.net/
Rapid Development of Advanced Question-Answering Characters by Non-experts
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
The WaveSurfer Automatic Speech Recognition Plugin
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
Speech-to-Speech Translation Activities in Thailand

http://cmusphinx.sourceforge
Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems

http://lpsolve.sourceforge.net/5.5/
Automatically Generating Wikipedia Articles: A Structure-Aware Approach
Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues
Predicting Chinese Abbreviations with Minimum Semantic Unit and Global Constraints
Global Learning of Focused Entailment Graphs
Cascaded Attention based Unsupervised Information Distillation for Compressive Summarization
Towards Time-Aware Knowledge Graph Completion
Cross-narrative Temporal Ordering of Medical Events
Argument Inference from Relevant Event Mentions in Chinese Argument Extraction
PKUSUMSUM : A Java Platform for Multilingual Document Summarization
MITRE: Seven Systems for Semantic Similarity in Tweets
Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset
MITRE at SemEval-2017 Task 1: Simple Semantic Similarity
On the Effectiveness of using Sentence Compression Models for Query-Focused Multi-Document Summarization
Learning Entailment Relations by Global Graph Structure Optimization
Joint Modeling of Structure Identification and Nuclearity Recognition in Macro Chinese Discourse Treebank
Generating Aspect-oriented Multi-Document Summarization with Event-aspect model
Noisy Or-based model for Relation Extraction using Distant Supervision
Constructing Coherent Event Hierarchies from News Stories

http://jtidy.sourceforge.net
Web pages segmentation for document selection in Question Answering (Pré-segmentation de pages web et sélection de documents pertinents en Questions-Réponses) [in French]

http://sourceforge.net/projects/jwordnet
Accessing GermaNet Data and Computing Semantic Relatedness
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations
JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages
UNIBA: JIGSAW algorithm for Word Sense Disambiguation

http://crf.sourceforge.net/
Enhanced Answer Type Inference from Questions using Sequential Models
Minimally-Supervised Extraction of Entities from Text Advertisements
Clause Identification and Classification in Bengali

http://notabene.svn.sourceforge.net/
Building the Syntactic Reference Corpus of Medieval French Using NotaBene RDF Annotation Tool

http://cuitools.sourceforge.net
An Unsupervised Vector Approach to Biomedical Term Disambiguation: Integrating UMLS and Medline
Using Second-order Vectors in a Knowledge-based Method for Acronym Disambiguation

http://soplib.sourceforge.net/
Features for Detecting Hedge Cues

http://crf.sourceforge.net
Segment-based Fine-grained Emotion Detection for Chinese Text
Automatic Detection of Document Organizational Structure from Visual and Lexical Markers (Détection automatique de la structure organisationnelle de documents à partir de marqueurs visuels et lexicaux) [in French]
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk

http://wn-similarity.sourceforge
What implementation and translation teach us: the case of semantic similarity measures in wordnets

http://julius.sourceforge
Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices

http://trans.sourceforge.net/en/
ANNEX - a web-based Framework for Exploiting Annotated Media Resources
A Recorded Debating Dataset

https://sourceforge.net/projects/champollion/
Findings of the WMT 2016 Bilingual Document Alignment Shared Task

http://opennap.sourceforge.net/
A Novel Method for Content Consistency and Efficient Full-text Search for P2P Content Sharing Systems

http://multiword.sourceforge.net
How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation

http://mwetoolkit.sourceforge.net/
Une plate-forme générique et ouverte pour le traitement des expressions polylexicales (An Open and Generic Framework for the Acquisition of Multiword Expressions) [in French]

http://wikipedia-miner.sourceforge.net/
Human-competitive tagging using automatic keyphrase extraction
Finding Domain Terms using Wikipedia

http://jpf.sourceforge.net/
ICARUS – An Extensible Graphical Search Tool for Dependency Treebanks
A Lightweight Modeling Middleware for Corpus Processing

https://sourceforge.net/projects/spare2015/files/D-
Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

http://sourceforge.net/projects/ccash
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development

http://bioc.sourceforge.net/
BioNLP Shared Task 2013: Supporting Resources

http://sourceforge.net/apps/trac/exe/wiki
AutoLearn’s authoring tool: a piece of cake for teachers

http://notepad-plus.sourceforge.net/
Cleaneval: a Competition for Cleaning Web Pages

http://igt.sourceforge.net/
Evaluating Automation Strategies in Language Documentation

http://biocreative.sourceforge.net
What’s in a gene name? Automated refinement of gene name dictionaries

http://tess4j.sourceforge.net
NLP Web Services for Resource-Scarce Languages

http://sourceforge.net/projects/wordlist/files/AGID/
Arabic Word Generation and Modelling for Spell Checking

https://sourceforge.net/projects/cognates/files/?source=navb
Multilingual lexical resources to detect cognates in non-aligned texts

https://sourceforge.net
On- and Off-Topic Classification and Semantic Annotation of User-Generated Software Requirements

http://emustru.sourceforge.net/detecting_grammati
A Grammar Checker for Tagalog using LanguageTool

http://portableaserve.sourceforge.net/
Design of an Interactive Web-based User Interface for Speech Database Query Formation

http://bnj.sourceforge.net/
A Unified Probabilistic Approach to Referring Expressions

http://bionlp-corpora.sourceforge.net/
Test Suite Design for Biomedical Ontology Concept Recognition Systems

http://sourceforge.net/projects/lius/
Building Multilingual Search Index using open source framework

http://htmlcleaner.sourceforge.net/
Kitten: a tool for normalizing HTML and extracting its textual content

http://sary.sourceforge.net
An Efficient Algorithm for Unsupervised Word Segmentation with Branching Entropy and MDL

http://saxon.sourceforge.net/
Layering and Merging Linguistic Annotations

http://wnsql.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://www.sourceforge.net/projects/zpar
Transition-Based Parsing of the Chinese Treebank using a Global Discriminative Model
A Fast Decoder for Joint Word Segmentation and POS-Tagging Using a Single Discriminative Model
Shift-Reduce CCG Parsing
Transition-based Dependency Parsing with Rich Non-local Features

http://queequeg.sourceforge.net/index-e.html
Contrastive Analysis and Native Language Identification

http://sourceforge.net/project/showfiles.php?group_id=201943
CRF-based Hybrid Model for Word Segmentation, NER and even POS Tagging

http://sourceforge.net/projects/ebla/
EBLA: A Perceptually Grounded Model of Language Acquisition

http://afner.sourceforge.net/what.html
FEUP at SemEval-2018 Task 5: An Experimental Study of a Question Answering System

http://wn-similarity.sourceforge.net
Information Content Measures of Semantic Similarity Perform Better Without Sense-Tagged Text
A Game-Theoretic Approach to Word Sense Disambiguation
Duluth at SemEval-2017 Task 7 : Puns Upon a Midnight Dreary, Lexical Semantics for the Weak and Weary
Weiwei: A Simple Unsupervised Latent Semantics based Approach for Sentence Similarity
WordNet::SenseRelate::AllWords - A Broad Coverage Word Sense Tagger that Maximizes Semantic Relatedness
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
WordNet::Similarity - Measuring the Relatedness of Concepts
evision PDF of 'Modeling Sentences in the Latent Space
UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness

http://espeak.sourceforge.net/
The Power of Character N-grams in Native Language Identification
Indian Language Screen Readers and Syllable Based Festival Text-to-Speech Synthesis System
An In-depth Analysis of the Effect of Text Normalization in Social Media
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Do NLP and machine learning improve traditional readability formulas?
Multilingual Voice Creation Toolkit for the MARY TTS Platform

http://sourceforge.net/projects/mgizapp/
EMDC: A Semi-supervised Approach for Word Alignment

http://sourceforge.net/projects/regulus/
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
An Intelligent Procedure Assistant Built Using REGULUS 2 and ALTERF
Japanese Speech Understanding using Grammar Specialization
Zero to Spoken Dialogue System in One Quarter: Teaching Computational Linguistics to Linguists Using Regulus
A Development Environment for Building Grammar-Based Speech-Enabled Applications
MedSLT: A Limited-Domain Unidirectional Grammar-Based Medical Speech Translator
Evaluating Task Performance for a Unidirectional Controlled Language Medical Speech Translation System
A Limited-Domain English to Japanese Medical Speech Translator Built Using REGULUS 2
A Voice Enabled Procedure Browser for the International Space Station
A procedure assistant for astronauts in a functional programming architecture, with step previewing and spoken correction of dialogue moves

https://sourceforge.net/projects/regulus/
A Knowledge-Modeling Approach for Multilingual Regulus Lexica

http://wnjn.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://dhvani.sourceforge.net
Automatic Generation of Compound Word Lexicon for Hindi Speech Synthesis

http://sourceforge.net/projects/morfologik/files/morfologik-
A Grammar Checker for Tagalog using LanguageTool

http://gibbslda.sourceforge.net
A Novel Content Enriching Model for Microblog Using News Corpus
SentTopic-MultiRank: a Novel Ranking Model for Multi-Document Summarization
Cross-Cultural Analysis of Blogs and Forums with Mixed-Collection Topic Models
An Unsupervised Neural Attention Model for Aspect Extraction
Three Knowledge-Free Methods for Automatic Lexical Chain Extraction
WordTopic-MultiRank: A New Method for Automatic Keyphrase Extraction
Topical Coherence in LDA-based Models through Induced Segmentation
Unsupervised Topic Modeling for Short Texts Using Distributed Representations of Words
WINGS:Writing with Intelligent Guidance and Suggestions
Improving Question Recommendation by Exploiting Information Need
Unsupervised Topic-Specific Domain Dependency Graphs for Aspect Identification in Sentiment Analysis

http://communicator.sourceforge.net/
Reengineering a Domain-Independent Framework for Spoken Dialogue Systems

http://ffmpeg.sourceforge.net
Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish

https://sourceforge.net/projects/arabicpatterns/
The Power of Language Music: Arabic Lemmatization through Patterns

http://cloc.sourceforge.net/
Edlin: an Easy to Read Linear Learning Framework

http://sourceforge.net/projects/mishkal/
OSMAN ― A Novel Arabic Readability Metric

http://mecab.sourceforge.jp
Left-to-Right Target Generation for Hierarchical Phrase-Based Translation

https://sourceforge.net/projects/aqualog
AquaLog: An ontology-driven Question Answering System to interface the Semantic Web

http://sourceforge.net/projects/ngram
A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi-word Lexemes

http://flexcrfs.sourceforge.net
An Empirical Study in Source Word Deletion for Phrase-Based Statistical Machine Translation
An Empirical Study of Vietnamese Noun Phrase Chunking with Discriminative Sequence Models
Adding Redundant Features for CRFs-based Sentence Sentiment Classification

http://sourceforge.net/projects/tbxtools/
TBXTools: A Free, Fast and Flexible Tool for Automatic Terminology Extraction

http://soaplab.sourceforge.net/
Language Resources Factory: case study on the acquisition of Translation Memories

http://sourceforge.jp/projects/chasen-legacy/
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
Annotating Dialogue Acts to Construct Dialogue Systems for Consulting

http://dylan.sourceforge.net/
Incremental Semantic Construction in a Dialogue System
Probabilistic Type Theory for Incremental Dialogue Processing
Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011)

http://sourceforge.net/projects/zpar/,version0.7
Jointly or Separately: Which is Better for Parsing Heterogeneous Dependencies?

http://mturkclt.sourceforge.net
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk

http://sourceforge.net/projects/mosesdecoder/
Using Context Vectors in Improving a Machine Translation System with Bridge Language

http://textcat.sourceforge.net
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests

http://sourceforge.net/p/apertium/svn/
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian

http://sourceforge.net/projects/evt-project/
Sharing Cultural Heritage: the Clavius on the Web Project

http://maxent.sourceforge.net/about.html
MultiDPS – A multilingual Discourse Processing System
Compensating for Annotation Errors in Training a Relation Extractor
Can Click Patterns across User’s Query Logs Predict Answers to Definition Questions?
Collaborative Ranking: A Case Study on Entity Linking
MindLab-UNAL: Comparing Metamap and T-mapper for Medical Concept Extraction in SemEval 2014 Task 7

http://bibtexml.sourceforge.net
Combining OCR Outputs for Logical Document Structure Markup. Technical Background to the ACL 2012 Contributed Task

http://j-spider.sourceforge.net/
Lexico-Syntactic Patterns for Automatic Ontology Building
Pattern-Based Ontology Construction from Selected Wikipedia Pages

http://sourceforge.net/projects/oxyus/
Building Multilingual Search Index using open source framework

https://sourceforge.net/projects/zore/
ZORE: A Syntax-based System for Chinese Open Relation Extraction

http://sourceforge.net/projects/spare2015/files/220Patterns
Extracting Disease-Symptom Relationships by Learning Syntactic Patterns from Dependency Graphs

http://gposttl.sourceforge.net
Automatic Annotation of Parameters from Nanodevice Development Research Papers
Refining the most frequent sense baseline

http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_04_
CRF-Seq and CRF-DepTree at PARSEME Shared Task 2018: Detecting Verbal MWEs using Sequential and Dependency-Based Approaches

http://julius.sourceforge.jp/en_index
Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license

http://sourceforge.net/projects/clucene
WINGS:Writing with Intelligent Guidance and Suggestions

https://sourceforge.net/p/coocviewer
Exploring Cities in Crime: Significant Concordance and Co-occurrence in Quantitative Literary Analysis

http://flex.sourceforge.net/
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis

http://freetts.sourceforge.net/docs/index.php
Multilingual Voice Creation Toolkit for the MARY TTS Platform

http://sourceforge.net/projects/inaracorpus/
Intrinsic Plagiarism Detection using N-gram Classes

http://sourceforge.net/projects/elixir-fm/
Developing a New System for Arabic Morphological Analysis and Generation

https://sourceforge.net/projects/apertium/
Why Catalan-Spanish Neural Machine Translation? Analysis, comparison and combination with standard Rule and Phrase-based technologies

http://sourceforge.net/projects/joshua/
Hierarchical Phrase-Based MT at the Charles University for the WMT 2011 Shared Task
Cross-Language Entity Linking
Hierarchical Phrase-Based MT at the Charles University for the WMT 2010 Shared Task
cdec: A Decoder, Alignment, and Learning Framework for Finite-State and Context-Free Translation Models

http://vtd-xml.sourceforge.net
Annotated Gigaword

http://sourceforge.net/projects/aramorph/
A Light Lexicon-based Mobile Application for Sentiment Mining of Arabic Tweets

http://mp3splt.sourceforge.net/
Discovering the Italian literature: interactive access to audio indexed text resources

http://sourceforge.net/projects/clucene/
PAL: A Chatterbot System for Answering Domain-specific Questions

http://sourceforge.net/projects/superalign/
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation

http://flora.sourceforge.net/
HEI: Hunter Events Interface A platform based on services for the detection and reasoning about events

http://weka.sourceforge.net/doc.dev/weka/classifiers/functions/SMO.html
An Ensemble Approach for Aggression Identification in English and Hindi Text

http://mulan.sourceforge.net/
Multi-label Text Categorization with Joint Learning Predictions-as-Features Method

http://hunspell.sourceforge.net
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia’s Revision History
Towards Building Advanced Natural Language Applications - An Overview of the Existing Primary Resources and Applications in Nepali
Tools facilitating better use of online dictionaries: Technical aspects of Multidict, Wordlink and Clilstore
Omnifluent English-to-French and Russian-to-English Systems for the 2013 Workshop on Statistical Machine Translation

http://jollyday.sourceforge.net/
Recognising and Interpreting Named Temporal Expressions

http://libots.sourceforge.net/
Web services and data mining: combining linguistic tools for Polish with an analytical platform
Finding the Best Approach for Multi-lingual Text Summarisation: A Comparative Analysis
Web Information Mining and Decision Support Platform for the Modern Service Industry
Context-Enhanced Personalized Social Summarization
Automatic Summarization Using Terminological and Semantic Resources
The Polish Summaries Corpus

http://sourceforge.net/projects/irstlm/
NAIST at the HOO 2012 Shared Task
An Improved Hierarchical Word Sequence Language Model Using Directional Information

http://en.sourceforge.jp/projects/sfnet_tashkeela/
OSMAN ― A Novel Arabic Readability Metric

http://argmax.sourceforge.net/
Efficient Inference of CRFs for Large-Scale Natural Language Data

http://sourceforge.jp/projects/ipadic/
Even Unassociated Features Can Improve Lexical Distributional Similarity
Using Collocations and K-means Clustering to Improve the N-pos Model for Japanese IME
Translation of Patent Sentences with a Large Vocabulary of Technical Terms Using Neural Machine Translation
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
Non-parametric Bayesian Segmentation of Japanese Noun Phrases
phloat : Integrated Writing Environment for ESL learners

http://jclec.sourceforge.net/
Multi-objective Optimization for Efficient Brahmic Keyboards

http://kbedic.sourceforge.net/
Bootstrapping Open-Source English-Bulgarian Computational Dictionary

http://mecab.sourceforge.net
Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations
Cross-Language Text Classification Using Structural Correspondence Learning
A Comparison of Entity Matching Methods between English and Japanese Katakana
Word-based Partial Annotation for Efficient Corpus Construction
Classification of Research Papers into a Patent Classification System Using Two Translation Models
Item Development and Scoring for Japanese Oral Proficiency Testing
Hacking Wikipedia for Hyponymy Relation Acquisition
Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis

http://matxin.sourceforge.net/
Cooperation for Arabic Language Resources and Tools — The MEDAR Project
Machine Learning Disambiguation of Quechua Verb Morphology

https://sourceforge.net/
#WarTeam at SemEval-2017 Task 6: Using Neural Networks for Discovering Humorous Tweets
An Arabic Morphological Analyzer and Generator with Copious Features
Exploring the value space of attributes: Unsupervised bidirectional clustering of adjectives in German
An Extensible Multilingual Open Source Lemmatizer
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing
A Multiform Balanced Dependency Treebank for Romanian

http://grok.sourceforge.net
Leo: an Architecture for Sharing Resources for Unification-Based Grammars

http://crfpp.sourceforge
Co-regularizing character-based and word-based models for semi-supervised Chinese word segmentation

http://owlapi.sourceforge.net/
Exploring the inference role in automatic information extraction from texts
An API for Multi-lingual Ontology Matching

http://phpmorphy.sourceforge.net/dokuwiki/
Towards an Automatic Classification of Illustrative Examples in a Large Japanese-French Dictionary Obtained by OCR

http://jgibblda.sourceforge
Growing Multi-Domain Glossaries from a Few Seeds using Probabilistic Topic Models

http://t-rex.sourceforge.net/
A Novel Approach to Automatic Gazetteer Generation using Wikipedia
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction

http://arabic-wordlist.sourceforge.net/
Arabic Word Generation and Modelling for Spell Checking

https://sourceforge.net/projects/tprdb/files/
Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation

http://aspell.sourceforge.net
Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method

http://jena.sourceforge.net/inference/index.html
A General Methodology for Equipping Ontologies with Time

http://texhyphj.sourceforge.net
Conditional Random Fields for Word Hyphenation

http://sourceforge.net/projects/supersensetag/
Supersense Tagger for Italian
Relation Classification via Convolutional Deep Neural Network
Semantically Annotated Snapshot of the English Wikipedia

http://maxent.sourceforge.net/index.html
Re-examining Automatic Keyphrase Extraction Approaches in Scientific Articles
Re-Ranking Algorithms for Name Tagging
Simple features for statistical Word Sense Disambiguation
Think Globally, Apply Locally: Using Distributional Characteristics for Hindi Named Entity Identification

http://meka.sourceforge.net
Automatically Classifying Edit Categories in Wikipedia Revisions
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data

http://sourceforge.net/projects/featurama/
Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition
The Effects of Syntactic Features in Automatic Prediction of Morphology

http://sourceforge.net/projects/senserelate
The Duluth lexical sample systems in Senseval-3
Concept Disambiguation for Improved Subject Access Using Multiple Knowledge Sources

http://freetts.sourceforge.net/
Integrating Natural Language Generation with XML Web Technology

http://marcion.sourceforge.net/dictionary/coptic.html
A Linked Coptic Dictionary Online

http://acopost.sourceforge.net/
Mink: An Incremental Data-Driven Dependency Parser with Integrated Conversion to Semantics
Fine-Grain Morphological Analyzer and Part-of-Speech Tagger for Arabic Text

http://sp-tk.sourceforge.net
Multilingual Voice Creation Toolkit for the MARY TTS Platform

http://lpsolve.sourceforge.net
Parsing Argumentation Structures in Persuasive Essays
Discriminating Non-Native English with 350 Words
The Unified and Holistic Method Gamma (γ) for Inter-Annotator Agreement Measure and Alignment

https://sourceforge.net/projects/oscar3-chem
Cascaded Classifiers for Confidence-Based Chemical Named Entity Recognition

http://epydoc.sourceforge.net/
Multidisciplinary Instruction with the Natural Language Toolkit
NLTK: The Natural Language Toolkit

http://openccg.sourceforge.net
Designing an Extensible API for Integrating Language Modeling and Realization
The SAMMIE System: Multimodal In-Car Dialogue
Hypertagging: Supertagging for Surface Realization with CCG
Analysis of Mixed Natural and Symbolic Input in Mathematical Dialogs
A Logical Basis for the D Combinator and Normal Form in CCG
Inheritance and the CCG Lexicon

http://jena.sourceforge.net/
SYMBERED - a Symbol-Concept Editing Tool
Content selection from an ontology-based knowledge base for the generation of football summaries
Semantic Web based Machine Translation
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
Transforming Controlled Natural Language Biomedical Queries into Answer Set Programs
A Multiple-Domain Ontology Builder
SentiProfiler: Creating Comparable Visual Profiles of Sentimental Content in Texts
Detecting Implicit Expressions of Sentiment in Text Based on Commonsense Knowledge
Techniques for Text Planning with XSLT

http://sourceforge.net/projects/nlpfarm
Open Resources for Language Technology

http://sourceforge.net/projects/gargantua
Improved Unsupervised Sentence Alignment for Symmetrical and Asymmetrical Parallel Corpora

http://extjwnl.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://jgibblda.sourceforge.net/
Unsupervised Domain Tuning to Improve Word Sense Disambiguation
Social Metaphor Detection via Topical Analysis
Phrase-Based Evaluation for Machine Translation
Sweeping through the Topic Space: Bad luck? Roll again!
TopicTiling: A Text Segmentation Algorithm based on LDA
Encoding Distributional Semantics into Triple-Based Knowledge Ranking for Document Enrichment
How Text Segmentation Algorithms Gain from Topic Models
Identifying Comparable Corpora Using LDA
A Two-Stage Classifier for Sentiment Analysis
Social Metaphor Detection via Topical Analysis
Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation
International Journal of Computational Linguistics & Chinese Language Processing, Volume 19, Number 2, June 2014

http://cmusphinx.sourceforge.net/wiki/tutorial
Development of Speech corpora for different Speech Recognition tasks in Malayalam language

http://nekohtml.sourceforge.net/
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
BlogBuster: A Tool for Extracting Corpora from the Blogosphere

http://www.sourceforge.net
KUI: an ubiquitous tool for collective intelligence development

http://senserelate.sourceforge.net
Duluth at SemEval-2017 Task 7 : Puns Upon a Midnight Dreary, Lexical Semantics for the Weak and Weary
WordNet::SenseRelate::AllWords - A Broad Coverage Word Sense Tagger that Maximizes Semantic Relatedness
SenseRelate::TargetWord—A Generalized Framework for Word Sense Disambiguation
WordNet::Similarity - Measuring the Relatedness of Concepts
UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness

http://crftagger.sourceforge
Part-of-Speech Tagging for Historical English

http://freelcs.sourceforge.net/
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis.

https://sourceforge.net/projects/wnlt/files/user-guide.pdf
Towards a Welsh Semantic Annotation System

http://www.cwb.sourceforge.net
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts
VARTRA: A Comparable Corpus for Analysis of Translation Variation
Scientific registers and disciplinary diversification: a comparable corpus approach

http://multiword.sourceforge.net/mwewn2019/
Proceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019)

http://gedcom-parse.sourceforge
Comparison between historical population archives and decentralized databases

http://featurama.sourceforge.net/
Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis

http://crfpp.sourceforge.net
Tibetan Word Segmentation as Syllable Tagging Using Conditional Random Field
Government Domain Named Entity Recognition for South African Languages
A Cost Sensitive Part-of-Speech Tagging: Differentiating Serious Errors from Minor Errors
A Hybrid Discourse Relation Parser in CoNLL 2015
Parenthetical Classification for Information Extraction
Learning-Based Named Entity Recognition for Morphologically-Rich, Resource-Scarce Languages
IITP: A Supervised Approach for Disorder Mention Detection and Disambiguation
SemanticNet-Perception of Human Pragmatics
Supervised Models for Coreference Resolution
Hybrid Approach for Coreference Resolution
Exploring Features for Named Entity Recognition in Lithuanian Text Corpus
Kernel-based Reranking for Named-Entity Extraction
Adapting a State-of-the-art Anaphora Resolution System for Resource-poor Language
Automatic Generation of Tamil Lyrics for Melodies
BioDCA Identifier: A System for Automatic Identification of Discourse Connective and Arguments from Biomedical Text
Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection
Supervised Metaphor Detection using Conditional Random Fields
Voted NER System using Appropriate Unlabeled Data
SentiWordNet for Indian Languages
Corpus and Method for Identifying Citations in Non-Academic Text
Feature-Rich Twitter Named Entity Recognition and Classification
A Generic Anaphora Resolution Engine for Indian Languages
Named Entity Recognition in Bengali: A Conditional Random Field Approach
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Co-reference Resolution in Tamil Text
Malayalam Clause Boundary Identifier: Annotation and Evaluation
Text Segmentation and Graph-based Method for Template Filling in Information Extraction
Differential Evolution Based Feature Selection and Classifier Ensemble for Named Entity Recognition
Multi-Engine Approach for Named Entity Recognition in Bengali
Hybrid Named Entity Recognition System for South and South East Asian Languages
Voted Approach for Part of Speech Tagging in Bengali
Development of Bengali Named Entity Tagged Corpus and its Use in NER Systems
Language Independent Named Entity Recognition in Indian Languages
Nanjing Normal University Segmenter for the Fourth SIGHAN Bakeoff
South African Language Resources: Phrase Chunking
Domain Focused Named Entity Recognizer for Tamil Using Conditional Random Fields
Joint Distant and Direct Supervision for Relation Extraction
The UMUS System for Named Entity Generation at GREC 2010
Cross Linguistic Variations in Discourse Relations among Indian Languages
Topic-Based Bengali Opinion Summarization
Automatic Detection of Comma Splices
English to Indian Languages Machine Transliteration System at NEWS 2010
A Supervised Machine Learning Approach for Event-Event Relation Identification
Scalable Bio-Molecular Event Extraction System towards Knowledge Acquisition
Generalization Methods for In-Domain and Cross-Domain Opinion Holder Extraction
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches
Tamil NER - Coping with Real Time Challenges
Extracting Formulaic and Free Text Clinical Research Articles Metadata using Conditional Random Fields
Cross-lingual tagger evaluation without test data
JU_CSE_TEMP: A First Step towards Evaluating Events, Time Expressions and Temporal Relations
Clause Boundary Identification for Malayalam Using CRF
End-to-End Relation Extraction Using Distant Supervision from External Semantic Repositories

http://multiword.sourceforge.net/PHITE
Idioms in Context: The IDIX Corpus

http://htmlparser.sourceforge.net
Learning Domain-Specific Information Extraction Patterns from the Web

http://rdrpostagger.sourceforge.net
From Word Segmentation to POS Tagging for Vietnamese

http://poliqarp.sourceforge.net
The Polish Sejm Corpus

http://sourceforge.net/projects/wikiprep/
Mining for Domain-specific Parallel Text from Wikipedia
Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages

http://thmanager.sourceforge.net/
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

http://corpussearch.sourceforge.net/CS-
Deterministic natural language generation from meaning representations for machine translation

http://sourceforge.net/projects/modelblocks/
Evidence of syntactic working memory usage in MEG data
An Analysis of Frequency- and Memory-Based Processing Costs
Accurate Unbounded Dependency Recovery using Generalized Categorial Grammars

http://sourceforge.net/projects/jbnc/
Using Wizard-of-Oz simulations to bootstrap Reinforcement - Learning based dialog management systems

http://awstats.sourceforge.net
Query log analysis with LangLog

http://text-similarity.sourceforge.net
A Study on Linking Wikipedia Categories to Wordnet Synsets using Text Similarity

http://varro.sourceforge.net/
An Efficient, Generic Approach to Extracting Multi-Word Expressions from Dependency Trees
Varro: An Algorithm and Toolkit for Regular Structure Discovery in Treebanks

http://pocket-crf-1.sourceforge.net/
FBK-UPV-UEdin participation in the WMT14 Quality Estimation shared-task
Goodness: A Method for Measuring Machine Translation Confidence

http://sourceforge.net/projects/masayu-a/
Automatic Domain Adaptation for Word Sense Disambiguation Based on Comparison of Multiple Classifiers
Automatic Determination of a Domain Adaptation Method for Word Sense Disambiguation Using Decision Tree Learning
Negation Naive Bayes for Categorization of Product Pages on the Web

http://bionlp.sourceforge.net/Knowtator
Knowtator: A Protégé plug-in for annotated corpus construction

http://sourceforge.net/projects/spatialml/
Integrating Motion Predicate Classes with Spatial and Temporal Annotations
Capturing Motion in ISO-SpaceBank

http://jchardet.sourceforge.net/
Kitten: a tool for normalizing HTML and extracting its textual content

http://sourceforge.net/projects/conexp
Interaction between Lexical Base and Ontology with Formal Concept Analysis

http://espeak.sourceforge.net
Sentiment Analysis in Czech Social Media Using Supervised Machine Learning
From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning
Automatic Generation of English Respellings
Unravelling Names of Fictional Characters
Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)

http://multiword.sourceforge.net/cupt-format/
Pay Attention when you Pay the Bills. A Multilingual Corpus with Dependency-based and Semantic Annotation of Collocations.

http://sourceforge.net/projects/medslt
A Development Environment for Building Grammar-Based Speech-Enabled Applications

http://sigmakee.cvs.sourceforge.net/sigmakee/KBs/Wo
Chinese Core Ontology Construction from a Bilingual Term Bank

https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-id-ms
Rule-based Machine Translation between Indonesian and Malaysian

http://sourceforge.net/projects/jwordnet/
WSL: Sentence Similarity Using Semantic Distance Between Words
UBY - A Large-Scale Unified Lexical-Semantic Resource Based on LMF
Extending a wordnet framework for simplicity and scalability
Question Answering System using Multiple Information Source and Open Type Answer Merge
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora

http://sourceforge.net/projects/emu
An Acoustic Corpus Contemplating Regional Variation for Studies of European Portuguese Nasals

http://palmkit.sourceforge.net/
Automatic Labeling of Voiced Consonants for Morphological Analysis of Modern Japanese Literature
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models
A Clustered Global Phrase Reordering Model for Statistical Machine Translation

https://sourceforge.net/projects/zpar/
Combining Discrete and Continuous Features for Deterministic Transition-based Dependency Parsing

http://chebi.cvs.sourceforge.net/
LASIGE: using Conditional Random Fields and ChEBI ontology

http://meka.sourceforge.net/#about
Multiple Emotions Detection in Conversation Transcripts

https://sourceforge.net/projects/wordlist/files/speller/
Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction

http://mat-annotation.sourceforge
Finding Names in Trove: Named Entity Recognition for Australian Historical Newspapers

http://sourceforge.net/projects/tableseer
Extracting Structured Scholarly Information from the Machine Translation Literature

http://bioevent.sourceforge.net/
Double Layered Learning for Biological Event Extraction from Text

http://regain.sourceforge.net/
Building Multilingual Search Index using open source framework

http://wnsqlbuilder.sourceforge.net
Comparing the Value of Latent Semantic Analysis on two English-to-Indonesian lexical mapping tasks

http://java-ml.sourceforge.net/api/0.1.7/
A Visual Analytics System for Cluster Exploration

http://statsmodels.sourceforge.net/devel/mixed_
An Empirical Analysis of Formality in Online Communication

http://sourceforge.net/projects/jirs/
IRIT: Textual Similarity Combining Conceptual Similarity with an N-Gram Comparison Method

http://cwb.sourceforge.net/
ScienQuest: a Treebank Exploitation Tool for non NLP-Specialists
Exploring and Visualizing Variation in Language Resources
The GermaParl Corpus of Parliamentary Protocols
A Bird’s-eye View of Language Processing Projects at the Romanian Academy
Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
Distributional Semantics in R with the wordspace Package
The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
The Reference Corpus of the Contemporary Romanian Language (CoRoLa)
A modernised version of the Glossa corpus search system
OpenSoNaR: user-driven development of the SoNaR corpus interfaces
A Tool/Database Interface for Multi-Level Analyses
Novel Word-sense Identification
A Multilingual Speech Resource: The Nordic Dialect Corpus
Towards a Large Parallel Corpus of Cleft Constructions
CoRoLa — The Reference Corpus of Contemporary Romanian Language
The Varitext platform and the Corpus des variétés nationales du français (CoVaNa-FR) as resources for the study of French from a pluricentric perspective
Parallel Suffix Arrays for Linguistic Pattern Search
A Case Study on Interoperability for Language Resources and Applications

http://presage.sourceforge.net
Using Language Groundings for Context-Sensitive Text Prediction

http://bedic.sourceforge.net/
A Repository of Free Lexical Resources for African Languages: The Project and the Method

http://sourceforge.net/projects/jagatoo/
ENGAGE: Automated Gestures for Animated Characters

http://sourceforge.net/projects/pdf2xml/
OCR++: A Robust Framework For Information Extraction from Scholarly Articles

http://sourceforge.net/projects/mulan/
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora

http://sox.sourceforge.net/
Toward a Web-based Speech Corpus for Algerian Dialectal Arabic Varieties
Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development
Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data

http://hunspell.sourceforge.net/
C-WEP―Rich Annotated Collection of Writing Errors by Professionals
The Karlsruhe Institute of Technology Translation Systems for the WMT 2012
Joint WMT 2013 Submission of the QUAERO Project
Joint WMT Submission of the QUAERO Project
TECHLIMED system description for the Shared Task on Automatic Arabic Error Correction
A Gold Standard for Scalar Adjectives
Applying Machine Translation Metrics to Student-Written Translations
SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers’ Errors
The Karlsruhe Institute of Technology Translation Systems for the WMT 2011
Fast Labeling and Transcription with the Speechalyzer Toolkit
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010
TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System
Joint WMT 2012 Submission of the QUAERO Project
From newspaper to microblogging: What does it take to find opinions?

http://aleph-ml.sourceforge.net/
A Novel Approach to Automatic Gazetteer Generation using Wikipedia
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction

http://arabic-unknowns.sourceforge.net/
Handling Unknown Words in Arabic FST Morphology
The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words

http://jtextpro.sourceforge.net/
Phrase-Based Evaluation for Machine Translation
Relation Classification using Entity Sequence Kernels

http://subworkshop.sourceforge.net/
Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles

http://amine-platform.sourceforge.net/
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

http://sourceforge.net/projects/mecab/
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes

http://joda-time.sourceforge.net
Temporal Relation Classification Based on Temporal Reasoning

http://shareaza.sourceforge.net/?id=source
Personal Health Information Leak Prevention in Heterogeneous Texts

http://jwebpro.sourceforge.net/data-web-snippets.tar.gz
Neural Sparse Topical Coding
Short Text Clustering via Convolutional Neural Networks
Towards Unsupervised Text Classification Leveraging Experts and Word Embeddings

http://sourceforge.jp/projects/ipadic/releases/24435/
Evaluation Dataset and System for Japanese Lexical Simplification

http://proai.sourceforge.net
Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library.

http://sourceforge.net/projects/jmir/files/
Automatic Music Mood Classification of Hindi Songs

http://junicode.sourceforge
OTTO: A Transcription and Management Tool for Historical Texts

http://apelon-dts.sourceforge.net/index.html
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool

http://jgrapht.sourceforge.net
Building timelines of soccer matches from Twitter
You’ll Never Tweet Alone: Building Sports Match Timelines from Microblog Posts

http://sourceforge.net/p/gramlab
Tackling interoperability issues within UIMA work flows

https://sourceforge.net/projects/ngram/
VCU at Semeval-2016 Task 14: Evaluating definitional-based similarity measure for semantic taxonomy enrichment

http://agtk.sourceforge.net/
Using the NITE XML Toolkit on the Switchboard Corpus to Study Syntactic Choice: a Case Study
Web-based Annotation of Anaphoric Relations and Lexical Chains
Supporting Annotation Layers for Natural Language Processing
Functional Requirements for an Interlinear Text Editor
Towards General-Purpose Annotation Tools – How Far Are We Today?

http://scopefinder.sourceforge.net/
Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes

http://sourceforge.net/projects/morfologik/files/
Pattern Matching Refinements to Dictionary-Based Code-Switching Point Detection

http://sourceforge.net/projects/mecab
Splitting Noun Compounds via Monolingual and Bilingual Paraphrasing: A Study on Japanese Katakana Words

https://sourceforge.net/projects/gargantua/
Findings of the WMT 2016 Bilingual Document Alignment Shared Task

http://umls-similarity.sourceforge.net
Duluth : Measuring Cross-Level Semantic Similarity with First and Second Order Dictionary Overlaps
UMLS::Similarity: Measuring the Relatedness and Similarity of Biomedical Concepts

http://sourceforge.net/projects/arasuite/
Pictogrammar: an AAC device based on a semantic grammar

https://mosesdecoder.svn.sourceforge.net/svnroot/
Dual-Path Phrase-Based Statistical Machine Translation

http://sourceforge.net/projects/simmetrics/
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
UMCC_DLSI: Textual Similarity based on Lexical-Semantic features
UMCC_DLSI: Multidimensional Lexical-Semantic Textual Similarity
A hybrid approach for extracting semantic relations from texts
UMCC_DLSI_SemSim: Multilingual System for Measuring Semantic Textual Similarity

http://sourceforge.net/projects/irstlm
Postech’s System Description for Medical Text Translation Task

http://stardict.sourceforge.net/
Cross Language Dependency Parsing using a Bilingual Lexicon
Exploiting Bilingual Translation for Question Retrieval in Community-Based Question Answering
Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization
Automated Translation of Semantic Relationships
Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems

http://wikipedia-miner.sourceforge.net
WikiWalk: Random walks on Wikipedia for Semantic Relatedness
HUMB: Automatic Key Term Extraction from Scientific Articles in GROBID
Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’s Edit History

http://opennlp.sourceforge
A Cognitive Cost Model of Annotations Based on Eye-Tracking Data

http://gibbslda.sourceforge.net/
Topic Models for Meaning Similarity in Context
Classifying Frames at the Sentence Level in News Articles
MSS: Investigating the Effectiveness of Domain Combinations and Topic Features for Word Sense Disambiguation
Exploring English Lexicon Knowledge for Chinese Sentiment Analysis
Biography-Dependent Collaborative Entity Archiving for Slot Filling
A Topic Similarity Model for Hierarchical Phrase-based Translation
以多重表示選擇文章分類的樣本(Using Multiple Representations to Select Instances for Text Classification)[In Chinese]
Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
Cross-Lingual Topical Relevance Models
STS-UHH at SemEval-2017 Task 1: Scoring Semantic Textual Similarity Using Supervised and Unsupervised Ensemble
Query classification using topic models and support vector machine
Unsupervised Negation Focus Identification with Word-Topic Graph Model
Predicting Restaurant Consumption Level through Social Media Footprints
Integrating Topic Modeling with Word Embeddings by Mixtures of vMFs
Unsupervised Concept Annotation using Latent Dirichlet Allocation and Segmental Methods
Query classification via Topic Models for an art image archive
A Joint Model of Conversational Discourse Latent Topics on Microblogs

http://mecab.sourceforge.net/dic.html
Language Resource Addition: Dictionary or Corpus?

http://sourceforge.net/projects/aracomlex/
An Open-Source Finite State Morphological Transducer for Modern Standard Arabic

http://weka.sourceforge.net/
Two Phase Evaluation for Selecting Machine Translation Services

http://aligner.sourceforge.net
Knowledge-lean projection of coreference chains across languages

http://multiword.sourceforge.net/
Proposal for MWE Annotation in Running Text
USzeged: Identifying Verbal Multiword Expressions with POS Tagging and Parsing Techniques
Extended and Enhanced Polish Dependency Bank in Universal Dependencies Format
Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French
Things between Lexicon and Grammar
Neural Networks for Multi-Word Expression Detection
Without lexicons, multiword expression identification will never fly: A position statement
The Impact of Word Representations on Sequential Neural MWE Identification

http://nltk.sourceforge.net/index.php/Main_Page
ProPOSEL: a human-oriented prosody and PoS English lexicon for machine-learning and NLP
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering

http://grok.sourceforge.net/
Combining Clues for Word Alignment

http://multiword.sourceforge.net/lawmwecxg2018/
Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018)

http://sourceforge.net/projects/sserver/
Using Inverse lambda and Generalization to Translate English to Formal Languages
A Structured SVM Semantic Parser Augmented by Semantic Tagging with Conditional Random Field
Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011)

http://flexCRF.sourceforge.net
Sentence Type Based Reordering Model for Statistical Machine Translation

http://arasubcats-lfg.sourceforge.net
Automatic Extraction and Evaluation of Arabic LFG Resources

http://ant2dot.sourceforge.net
Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

http://sourceforge.net/projects/irstlm/files/irstlm/
Joint English Spelling Error Correction and POS Tagging for Language Learners Writing

http://sourceforge.net/p/lemur/wiki/
Using Content-level Structures for Summarizing Microblog Repost Trees
Driving ROVER with Segment-based ASR Quality Estimation

http://trove4j.sourceforge.net/
Edlin: an Easy to Read Linear Learning Framework

http://sigmakee.cvs.sourceforge.net/*checkout*/sigmakee/sigma/suo-kif.pdf
A Call for Executable Linguistics Research

http://cwb.sourceforge.net/files/
KorAP Architecture ― Diving in the Deep Sea of Corpus Data

http://julius.sourceforge.jp/
The WaveSurfer Automatic Speech Recognition Plugin
Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems
A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models
Non-humanlike Spoken Dialogue: A Design Perspective
Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus

http://sourceforge.net/p/currennt
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis

http://units-in-java.sourceforge.net/
Large Scale Semantic Annotation, Indexing and Search at The National Archives

https://sourceforge.net/projects/powla/
Towards a Linked Open Data Edition of Sumerian Corpora

https://lists.sourceforge.net/lists/listinfo/apertium-stuff
Apertium’s Web Toolchain for Low-Resource Language Technology

http://riso.sourceforge.net
BART: A Modular Toolkit for Coreference Resolution
BART: A modular toolkit for coreference resolution

http://retratos.sourceforge.net/
PorTAl: Recursos e Ferramentas de Tradução Automática para o Português do Brasil (PorTAl: Resources and Tools for Machine Translation of Brazilian Portuguese) [in Portuguese]

http://sourceforge.net/projects/spatialml
SpatialML: Annotation Scheme, Corpora, and Tools

http://recoder.sourceforge.net/
Parsing Formal Languages using Natural Language Parsing Techniques

http://sourceforge.net/projects/supersensetag
Learning to Rank Answers to Non-Factoid Questions from Web Collections
Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths

http://cwb.sourceforge.net/files/CQP_Tutorial.pdf
Corpus Query Lingua Franca (CQLF)

https://sourceforge.net/p/lemur/wiki/RankLib
Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results
Recognizing Reference Spans and Classifying their Discourse Facets

http://chatscript.sourceforge.net/
Interpreting Questions with a Log-Linear Ranking Model in a Virtual Patient Dialogue System

http://sourceforge.net/projects/wn-similarity
Incorporation of WordNet Features to n-gram Features in a Language Modeler

http://multiparse.sourceforge.net
Towards Robust Multi-Tool Tagging. An OWL/DL-Based Approach

http://opennlp.sourceforge.net/
CONE: Metrics for Automatic Evaluation of Named Entity Co-Reference Resolution
Comparing Semantic Role Labeling with Typed Dependency Parsing in Computational Metaphor Identification
OpinionFinder: A System for Subjectivity Analysis
Extracting Information for Generating A Diabetes Report Card from Free Text in Physicians Notes
Subtree Mining for Relation Extraction from Wikipedia
Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
Identifying Relations for Open Information Extraction
Unsupervised Learning of Narrative Schemas and their Participants
RelationListwise for Query-Focused Multi-Document Summarization
Speaker Attribution in Cabinet Protocols
Towards Topic Labeling with Phrase Entailment and Aggregation
Charting Democracy Across Parsers
Learning to Tell Tales: A Data-driven Approach to Story Generation
Text Analysis for Automatic Image Annotation
Semantic Role Labeling for News Tweets
A High-Performance Syntactic and Semantic Dependency Parser
Multi-Document Summarization by Capturing the Information Users are Interested in
Priming Effects in Combinatory Categorial Grammar
Plot Induction and Evolutionary Search for Story Generation
Domain Adaptation with Active Learning for Coreference Resolution
Learning Textual Entailment using SVMs and String Similarity Measures
SJTULTLAB: Chunk Based Method for Keyphrase Extraction
The CNGL-DCU-Prompsit Translation Systems for WMT13
Learning Alignments and Leveraging Natural Logic
Collective Semantic Role Labeling on Open News Corpus by Leveraging Redundancy
Towards Data and Goal Oriented Analysis: Tool Inter-operability and Combinatorial Comparison
String Re-writing Kernel
Search Engine Statistics Beyond the n-Gram: Application to Noun Compound Bracketing
A Comparison of Features for Automatic Readability Assessment
Using Shallow Syntax Information to Improve Word Alignment and Reordering for SMT
It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text
SRL-Based Verb Selection for ESL
UvT: The UvT Term Extraction System in the Keyphrase Extraction Task
Extending the Semantics in Natural Language Understanding
Coreference Resolution with Reconcile
Using Corpus Statistics on Entities to Improve Semi-supervised Relation Extraction from the Web
Appraising UMLS Coverage for Summarizing Medical Evidence
Collective Cross-Document Relation Extraction Without Labelled Data
A Study of Machine Learning Algorithms for Recognizing Textual Entailment
Annotation Time Stamps — Temporal Metadata from the Linguistic Annotation Process
WLV: A Confidence-based Machine Learning Method for the GREC-NEG’09 Task
Understanding Mental States in Natural Language
Generating Image Descriptions Using Dependency Relational Patterns
Bootstrapping Word Alignment via Word Packing
Enriching Phrase-Based Statistical Machine Translation with POS Information
How Feasible and Robust is the Automatic Extraction of Gene Regulation Events? A Cross-Method Evaluation under Lab and Real-Life Conditions
Self-Annotation for fine-grained geospatial relation extraction
Named Entity Recognition for South and South East Asian Languages: Taking Stock
Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation
Conundrums in Noun Phrase Coreference Resolution: Making Sense of the State-of-the-Art
Mining Coreference Relations between Formulas and Text using Wikipedia
Authorship Attribution Using Probabilistic Context-Free Grammars
GuideLink: A Corpus Annotation System that Integrates the Management of Annotation Guidelines
Summary Generation for Toponym-referenced Images using Object Type Language Models
The wraetlic NLP suite
Reducing Wrong Labels in Distant Supervision for Relation Extraction
Semi-supervised Relation Extraction with Large-scale Word Clustering
URES : an Unsupervised Web Relation Extraction System
A Survey of Text Mining Architectures and the UIMA Standard
Automatic Labelling of Topic Models
Distantly Supervised POS Tagging of Low-Resource Languages under Extreme Data Sparsity: The Case of Hittite
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction
Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus

http://exist.sourceforge.net/
Anatomy of an XML-based Text Corpus Server
From Field Notes towards a Knowledge Base

http://proai.sourceforge.net/
FLAT: Constructing a CLARIN Compatible Home for Language Resources
A Repository for the Sustainable Management of Research Data

http://combitagger.sourceforge.net
Tagging the Past: Experiments using the Saga Corpus
Construction of a Free Large Part-of-Speech Annotated Corpus in French (Construction d’un large corpus écrit libre annoté morpho-syntaxiquement en français) [in French]

http://mummer.sourceforge.net/
A Linguistic Knowledge Discovery Tool: Very Large Ngram Database Search with Arbitrary Wildcards

http://htmltrim.sourceforge.net/tidy.html
Web-based List Question Answering

http://cwb.sourceforge.net
The GermaParl Corpus of Parliamentary Protocols
Exploring Cities in Crime: Significant Concordance and Co-occurrence in Quantitative Literary Analysis
The Database of Catalan Adjectives
The TXM Platform: Building Open-Source Textual Analysis Software Compatible with the TEI Encoding Scheme
Improving corpus search via parsing
Predicting Cognitively Salient Modifiers of the Constitutive Parts of Concepts

http://jwn.sourceforge
Efficient Hierarchical Entity Classifier Using Conditional Random Fields

http://natools.sourceforge.net/
Portuguese-English Word Alignment: some Experiments

http://senseclusters.sourceforge.net/
A Study on Linking Wikipedia Categories to Wordnet Synsets using Text Similarity
SenseClusters - Finding Clusters that Represent Word Senses
An Unsupervised Vector Approach to Biomedical Term Disambiguation: Integrating UMLS and Medline

http://MaxEnt.sourceforge.net/
A Twin-Candidate Model for Learning-Based Anaphora Resolution

http://oscar3-chem.sourceforge.net/
Semantic enrichment of journal articles using chemical named entity recognition

http://ayaspell.sourceforge.net/
Arabic Word Generation and Modelling for Spell Checking

http://pyaiml.sourceforge.net/
Discourse-Based Modeling for AAC

http://sourceforge.net/projects/hannanum/
Identification of Implicit Topics in Twitter Data Not Containing Explicit Search Queries

http://sourceforge.net/apps/mediawiki/irstlm
Syntactic SMT Using a Discriminative Text Generation Model

https://sourceforge.net/projects/textometrie
The TXM Platform: Building Open-Source Textual Analysis Software Compatible with the TEI Encoding Scheme

http://sourceforge.net/projects/dhtmlgrid/
BEYTrans: A Free Online Collaborative Wiki-Based CAT Environment Designed for Online Translation Communities

http://julius.sourceforge.jp/en
Designing Language Technology Applications: A Wizard of Oz Driven Prototyping Framework
Multi-step Natural Language Understanding
Um Sistema para Melhorar a Usabilidade de um Gerenciador de Correio Eletrônico Baseado em Reconhecimento de Fala (A System for Improving the Usability of a E-mail Management Tool with Speech Recognition) [in Portuguese]

http://cwb.sourceforge.net/files/CWB_
Efficient Dependency Graph Matching with the IMS Open Corpus Workbench

http://festival-te.sourceforge.net/wiki/Main_Page
NLP Applications of Sinhala: TTS & OCR

http://markup.sourceforge.net
ConsentCanvas: Automatic Texturing for Improved Readability in End-User License Agreements

http://julius.sourceforge.jp
Multi-Engine Machine Translation with Voted Language Model
Language Modeling for Spoken Dialogue System based on Filtering using Predicate-Argument Structures
Correction Grammars for Error Handling in a Speech Dialog System
Coherent Back-Channel Feedback Tagging of In-Car Spoken Dialogue Corpus
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish

http://lpsolve.sourceforge
Multilingual Multi-Document Summarization with POLY2
Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes
Mining the Gaps: Towards Polynomial Summarization

http://cwb.sourceforge.net/files/CQP_Tutorial/
OpenSoNaR: user-driven development of the SoNaR corpus interfaces

http://sourceforge.net/projects/mmax2/files/
A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology

http://icenlp.sourceforge.net
The Icelandic Parsed Historical Corpus (IcePaHC)
Tagging a Morphologically Complex Language Using an Averaged Perceptron Tagger: The Case of Icelandic

http://www.sourceforge.net/projects/
Parsing Chinese with a Generalized Categorial Grammar

http://uplug.sourceforge.net/
Automatic Construction of Domain-specific Dictionaries on Sparse Parallel Corpora in the Nordic languages
Using Uplug and SiteSeeker to construct a cross language search engine for Scandinavian languages
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction

http://sourceforge.net/projects/dctfinder/
Extracting News Web Page Creation Time with DCTFinder

http://ngram.sourceforge.net/
Distributional Semantics in R with the wordspace Package
Using Language Resources in Humanities research

https://sourceforge.net/projects/mecab/files/
Sentence Complexity Estimation for Chinese-speaking Learners of Japanese

http://taga.sourceforge.net/owl/index.html
A Semantic Based Question Answering System for Thailand Tourism Information

http://htmlunit.sourceforge.net
Building a Dataset of Multilingual Cognates for the Romanian Lexicon
Automatic CEFR Level Prediction for Estonian Learner Text

http://sourceforge.net/projects/hrcf/
Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling
Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts

http://sourceforge.net/p/bibutils/home/
Integrating User-Generated Content in the ACL Anthology

http://sourceforge.net/projects/hcrf/
Using Tone Information in Thai Spelling Speech Recognition

http://freemind.sourceforge.net
Novelle, a collaborative open source writing tool software

http://sourceforge.net/projects/trans/
Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian

http://tokyocabinet.sourceforge.net/
CCG parsing with one syntactic structure per n-gram

http://sourceforge.jp/projects/mpaligner/
An Ensemble Model of Word-based and Character-based Models for Japanese and Chinese Input Method
Applying mpaligner to Machine Transliteration with Japanese-Specific Heuristics

http://audacity.sourceforge.net/
Compiling a Corpus of Taiwanese Students’ Spoken English
Aix Map Task corpus: The French multimodal corpus of task-oriented dialogue
Likelihood Ratio-based Forensic Voice Comparison on L2 speakers: A Case of Hong Kong native male production of English vowels
Automated Vocabulary Acquisition and Interpretation in Multimodal Conversational Systems
Spontal-N: A Corpus of Interactional Spoken Norwegian

http://chebi.cvs.sourceforge.net/viewvc/chebi/chapati/
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts

http://sourceforge.net/projects/maxent/files/
A Shallow Discourse Parsing System Based On Maximum Entropy Model

http://ohnlp.sourceforge.net/MedKATp/
A Framework to Generate Sets of Terms from Large Scale Medical Vocabularies for Natural Language Processing

http://sourceforge.net/projects/hunspell/files/Hunspell/Documentation/
TECHLIMED system description for the Shared Task on Automatic Arabic Error Correction

http://sourceforge.net/projects/edits/files/
A Search Task Dataset for German Textual Entailment

http://jung.sourceforge.net/
Investigating the Applicability of current Machine-Learning based Subjectivity Detection Algorithms on German Texts
XLike Project Language Analysis Services
Statistical Thesaurus Construction for a Morphologically Rich Language
SentiProfiler: Creating Comparable Visual Profiles of Sentimental Content in Texts
Clairlib: A Toolkit for Natural Language Processing, Information Retrieval, and Network Analysis

http://jpf.sourceforge.net
SYMBERED - a Symbol-Concept Editing Tool

https://sourceforge.net/p/lemur/wiki/
Demonstrating Par4Sem - A Semantic Writing Aid with Adaptive Paraphrasing
News Citation Recommendation with Implicit and Explicit Semantics
Integrating Meaning into Quality Evaluation of Machine Translation
Reranking Translation Candidates Produced by Several Bilingual Word Similarity Sources

https://sourceforge.net/projects/maxent/
Verb Phrase Ellipsis detection using Automatically Parsed Text

http://open-jtalk.sourceforge.net/
Information Navigation System with Discovering User Interests

http://rdrpostagger.sourceforge.net/
RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger

http://stardict.sourceforge.net/Dictionaries.php
Cross Language Text Classification by Model Translation and Semi-Supervised Learning

http://sourceforge.net/projects/pdfxtk/
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus

http://jswarm-pso.sourceforge.net/
Optimizing Textual Entailment Recognition Using Particle Swarm Optimization
Automatic Cost Estimation for Tree Edit Distance Using Particle Swarm Optimization

http://jgrapht.sourceforge.net/
MARS: A Specialized RTE System for Parser Evaluation
On the Feasibility of Automatically Describing n-dimensional Objects

http://www.sourceforge.net/projects/texttoonto
LEILA: Learning to Extract Information by Linguistic Analysis

http://jldadmm.sourceforge.net/
Improving Topic Coherence with Latent Feature Word Representations in MAP Estimation for Topic Modeling

http://balie.sourceforge.net
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization

https://sourceforge.net/projects/vhmsg/
New Dimensions in Testimony Demonstration

http://openccg.sourceforge.net/
Multimodal Generation in the COMIC Dialogue System
Using CCG categories to improve Hindi dependency parsing
Parsing with Dynamic Continuized CCG
Lexicalization and Generative Power in CCG
Back to the Blocks World: Learning New Actions through Situated Human-Robot Dialogue
TuLiPA: Towards a Multi-Formalism Parsing Environment for Grammar Engineering
A Simple Method for Clarifying Sentences with Coordination Ambiguities

http://java-ml.sourceforge.net/
Subgroup Detector: A System for Detecting Subgroups in Online Discussions

http://minion.sourceforge.net
A Distributed Platform for Sanskrit Processing

http://lisa.sourceforge.net/
Sign Language Generation with Expert Systems and CCG

http://sourceforge.net/projects/sgmweka/
Cumulative Progress in Language Models for Information Retrieval

http://sourceforge.net/projects/ujmp/files/
Recursive Neural Networks for Coding Therapist and Patient Behavior in Motivational Interviewing

http://utool.sourceforge.net
Efficient Solving and Exploration of Scope Ambiguities

https://sourceforge.net/p/halef
A distributed cloud-based dialog system for conversational application development

http://mwetoolkit.sourceforge.net/PHITE.php?sitesig=MWE
A Comparative Study of Different Classification Methods for the Identification of Brazilian Portuguese Multiword Expressions
Identifying Portuguese Multiword Expressions using Different Classification Algorithms - A Comparative Analysis

http://aspell.sourceforge.net/wl/
The Topology of Synonymy and Homonymy Networks

http://minorthird.sourceforge.net/
Identifying Types of Claims in Online Customer Reviews
Understanding Temporal Expressions in Emails

http://maxent.sourceforge.net/index
Extracting Domain-Specific Words - A Statistical Approach

http://sourceforge.net/projects/lpsolve/
Coreference Clustering using Column Generation
Recognizing Causality in Verb-Noun Pairs via Noun and Verb Semantics
Structural Opinion Mining for Graph-based Sentiment Representation
Predicting Stance in Ideological Debate with Rich Linguistic Knowledge
Product Review Summarization by Exploiting Phrase Properties

http://downloads.sourceforge.net/
Unsupervised training of maximum-entropy models for lexical selection in rule-based machine translation

https://sourceforge.net/projects/tashkeela
Shamela: A Large-Scale Historical Arabic Corpus

http://sourceforge.net/projects/py2ipa
Unsupervised phonemic Chinese word segmentation using Adaptor Grammars

http://sourceforge.net/p/apertium/
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Predicting Inflectional Paradigms and Lemmata of Unknown Words for Semi-automatic Expansion of Morphological Lexicons

http://sourceforge.net/projects/leonlp/
Transparent combination of rule-based and data-driven approaches in speech understanding
An Open-Source Environment for Compiling Typed Unification Grammars into Speech Recognisers

http://jwordnet.sourceforge.net
Solving Relational Similarity Problems Using the Web as a Corpus

http://chasen-legacy.sourceforge.jp/
Using Collocations and K-means Clustering to Improve the N-pos Model for Japanese IME
Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues
Bilingual Lexicon Extraction from Comparable Corpora Using Label Propagation
An Effective Compositional Model for Lexical Alignment
A Named Entity Recognition Method based on Decomposition and Concatenation of Word Chunks
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation
TEXT2TABLE: Medical Text Summarization System Based on Named Entity Recognition and Modality Identification
A Cross-Lingual ILP Solution to Zero Anaphora Resolution
Recurrent Neural Networks for Word Alignment Model
Imposing Constraints from the Source Tree on ITG Constraints for SMT
SemEval-2010 Task: Japanese WSD
Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems
A Boosted Semi-Markov Perceptron

http://ngram.sourceforge.net
The Duluth lexical sample systems in Senseval-3
Complementarity of lexical and simple syntactic features: The SyntaLex approach to Senseval-3
Duluth: Word Sense Discrimination in the Service of Lexicography
Combining Lexical and Syntactic Features for Supervised Word Sense Disambiguation
jMWE: A Java Toolkit for Detecting Multi-Word Expressions
Identifying Collocations to Measure Compositionality: Shared Task System Description

http://prefixspan-rel.sourceforge.jp/
Detection of Users Suspected of Pretending to Be Other Users in a Community Site by Using Messages Submitted to Non-Target Categories
Detection of submitters suspected of pretending to be someone else in a community site

http://java-ml.sourceforge.net
Watset: Automatic Induction of Synsets from a Graph of Synonyms

http://champollion.sourceforge.net/
Generating Chinese Named Entity Data from a Parallel Corpus
Findings of the WMT 2018 Biomedical Translation Shared Task: Evaluation on Medline test sets
Towards Bilingual Term Extraction in Comparable Patents
Findings of the WMT 2019 Biomedical Translation Shared Task: Evaluation for MEDLINE Abstracts and Biomedical Terminologies

http://sourceforge.net/projects/cabocha/
Automatic Domain Adaptation for Word Sense Disambiguation Based on Comparison of Multiple Classifiers
LAF/GrAF-grounded Representation of Dependency Structures
Automatic Determination of a Domain Adaptation Method for Word Sense Disambiguation Using Decision Tree Learning

http://exist.sourceforge.net
A Multilingual Database of Polarity Items
Logic Based Methods for Terminological Assessment

https://sourceforge.net/projects/weka/
Combining Off-the-shelf Grammar and Spelling Tools for the Automatic Evaluation of Scientific Writing (AESW) Shared Task 2016

http://anthy.sourceforge.jp/
Error Correcting Romaji-kana Conversion for Japanese Language Education
Multilingual Text Entry using Automatic Language Detection

http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/omseek/om/languages/
Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

http://textcat.sourceforge.net/
Web Service Integration for Next Generation Localisation

http://sourceforge.net/projects/ar-text-
POS Tagging for Arabic Tweets

http://jlibrary.sourceforge.net/
Integration of a Multilingual Keyword Extractor in a Document Management System

http://webascorpus.sourceforge.net/PHITE.php?sitesig
BlogBuster: A Tool for Extracting Corpora from the Blogosphere

http://hts-engine.sourceforge.net/
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform

http://balie.sourceforge.net/
On Classifying Coherent/Incoherent Romanian Short Texts
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization

http://sourceforge.net/projects/djangology/
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation

http://sourceforge.net/projects/pyaline/
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling

http://mrbayes.sourceforge.net/
Computational analysis of Gondi dialects

http://wntrans.sourceforge.net/
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation

http://dylan.sourceforge
Comparing Attribute Classifiers for Interactive Language Grounding

http://sourceforge.net/projects/secondstring/
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora

http://wordfreak.sourceforge.net/
Annotating Arguments: The NOMAD Collaborative Annotation Tool
The SYNC3 Collaborative Annotation Tool
BOEMIE Ontology-Based Text Annotation Tool
Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus

http://joshua.sourceforge.net/Joshua/
Forest Reranking through Subtree Ranking

http://runes.sourceforge.net/
A Novel Approach to Automatic Gazetteer Generation using Wikipedia
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction

http://sourceforge.net/projects/jpl/
Parsing Akkadian Verbs with Prolog

http://jwordnet.sourceforge.net/han
Building a WordNet for Sinhala

http://mwetoolkit.sourceforge.net
UFRGS&LIF at SemEval-2016 Task 10: Rule-Based MWE Identification and Predominant-Supersense Tagging
A Broad Evaluation of Techniques for Automatic Acquisition of Multiword Expressions
Identification of Ambiguous Multiword Expressions Using Sequence Models and Lexical Resources
Identification of Multiword Expressions for Latvian and Lithuanian: Hybrid Approach
jMWE: A Java Toolkit for Detecting Multi-Word Expressions

http://crfchunker.sourceforge.net/
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
EVBCorpus - A Multi-Layer English-Vietnamese Bilingual Corpus for Studying Tasks in Comparative Linguistics
Resolving Object and Attribute Coreference in Opinion Mining
A Hybrid Word Alignment Model for Phrase-Based Statistical Machine Translation
Word Alignment-Based Reordering of Source Chunks in PB-SMT
Automatic Evaluation of Summary Using Textual Entailment
Handling Named Entities and Compound Verbs in Phrase-Based Statistical Machine Translation
Bootstrapping Method for Chunk Alignment in Phrase Based SMT
Event and Event Actor Alignment in Phrase Based Statistical Machine Translation
Contextual Phrase-Level Polarity Analysis Using Lexical Affect Scoring and Syntactic N-Grams

http://mecab.sourceforge.jp/
A Fast and Accurate Method for Detecting English-Japanese Parallel Texts
A Casual Conversation System Using Modality and Word Associations Retrieved from the Web
A Simple Approach to Unknown Word Processing in Japanese Morphological Analysis
A Compositional Approach toward Dynamic Phrasal Thesaurus
Japanese Named Entity Recognition Using Structural Natural Language Processing

http://sourceforge.net/p/zpar/code/
Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora

http://francais.sourceforge.jp/
JMdict: a Japanese-Multilingual Dictionary

http://flexcrfs.sourceforge.net/
Review Classification Using Semantic Features and Run-Time Weighting
Identifying Sections in Scientific Abstracts using Conditional Random Fields

http://bionlp.sourceforge.net
Refactoring Corpora

http://sourceforge.net/projects/desr/
A Dependency Based Statistical Translation Model

http://multiword.sourceforge.net/sharedtask2018/
VarIDE at PARSEME Shared Task 2018: Are Variants Really as Alike as Two Peas in a Pod?

http://www.sourceforge.net/
Chinese Named Entity Recognition and Word Segmentation Based on Character

http://part.sourceforge.net/
Interoperability and Knowledge Representation in Distributed Health and Fitness Companion Dialogue System

http://sourceforge.net/projects/citeseerx/
Extraction of Author’s Definitions Using Indexed Reference Identification

http://swirl-parser.sourceforge.net/
Question Prediction Language Model

http://sigmakee.sourceforge.net
Representing Story Plans in SUMO

http://jpivot.sourceforge.net
Query log analysis with LangLog

http://jaxe.sourceforge.net/
An Integrated Digital Tool for Accessing Language Resources

https://cmusphinx.svn.sourceforge.net/
Translating English Names to Arabic Using Phonotactic Rules

https://sourceforge.net/projects/broken-
An OpenNMT Model to Arabic Broken Plurals