A Deep Learning Based Morph Analyzer

This repository holds the code for a Neural Network based Morph Analyzer built for Hindi Language. The analyzer accepts a UTF-8 encoded sentence in Devnagri Script and outputs a list of words complete with its lemma and a set of 6 Morphological features namely POS, Gender, Person, Case, Number, TAM Marker.

The analyzer employs a CNN-RNN model with multi-task learning to jointly learn all the six morphological tags and the lemma for each word.

Installation

Create a python3 virtual environment with keras, scikit-learn, pandas, numpy and matplotlib installed.

Clone the repository and you are ready to go.

Downloading Datasets

The Hindi-Urdu Dependency Treebanks can be download from this webpage hosted by IIIT-Hyderabad.

The downloaded datasets should can then be extracted in the datasets directory.

Usage

The code for generating predictions rests in the make_prection.py file. It takes input from input.txt and prints the output to output.txt. Should you need to predict more than one sentence, just separate the sentences by newline.

python make_prediction.py

Creating Encoders

The file make_encoders.py hosts the code to generate Label Encoders for each of the Morphological Tags. The encoders are pickled and saved to the file tag_encoders.pickle

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
X_idx2word		X_idx2word
X_word2idx		X_word2idx
accuracy.txt		accuracy.txt
conll.py		conll.py
enc		enc
feature-encoders.pickle		feature-encoders.pickle
frozen_training_weights.hdf5		frozen_training_weights.hdf5
generate_input_file.py		generate_input_file.py
input-train.pickle		input-train.pickle
input.txt		input.txt
make_encoders.py		make_encoders.py
make_prediction.py		make_prediction.py
n		n
output.txt		output.txt
outs.txt		outs.txt
phonetic_feature_encoders		phonetic_feature_encoders
predict_with_features.py		predict_with_features.py
tester.py		tester.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Deep Learning Based Morph Analyzer

Installation

Downloading Datasets

Usage

Creating Encoders

About

Releases

Packages

Languages

Eerie16/deep-learning-morph-analyzer

Folders and files

Latest commit

History

Repository files navigation

A Deep Learning Based Morph Analyzer

Installation

Downloading Datasets

Usage

Creating Encoders

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages