Tagging Tool is a language resource development software
developed by GIST, C-DAC Pune.
More on Dictionary Tagging Tool. . .
As the name suggests Imla Shanaas
is a spell-checker for modern Urdu used both in India
and Pakistan. The Spell-checker has features which incorporate
the latest in both technology as well as in language.
More on Imla
Shanaas. . .
Thesaurus And Dictionary Building Tools
In the areas of NLP, thesauri and dictionaries contribute as major databases for various activities. They are rich source of words and synonyms, which is highly required for tools and applications running on corpus. They are also the backbone of NLP related work like machine translation, search engines and also for developing as well as evaluating spell checkers.
The need for high-end Indian language databases in official languages constantly makes itself felt and C-DAC , Gist has taken up the challenge to provide unique and simple solution for multilingual country like India by proposing tool that facilitate the generation of thesauri and dictionaries.
This tool is designed for building a large database of synonyms for respective headwords. The Gist-Synonym Builder Tool is a good way to digitalize and store synonym data. The Encoding for the stored data is UNICODE. Rarely used synonyms can be added for head word. Also Grammatical information can be preserved for head word.
Thesaurus Generation Tool
More on Thesaurus Generation Tool. . .
The Thesaurus generation Tool is a good way to help digitalize and store. Thesaurus data in XML file format. Various traditional Thesauri of different languages are studied thoroughly to design it.
CLDR (Common Locale Data Repository)
CLDR is the largest and most extensive standard repository of locale data. This data is used for software internationalization and localization i.e. adapting software to the conventions of different languages.
More on CLDR. . .