Home UNL Development Set What is UDS?

A new UNL Explorer has been developed and is provided at www.undl.org/unlexp/ and www.unl.undl.org/unlexp/ by the UNL Center.



The UNL Explorer is a UNL-based Multilingual Intelligent Information and Knowledge Management System.



The UNL Explorer provides users with an integrated environment that users can search for and edit knowledge and information based on UNL.



Using the UNL Explorer, search for information of UNL Encyclopedia can be made by content, keyword or keyconcept search, or though navigation of the UW System of the UNL Ontology



The UNL Explorer has the following characteristics:


Semantic Co-occurrence Relation Search

whether a relation between two UWs is true or not, what UWs can have a relation with another UW, or what relations are possible between two UWs can be verified based on the UNL Ontology.




UNL-based Semantic Network Search

Content search is carried out based on UNL Expressions.




Intensional Definition-based Inference

every UW is given an intentional definition to specify its all the essential properties. the definition is used in inference about all concepts it related.




Multilingual Information Processability

the UNL Explorer allows users to search for information described in whatever language using their native languages, or to provide or edit information using their native languages.




UNL Graphical Editability

A graphical editor of UNL Expressions works together with the UNL Explorer.

Page Access Counter

     About UNL : 20440
     UNL Programme : 7598
     UNL System : 25111
     UNL Applications : 7222
     UNL Materials : 17884
     UNL & Related Activities : 5416
     UNL Society : 8061
     UDS : 11118
     About UNDLF : 17413
     UW Gate : 3478
     UNL Platform : 3679
     UNL Explorer : 2155

What is UDS?

The UDS (UNL Development Set) is a set of tools of the UNL System for developers to use to develop conversion modules between languages and UNL. It contains the DeConverter, the EnConveter, the Word Dictionary Builder, and specifications or manuals of the tools. More information on these tools is explained at the UNL System.

Who can use the UDS?


For using the UDS, it is necessary to sign the following agreements:

"AGREEMENT TO ENTER THE UNL SOCIETY"
"UNL DEVELOPMENT SET LICENSE OF AGREEMENT "

For more information about the UNL Society see here.

How to start on developing a deconversion module?


To develop a language deconversion module using the DeConverter provided by UNL Center needs to develop a word dictionary and deconversion rules of the language. The word dictionary provides correspondent words of the language of UWs that appear in UNL Expressions of input of the DeConverter, and grammatical attributes (features) of the words. Deconversion rules of the language describe operations of processes to deconvert UNL Expressions to sentences of the language. Detailed information on the DeConverter, deconversion rules and word dictionary is given in the specifications of the DeConverter and the manual of the Word Dictionary Builder. All tools, specifications and manuals of the UDS can be downloaded at to downlaod.

In the following explanation, "d.txt" is a list of examples of entry of English word dictionary. "elgexam.txt" is a set of examples of English deconversion rules. Using these word entries and rules, the example of UNL Expression “example.unl” can be deconverted into an English sentence.

To start on developing a deconversion module can simply follow the steps.

STEP 1

To prepare dictionary data

To prepare text data of word dictionary entries for words of target language that can express the meanings of UWs included in UNL Expressions. And if necessary, also to prepare dictionary entries for function words such as an affix, an ending, etc. Description format of text data of word dictionary entry is given in the manual of the Word Dictionary Builder.

STEP 2

To convert dictionary data


To convert text data of word dictionary entries into IBAM formatted files.
DicBldL.exe” is used to convert one-byte code language word dictionary data.
DicBldC.exe” is used to convert two-byte code language word dictionary data.
Usage of the Dictionary Builder tools is shown in the manual.
"d.dic" and "d.pix" are examples of the IBAM formatted files made from “d.txt” using "DicBldL.exe".

STEP 3

To write deconversion rules

Information on how to write deconversion rules is given in the specifications of the DeConveter. "elgexam.txt" is an example of English deconversion rules, using which "example.unl" can be deconverted into an English sentence.

STEP 4

To deconvert

The “DeCoL” version is used to deconvert UNL into one-byte code languages.
The “DeCoC” version is used to deconvert UNL into two-byte code languages.
"example_decoe.txt" shows the results (with trace) of deconversion from "example.unl".
Usage of the DeConverter is shown in the specifications.

STEP 5

To check the result


The DeConverter can output detailed traces of deconversion processes. If a result is not correct, the causes can be detected by checking the trace. What information is included in the traces is explained in the specifications.

How to start on developing an encoversion module?


To develop a language enconversion module using the EnConverter provided by UNL Center needs to develop a word dictionary and enconversion rules of the language. The word dictionary provides correspondent UWs of words included in input sentences of the language, and grammatical attributes (features) of the words. Enconversion rules of the language describe operations of processes to enconvert sentences of the language into UNL Expressions. Detailed information on the EnConverter, enconversion rules and word dictionary is given in the specifications of the EnConverter and the manual of the Word Dictionary Builder. All tools, specifications and manuals of the UDS can be downloaded at to downlaod.

To start on developing a deconversion module can simply follow the steps.

STEP 1

To prepare dictionary data


To prepare text data of word dictionary entries for all words included in input sentences. Correspondent UWs must be given to meaningful (independent) words. The EnConverter uses the UWs to create UNL Expressions.
Description format of text data of word dictionary entries is given in the manual of the Word Dictionary Builder.
"eng.txt" is an example of English input sentences.
"d.txt" is an example of English word dictionary, which includes the entries of all words included in “eng.txt”.

STEP 2

To convert dictionary data

To convert text data of word dictionary entries into IBAM formatted files.
DicBldL.exe” is used to convert one-byte code language word dictionary data.
DicBldC.exe” is used to convert two-byte code language word dictionary data.
Usage of the Dictionary Builder tools is shown in the manual.
"d.dic" and "d.pix" are examples of the IBAM formatted files made from “d.txt” using "DicBldL.exe".

STEP 3

To write enconversion rules


Information on how to write enconversion rules is provided in the specifications of the EnConveter. "elaexam.txt" is an example of English enconversion rules, using which "eng.txt" can be enconverted into UNL.

STEP 4

To enconvert


The “EnCoL” version is used to enconvert sentences of one-byte code languages.
The “EnCoC” version is used to enconvert sentences of two-byte code languages.
"eng.unl" is the results (including trace and UNL Expression) of enconversion from "eng.txt".
Usage of the DeConverter is shown in the specifications.

STEP 5

To check the results, if not correct to revise dictionary entries or rules
The EnConverter can output detailed traces of enconversion processes. Problems can be detected by checking the traces. What information is included in the traces is explained in the specifications.

To download


There are two versions of the DeConverter, EnConverter and Dictionary Builders, C - Version and L - Version. The C - Versions are developed for dealing with two-byte code languages of Chinese (GB code), Korean (KIS code), Thai language, and so on. The L - Versions are developed for dealing with ASCII codes, any one-byte code languages such as Arabic, Latin languages, and Hindi.

DeConverter
Version 2006 C DOWNLOAD
Version 2006 L DOWNLOAD
Specifications DOWNLOAD
EnConverter
Version 3.3 C DOWNLOAD
Version 3.3 L DOWNLOAD
Specifications DOWNLOAD
Word Dictionary Builder
DicBldC DOWNLOAD
DicBldL DOWNLOAD
Manual DOWNLOAD