FOR IMMEDIATE RELEASE
February 24, 1999


Joint Research Project on
"Cross-Language Information Retrieval" Starts at AMF
- Overcoming the Language Barrier in Internet Use -


Since the Asian Multimedia Forum (AMF(*1)) was inaugurated in June 1997 by major infocommunications providers in the Asian region, Nippon Telegraph and Telephone Corporation (NTT) has played an active role as a founding member in pursuing the Forum's objectives of contributing to multimedia services and technologies in the region. This has included carrying out various multimedia utilization trials by building international test beds for ATM, satellite and the Internet.

As part of these trials, three parties, NTT, the Korea Advanced Institute of Science and Technology (KAIST(*2): Korea) and Kent Ridge Digital Labs (KRDL(*3): Singapore), have started a joint research project on cross-language information retrieval over the Internet.

The aim of this project is to develop a system that allows the user to enter a search phrase in one language and retrieve data from WWW sites in a number of different languages. This is made possible by combining natural language processing functions (such as translation) with search engines (*4) and/or sites developed by project members for various East Asian languages such as Japanese, Chinese, and Korean.

The entire system, referred to as the Cross-Language Information Retrieval Architecture, consists of multiple "search sites" and "meta-search engines" located on the Internet. Each search site retrieves WWW pages in the same language as the search phrase or in a small number of other languages. Each meta-search engine selects the search sites suited to the user's request and translates the user's search phrase.

For instance, to retrieve WWW information in Japanese, Chinese and Korean in response to a Japanese search phrase, the meta-search engine first selects the search sites for these languages and then translates the query into each language. Finally, it submits the translated queries to the search sites (see Attachment) and compiles the search results they return.
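
The flow described above can be illustrated with a minimal sketch in Python. It is not the project's implementation and does not reflect the actual protocol. The names SearchSite, translate and meta_search, the toy translation table, and the example.com URLs are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SearchSite:
    """Hypothetical search site covering one or more languages."""
    name: str
    languages: List[str]
    index: Dict[str, List[str]]  # toy index: query phrase -> page URLs

    def search(self, query: str) -> List[str]:
        # Return the pages registered for this exact phrase, if any.
        return self.index.get(query, [])


# Toy lookup table standing in for a real translation component (assumption).
TRANSLATIONS = {
    ("ja", "ko", "音楽"): "음악",
    ("ja", "zh", "音楽"): "音乐",
}


def translate(phrase: str, source: str, target: str) -> str:
    """Translate a phrase; fall back to the original if no entry exists."""
    if source == target:
        return phrase
    return TRANSLATIONS.get((source, target, phrase), phrase)


def meta_search(phrase: str, source_lang: str, target_langs: List[str],
                sites: List[SearchSite]) -> Dict[str, List[str]]:
    """Select sites per language, translate the query, and compile results."""
    results: Dict[str, List[str]] = {lang: [] for lang in target_langs}
    for lang in target_langs:
        query = translate(phrase, source_lang, lang)
        for site in sites:
            if lang in site.languages:  # site selection step
                results[lang].extend(site.search(query))
    return results


# Usage: a Japanese search phrase retrieving Japanese, Korean and Chinese pages.
sites = [
    SearchSite("ja-site", ["ja"], {"音楽": ["http://example.com/ja/1"]}),
    SearchSite("ko-site", ["ko"], {"음악": ["http://example.com/ko/1"]}),
    SearchSite("zh-site", ["zh"], {"音乐": ["http://example.com/zh/1"]}),
]
compiled = meta_search("音楽", "ja", ["ja", "ko", "zh"], sites)
```

In this sketch the results are grouped by language; a real meta-search engine would also need to merge and rank them, which is outside the scope of this illustration.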

The unique feature of the system is that it employs a newly developed common protocol, called the "Cross-Language Information Retrieval Protocol", for the exchange of information between meta-search engines and search sites. The new protocol enables search sites and meta-search engines to be combined flexibly, and makes it possible to expand the system into new languages or fields.

Users will be able to retrieve multimedia information, such as photographs and music captioned in foreign languages.

A draft of the protocol has been formulated, and NTT, KAIST and KRDL are currently constructing meta-search engines and search sites that enable cross-language retrieval between Japanese/Korean/Chinese and English. NTT is constructing its sites based on the Internet cross-language information retrieval system (TITAN(*5)) that was developed by the Cyber Space Laboratories.

In March this year, interconnection tests will start among the parties, and a trial service on the Internet is likely to be launched by June. In addition, the parties plan to make the protocol specifications public on the Internet for wider distribution and promotion.

Also, the AMF will hold its sixth plenary meeting on February 25 and 26 at Cheju-Do, Republic of Korea. A demonstration of part of the aforementioned system will take place at the meeting.



Reference

*1 AMF (The Asian Multimedia Forum)
An open forum for private companies and organizations established in June 1997 by NTT and 16 telecommunication carriers, etc., with the objective of jointly developing, facilitating usage of, and structuring platforms for multimedia applications and services in the Asia-Pacific region. As of January 1999, the number of member companies and organizations exceeds 50.
The AMF website is at:
http://www.asiamf.org

*2 KAIST (Korea Advanced Institute of Science and Technology)
Research and education organization under the Korean Ministry of Science and Technology
URL:
http://www.kaist.ac.kr/

*3 KRDL (Kent Ridge Digital Labs)
Research and education organization under the National Science and Technology Board, Singapore
URL:
http://www.krdl.org.sg/

*4 Search engine
Service or software that comprehensively collects information resources (such as WWW pages) in advance and searches the collected information for items related to a given phrase.

*5 TITAN
System that allows a search request made in one language to retrieve Internet documents written in another language.
The URL of the trial service is:
http://titan.mcnet.ne.jp/



Attachment
-Diagram of Cross-Language Information Retrieval Architecture



For further information, please contact:

Norihiko Ohkubo or Megumi Inaji
Public Relations Office
NTT Long Distance and Global Provisional Headquarters
Telephone: (03) 3500-8020



