Coltec is reported to have signed a contract with Silicon Valley Soft in Jordan to digitize over 800,000 documents. Silicon Valley Soft is a leading software company based in Jordan, pioneering a highly ambitious project to create the first comprehensive digital encyclopedia of Arab laws. The encyclopedia will include laws from almost every country in the Middle East and North Africa, dating back over 60 years.
Coltec has begun the digitization process, which is estimated to amount to over 160 million Arabic words in phase 1 alone. This project marks Coltec's first move into the digitization industry, which it believes represents a relatively untapped market full of potential.
Fadl Al Tarzi, Strategy Consultant to Coltec USA, explains:
"Digitization is in high demand across our region and will continue to be a significant field as organizations and governments continue to realize the value of Information Technology. As these entities realize the importance of actually converting their data to actionable and useful information they will quickly realize that the obvious first step is to digitize it."
Coltec is a pioneer and leading provider of technology for Arabic language information processing, with a focus on information analysis technologies, including search and text mining applications. Coltec's Arabic language tools and services are used not only by leading corporations such as Microsoft but also by various intelligence communities around the world. It is also worth noting that Coltec has been the only company since 1993 to develop Arabic proofing tools tested and in use by millions all around the world.
Ambiguity Resolver:
This project will include the following sub-projects:
a. Corpus (collected from and classified into different domains).
b. Linguistic Model.
c. Parser.
d. Domain knowledge (extracted from Corpus) for each domain.
e. Statistical Model.
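As a rough illustration of how the corpus, domain knowledge, and statistical model listed above might fit together, here is a minimal sketch in Python. It is not Coltec's implementation. The toy corpus, the transliterated word forms, and the function names `build_domain_knowledge` and `resolve` are all hypothetical, and the candidate readings that a linguistic model and parser would normally produce are simply passed in by hand.

```python
from collections import Counter

# Hypothetical toy corpus: domain -> list of (surface form, resolved reading) pairs.
# Real input would be a large Arabic corpus classified into domains.
CORPUS = {
    "legal": [("ktb", "kitab"), ("ktb", "kitab"), ("ktb", "kataba"), ("hkm", "hukm")],
    "news": [("ktb", "kataba"), ("ktb", "kataba"), ("ktb", "kitab"), ("hkm", "hakama")],
}


def build_domain_knowledge(corpus):
    """Count how often each reading of a surface form occurs in each domain."""
    return {domain: Counter(pairs) for domain, pairs in corpus.items()}


def resolve(surface, candidates, domain, knowledge):
    """Pick the candidate reading seen most often in the given domain.

    Ties, unseen forms, and unknown domains fall back to the first candidate.
    """
    counts = knowledge.get(domain, Counter())
    return max(candidates, key=lambda reading: counts[(surface, reading)])


knowledge = build_domain_knowledge(CORPUS)
legal_choice = resolve("ktb", ["kataba", "kitab"], "legal", knowledge)
news_choice = resolve("ktb", ["kataba", "kitab"], "news", knowledge)
```

In this sketch the same unvocalized form resolves differently depending on the domain, which is the reason for keeping separate domain knowledge per domain rather than a single global count.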
Posted by Siba Sami Ammari
