Text analytics

Text analytics is a search engine for IUCLID data and attachments. It is installed separately from, but connected to, an instance of IUCLID. It allows rapid and sophisticated searching of all IUCLID fields including the text content of attachments. Searches can be carried out on both structured data such as picklists and dates, and on unstructured data such as free text fields and attachments.

Text analytics indexes all the information from IUCLID dossiers using elastic search, which provides for high levels of performance. Text analytics also carries out optical character recognition (OCR) on scans within attachments.

Hardware requirements for a large IUCLID database are presented below; assuming that the Wildfly Server and Elastic Search Server are installed on the same host.

  • CPU: 6
  • RAM: 16 GB (4 GB Wildfly Server, 4 GB Elastic Search Server,  8 GB OS and Tesseract)
  • HDD: 50 GB

The IUCLID 6 Server and its database can also be hosted on the same server. The hardware requirements vary according to the amount of data managed by IUCLID.

Documentation

Text analytics (19th August 2020, version 3.4.0 / compatible with IUCLID 6.4.14)

The installation package contains all the software components needed to deploy Text Analytics on a Windows Server running an Oracle-based installation of IUCLID Server 6.4.14 with Instance-Based Security (IBS) activated. Oracle is not supplied. Deployment instructions are available on this page, above.

Support is provided for only the current version of Text analytics, and IUCLID 6.4.14 working together. For full compatibility information, see the installation manual. There is no updater tool for Text analytics. To use the current version, make a fresh installation.

Please sign in to download files

Categories Display