Text analytics

Text analytics is a search engine for IUCLID data and attachments. It is installed separately from, but connected to, an instance of IUCLID. It allows rapid and sophisticated searching of all IUCLID fields, including the text content of attachments. Searches can be carried out both on structured data, such as picklists and dates, and on unstructured data, such as free-text fields and attachments.

Text analytics indexes all the information from IUCLID dossiers using Elasticsearch, which provides high levels of performance. Text analytics also carries out optical character recognition (OCR) on scans within attachments.
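
As an illustration of how a single search can combine structured and unstructured criteria, the sketch below builds an Elasticsearch-style bool query as a plain Python dictionary. The function name and the field names (endpoint_type, study_date, attachment_text) are hypothetical; the index schema actually used by Text analytics is not described on this page.

```python
import json


def build_query(free_text, picklist_value, date_from):
    """Combine exact filters on structured fields with a full-text match.

    All field names are hypothetical placeholders, not the real schema.
    """
    return {
        "query": {
            "bool": {
                # Unstructured data: full-text match on extracted text.
                "must": [{"match": {"attachment_text": free_text}}],
                # Structured data: exact picklist value and a date range.
                "filter": [
                    {"term": {"endpoint_type": picklist_value}},
                    {"range": {"study_date": {"gte": date_from}}},
                ],
            }
        }
    }


query = build_query("skin irritation", "in vivo", "2015-01-01")
print(json.dumps(query, indent=2))
```

In a bool query, filter clauses only include or exclude documents, whereas must clauses also contribute to the relevance score, which is why the free-text criterion is placed under must.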

The hardware requirements for a large IUCLID database are presented below, assuming that the WildFly server and the Elasticsearch server are installed on the same host.

  • CPU: 6
  • RAM: 16GB (4GB WildFly Server, 4GB Elasticsearch Server, 8GB OS and Tesseract)
  • HDD: 50GB
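
The figures above can be turned into a quick pre-installation check. The following is a minimal sketch, not part of the installation package: it assumes that "CPU: 6" means six logical CPUs and that "HDD: 50GB" refers to free disk space, and the names check_host and RAM_BUDGET_GB are illustrative only.

```python
import os
import shutil

# Figures taken from the list above. Interpreting "CPU: 6" as logical
# CPUs and "HDD: 50GB" as free space is an assumption of this sketch.
REQUIRED_CPUS = 6
RAM_BUDGET_GB = {"wildfly": 4, "elasticsearch": 4, "os_and_tesseract": 8}
REQUIRED_DISK_GB = 50


def check_host(path="."):
    """Compare this host's CPU count and free disk space with the figures."""
    cpus = os.cpu_count() or 0
    free_gb = shutil.disk_usage(path).free / 1024**3
    return {
        "cpu_ok": cpus >= REQUIRED_CPUS,
        "disk_ok": free_gb >= REQUIRED_DISK_GB,
        # Total memory the host should provide: 4 + 4 + 8 GB.
        "ram_required_gb": sum(RAM_BUDGET_GB.values()),
    }


print(check_host())
```

Installed memory is not measured here because the Python standard library has no portable call for it; the sketch only reports the total that the per-component budget adds up to.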

The IUCLID 6 Server and its database can also be hosted on the same host. The hardware requirements vary according to the amount of data managed by IUCLID.

Text analytics (18th December 2019, version 3.2.1 / compatible with IUCLID 6.4)

The installation package contains all the software components needed to deploy Text analytics on a Windows Server running an Oracle-based installation of IUCLID Server 6.4. Oracle is not supplied. Deployment instructions are available above on this page.
