Platform Launch: Nopaque
Subproject INF (Data Infrastructure and Digital Humanities) of the SFB 1288 has published the web application nopaque. With this platform, researchers can convert their files into formats suitable for further processing, automatically enrich them with information, and then analyze them. Nopaque thus covers a large part of the research workflow in the humanities.
All work steps are coordinated so that the individual services can be used in combination with one another. The services include Optical Character Recognition (OCR), Natural Language Processing (NLP), and a Keyword In Context (KWIC) search. This toolbox can be used by humanities researchers from all disciplines and with different levels of prior knowledge. The data generated at each step can be downloaded afterwards, so that it can be evaluated or processed further with other (external) tools.
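To illustrate what a Keyword In Context search returns, here is a minimal sketch in Python. It is not nopaque's implementation: the function name kwic, the window parameter, and the simple word-based tokenization are assumptions made for illustration only.

```python
import re


def kwic(text, keyword, window=3):
    """Return (left context, match, right context) for each hit of keyword.

    Hypothetical helper for illustration; tokenization is a plain
    word split and matching is case-insensitive.
    """
    tokens = re.findall(r"\w+", text)
    hits = []
    for i, token in enumerate(tokens):
        if token.lower() == keyword.lower():
            # Take up to `window` tokens on each side of the match.
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            hits.append((left, token, right))
    return hits


if __name__ == "__main__":
    sample = "The cat sat on the mat. The dog saw the cat run."
    for left, match, right in kwic(sample, "cat", window=2):
        print(f"{left:>12} [{match}] {right}")
```

Each hit is shown with a few words of context on either side, which is what makes KWIC listings useful for checking how a term is actually used in a corpus.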
Nopaque is developed by a small, interdisciplinary team and operated on servers of Bielefeld University. To implement the services, established open source software is used, such as Tesseract OCR and the NLP toolkit spaCy.
Since the web platform is still under development, the team welcomes feedback to guide its further development.