Digital Maktaba
AI-Driven Digital Library for Non-Latin Scripts
Digital Maktaba is a web platform for the acquisition, digitisation, cataloguing, and access of documentary heritage in non-Latin scripts, with a focus on Arabic, Persian, and Azerbaijani.
- The system combines Optical Character Recognition (OCR), Natural Language Processing (NLP), and ad hoc technologies developed to support librarians in managing large-scale collections that traditional tools cannot process.
- Developed within the PNRR-funded ITSERR project (Italian Strengthening of the ESFRI RI RESILIENCE), Digital Maktaba follows the “AI in the loop, human in charge” paradigm: the system proposes automatic cataloguing suggestions, while domain experts retain full control over validation and correction.
- The platform is open source under the GNU General Public License v3.0 and is deployed as a containerised stack (Docker Compose) with Keycloak-based authentication via D4Science.




