The Macau Legal Affairs Bureau Localization Project is a core initiative in response to the national localization strategy, driving the autonomous control of government systems. The project aims to fully migrate the bureau's decades of citizen registration and notary documents from traditional IT infrastructure to a localized software and hardware platform — building a secure, efficient, and scalable e-government document management system. The project covers six core business systems: vehicle registration, civil registration, property registration, notary documents, commercial registration, and aircraft/vessel registration — involving over 2 million files and more than 10 million total documents.
The project utilizes a self-developed cross-platform migration tool supporting both Windows and Linux environments for seamless migration from heterogeneous systems. Based on cloud database architecture, all document data was securely migrated to a localized cloud platform. Proprietary intelligent parsing engines automatically recognize and extract structured data from WORD documents in multiple formats and languages — storing them in designated databases. Through this project, the Macau Legal Affairs Bureau established a secure, sovereign, and controllable digital archive foundation for smart government development.
Covering six core business systems — vehicle, civil, property, notary, commercial, and aircraft/vessel registration — comprehensive cross-system, cross-platform historical document migration and consolidation.
Proprietary migration tool supporting simultaneous Windows and Linux operation — adapted to localized server environments for efficient, stable migration.
Localized cloud database technology enabling secure migration from traditional storage to cloud platforms — supporting elastic scaling, high availability, and disaster recovery.
Self-developed parsing engine supporting multiple WORD formats (.doc/.docx) — Chinese-Portuguese bilingual and mixed language content — extracting text, tables, signatures and more.
Transforms unstructured document content into structured data — auto-identifying document types, party information, registration numbers, dates and storing in database fields.
Customized structuring models for commercial registration archives — auto-extracting company names, registered capital, shareholder information, business scope, and more.