A total of 12'200 handwritten survey forms contain information on the number of apprenticeship-leave examinations conducted and passed, expected apprenticeship leavers, and details of the contracts per canton in the respective survey year. Furthermore, these data sets contain important information for the history of education, and, the economy.
Information extraction from statistical survey forms from 1935-1969.
12’200 handwritten forms
As part of the research project "Education in Numbers", the Institute of Educational Sciences at the University of Zurich analysed survey forms from the years 1935 to 1969 from the apprenticeship contract statistics of the former Federal Office for Industry, Trade and Labour. The forms contain information on the number of completed apprenticeship-leave examinations, expected apprenticeship leavers, details on apprenticeship contracts per respective canton and much more. The data set is of great importance for educational and economic history, among other things, but the documents were only available in physical and handwritten form.
As an expert in the field of intelligent document processing (IDP), Acodis was commissioned to carry out the information extraction of the statistical survey forms.
At the beginning, a total of 12,200 survey forms were manually photographed and transmitted. The forms had five different layouts and, depending on the year, there were also differences in the handwriting. In order to be able to reproduce the information digitally, "Intelligent Character Recognition" was used, which enables the extraction of handwritten data.
In order to simplify the validation of the extracted values, a web editor was developed which allows the export into any data format. The structure, as well as the use, are easy to understand and do not require a long training period.