Three Methods for Occupation Coding Based on Statistical Learning

zu Verbundenen Objekten

Abstract: Occupation coding, an important task in ofﬁcial statistics, refers to coding a respondent's text answer into one of many hundreds of occupation codes. To date, occupation coding is still at least partially conducted manually, at great expense. We propose three methods for automatic coding: combining separate models for the detailed occupation codes and for aggregate occupation codes, a hybrid method that combines a duplicate-based approach with a statistical learning algorithm, and a modiﬁed nearest neighbor approach. Using data from the German General Social Survey (ALLBUS), we show that the proposed methods improve on both the coding accuracy of the underlying statistical learning algorithm and the coding accuracy of duplicates where duplicates exist. Further, we ﬁnd deﬁning duplicates based on ngram variables (a concept from text mining) is preferable to one based on exact string matches

Standort: Deutsche Nationalbibliothek Frankfurt am Main

Umfang: Online-Ressource

Sprache: Englisch

Anmerkungen: Veröffentlichungsversion
begutachtet (peer reviewed)
In: Journal of Official Statistics ; 33 (2017) 1 ; 101-122

Klassifikation: Informatik

Ereignis: Veröffentlichung

(wo): Mannheim

(wann): 2017

Urheber: Gweon, Hyukjun
Schonlau, Matthias
Kaczmirek, Lars
Blohm, Michael
Steiner, Stefan

DOI: 10.1515/JOS-2017-0006

URN: urn:nbn:de:101:1-2019052715483512319010

Rechteinformation: Open Access; Open Access; Der Zugriff auf das Objekt ist unbeschränkt möglich.

Letzte Aktualisierung: 14.08.2025, 10:48 MESZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Deutsche Nationalbibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Original beim Datenpartner anzeigen

Beteiligte

Gweon, Hyukjun
Schonlau, Matthias
Kaczmirek, Lars
Blohm, Michael
Steiner, Stefan

Entstanden

2017

Ähnliche Objekte (12)

Journal article | Zeitschriftenartikel

Three Methods for Occupation Coding Based on Statistical Learning

Journal article | Zeitschriftenartikel

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

Bibliographie | Bibliography

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

List | Verzeichnis, Liste, Dokumentation

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

Bibliographie | Bibliography

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Arbeitspapier | Working paper

Nonresponse Bias (Version 2.0)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Journal article | Zeitschriftenartikel

Three Methods for Occupation Coding Based on Statistical Learning

Journal article | Zeitschriftenartikel

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

Bibliographie | Bibliography

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

List | Verzeichnis, Liste, Dokumentation

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

Bibliographie | Bibliography

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Arbeitspapier | Working paper

Nonresponse Bias (Version 2.0)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Journal article | Zeitschriftenartikel

Three Methods for Occupation Coding Based on Statistical Learning

Journal article | Zeitschriftenartikel

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

Bibliographie | Bibliography

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

List | Verzeichnis, Liste, Dokumentation

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

Bibliographie | Bibliography

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Arbeitspapier | Working paper

Nonresponse Bias (Version 2.0)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

Bibliographie | Bibliography

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Informationen zur Registrierung von Kultur- und Wissenseinrichtungen finden Sie hier.

Felder mit * müssen ausgefüllt werden.

Benutzername*

Bitte geben Sie Ihren Benutzernamen ein

E-Mail*

Bitte geben Sie Ihre E-Mail ein

Bitte füllen Sie dieses Feld nicht aus

Vorname

Nachname

Passwort*

Bitte geben Sie Ihr Passwort ein

Passwort bestätigen*

Bitte geben Sie das gleiche Passwort ein

Ich habe die Nutzungsbedingungen und die Datenschutzerklärung zur Erhebung persönlicher Daten gelesen und stimme ihnen zu. *

Dieses Feld ist ein Pflichtfeld.

Ich möchte den Newsletter der Deutschen Digitalen Bibliothek abonnieren. Siehe Informationen zum Newsletter-Abonnement.

Benutzerkonto angelegt

Ihr „Meine DDB“-Konto wurde erfolgreich angelegt. Bevor Sie sich in Ihrem Konto anmelden können, müssen Sie auf den Bestätigungslink in der Nachricht klicken, die wir gerade an die von Ihnen angegebene E-Mail-Adresse geschickt haben

Three Methods for Occupation Coding Based on Statistical Learning

Angaben zum Objekt

Klassifikation und Themen

Beteiligte, Orts- und Zeitangaben

Weitere Informationen

Datenpartner

Beteiligte

Entstanden

Ähnliche Objekte (12)

Three Methods for Occupation Coding Based on Statistical Learning

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Nonresponse Bias (Version 2.0)

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Three Methods for Occupation Coding Based on Statistical Learning

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Nonresponse Bias (Version 2.0)

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Three Methods for Occupation Coding Based on Statistical Learning

Respondent incentives in a national face-to-face survey: effects on outcome rates, sample composition and fieldwork efforts

ALLBUS-Bibliographie: (14. Fassung, Stand: 31.07.1996)

ALLBUS-Bibliographie: (18. Fassung, Stand: Juli 2002)

ALLBUS-Bibliographie: (23. Fassung, Stand: Februar 2009)

ALLBUS-Bibliographie: (22. Fassung, Stand: Februar 2008)

ALLBUS-Bibliographie: (17. Fassung, Stand Juni 2001)

German General Social Survey 2002: English translation of the German "ALLBUS"-Questionnaire

ALLBUS-Bibliographie: (19. Fassung, Stand: November 2003)

Nonresponse Bias (Version 2.0)

ALLBUS-Bibliographie: (16. Fassung, Stand Juni 2000)

ALLBUS-Bibliographie: (20. Fassung, Stand: Februar 2005)

Verbundene Objekte

Passwort zurücksetzen