Conference paper

Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers

We present a methodology for analyzing the linguistic evolution of scientific registers with data mining techniques, comparing the insights gained from shallow features with those gained from linguistic features. The focus is on selected scientific disciplines at the boundary with computer science (computational linguistics, bioinformatics, digital construction, microelectronics). The analysis is based on the English Scientific Text Corpus (SCITEX), which covers a time range of roughly thirty years (1970/80s to early 2000s) (Degaetano-Ortlieb et al., 2013; Teich and Fankhauser, 2010). In particular, we investigate the diversification of scientific registers over time. Our theoretical basis is Systemic Functional Linguistics (SFL) and its specific incarnation of register theory (Halliday and Hasan, 1985). Methodologically, we combine corpus-based feature extraction with data mining techniques.

Creator: Degaetano-Ortlieb, Stefania; Fankhauser, Peter; Kermes, Hannah; Lapshinova-Koltunski, Ekaterina; Ordan, Noam; Teich, Elke

In copyright

Language: English

Subject: Corpus <Linguistics>
Linguistics

Event: Intellectual creation

(who): Degaetano-Ortlieb, Stefania
Fankhauser, Peter
Kermes, Hannah
Lapshinova-Koltunski, Ekaterina
Ordan, Noam
Teich, Elke

Event: Publication

(who): Reykjavik : European Language Resources Association (ELRA)

(when): 2014-06-13

URN: urn:nbn:de:bsz:mh39-26178

Last update: 06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Other Objects (12)

Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers

Graphical error mining for linguistic annotated corpora

Linguistic and translation studies in scientific communication

Linguistic features and genre profiles of scientific English

Pitfalls in applying text mining to scientific literature

The identity of social impact venture capitalists: exploring social linguistic positioning and linguistic distinctiveness through text mining

Research on reasonable coal pillar staggered distance in shallow multi-seam mining

Strength constraints of shallow crustal strata from analyses of mining induced seismicity

Scientific reports of mining, metallurgy and materials in Ukraine

Collection of essays

Scientific data mining and knowledge discovery : principles and foundations

Thesis

Integration of data mining into scientific data analysis processes

Thesis

Integration of Data Mining into Scientific Data Analysis Processes
