Name: Using Layout Data for the Analysis of Scientific Literature
SKU: 978-3-86844-054-6
Price: 25.00 EUR
Availability: InStock

Using Layout Data for the Analysis of Scientific Literature

Mathiak, Brigitte

ISBN: 978-3-86844-054-6

25,00 €

inkl. 19 % MwSt. zzgl. Versandkosten

Beschreibung

It is said that the world knowledge is in the Internet. Scientific knowledge is in books, journals and conference proceedings. To cope with the huge amount of information clever algorithms are needed. They are filtering, sorting and ultimately mining the information, improving as they get more data. A common technique is to mine the text from the publications. But publications include more information than the their text. The position of a word gives clues about its meaning. Additional images either supplement the text or offer proof to a proposition. Tables only form semantic units when read in rows and columns. To deal with the additional information, classic text mining techniques have to be coupled with spatial data and image data. For this thesis a framework was developed that allows the analysis of layout information in scientific documents. This framework has been used for three case studies. The first one allows the automatic extraction of images and their annotation in the paper. The second one refines that approach as images are further classified into semantic categories based on their content. The third case study examines the use of tables in this context. They all discover knowledge that would not have been visible through classical text mining and give hard evidence to the hypothesis that using layout does indeed improve the possibilities of text mining.

Zusätzliche Informationen

Gewicht	161 kg
Autor	Mathiak, Brigitte
zur Person	Brigitte Mathiak studied computer science at the Carolo- Wilhelmina University of Braunschweig. For her Diplomarbeit she received a faculty award for outstanding student performance. After that she joined a bioinformatics research group at the institute of information systems. In June 2008 she received her doctorate summa cum laude.
Schlagworte, Tags	Layoutanalayse, PDF, Tabellenerkennung, Bildsuche, Text Mining, Information Extraction, Bildverstehen, Dokumentverarbeitung, Layout, Wissenschaftliche Literatur
Auflage	1
Auflage Ergänzung
Bandnummer	7
Erscheinungsdatum	2008-07-08 00:00:00
Abmessungen / Format	12,0 x 19,0
Seiten	113
Zielgruppe
Rezension
Vorwort
Internetressourcen

In situ transformations of minera particles in soils

Migration in Deutschland

Application of comprehensive gas chromatography (GCxGC) to measurements of volatile organic species in ambient air

Extending Time of Flight Optical 3D Imaging to Extreme Operating Conditions

Using Layout Data for the Analysis of Scientific Literature

Mathiak, Brigitte

Beschreibung

Zusätzliche Informationen

Birkefeld, Andreas

Böttiger, Henrik

Sandra Bartenbach

Büttgen, Bernhard

Using Lay­out Data for the Ana­ly­sis of Sci­en­ti­fic Literature

Mathiak, Brigitte

Beschreibung

Zusätzliche Informationen

Ähnliche Produkte

Birkefeld, Andreas

In situ trans­for­ma­ti­ons of mine­ra par­tic­les in soils

Böttiger, Henrik

Migra­ti­on in Deutschland

Sandra Bartenbach

Appli­ca­ti­on of com­pre­hen­si­ve gas chro­ma­to­gra­phy (GCxGC) to mea­su­re­ments of vola­ti­le orga­nic spe­ci­es in ambi­ent air

Büttgen, Bernhard

Exten­ding Time of Flight Opti­cal 3D Ima­ging to Extre­me Ope­ra­ting Conditions

Cookie- und Datenschutzeinstellungen

Using Layout Data for the Analysis of Scientific Literature

In situ transformations of minera particles in soils

Migration in Deutschland

Application of comprehensive gas chromatography (GCxGC) to measurements of volatile organic species in ambient air

Extending Time of Flight Optical 3D Imaging to Extreme Operating Conditions