Name: Using Layout Data for the Analysis of Scientific Literature
SKU: 978-3-86844-054-6
Price: 25.00 EUR
Availability: InStock

Using Layout Data for the Analysis of Scientific Literature

Mathiak, Brigitte

ISBN: 978-3-86844-054-6

25,00 €

inkl. 19 % MwSt. zzgl. Versandkosten

Beschreibung

It is said that the world knowledge is in the Internet. Scientific knowledge is in books, journals and conference proceedings. To cope with the huge amount of information clever algorithms are needed. They are filtering, sorting and ultimately mining the information, improving as they get more data. A common technique is to mine the text from the publications. But publications include more information than the their text. The position of a word gives clues about its meaning. Additional images either supplement the text or offer proof to a proposition. Tables only form semantic units when read in rows and columns. To deal with the additional information, classic text mining techniques have to be coupled with spatial data and image data. For this thesis a framework was developed that allows the analysis of layout information in scientific documents. This framework has been used for three case studies. The first one allows the automatic extraction of images and their annotation in the paper. The second one refines that approach as images are further classified into semantic categories based on their content. The third case study examines the use of tables in this context. They all discover knowledge that would not have been visible through classical text mining and give hard evidence to the hypothesis that using layout does indeed improve the possibilities of text mining.

Zusätzliche Informationen

Gewicht	161 kg
Autor	Mathiak, Brigitte
zur Person	Brigitte Mathiak studied computer science at the Carolo- Wilhelmina University of Braunschweig. For her Diplomarbeit she received a faculty award for outstanding student performance. After that she joined a bioinformatics research group at the institute of information systems. In June 2008 she received her doctorate summa cum laude.
Schlagworte, Tags	Layoutanalayse, PDF, Tabellenerkennung, Bildsuche, Text Mining, Information Extraction, Bildverstehen, Dokumentverarbeitung, Layout, Wissenschaftliche Literatur
Auflage	1
Auflage Ergänzung
Bandnummer	7
Erscheinungsdatum	2008-07-08 00:00:00
Abmessungen / Format	12,0 x 19,0
Seiten	113
Zielgruppe
Rezension
Vorwort
Internetressourcen

Der Eibenfreund 11/2004

Usability-Engineering in der Open-Source-SoftwareentwicklungPerspektiven, Vorgehensweisen und Techniken

Gestion complémentaire de la faune sauvage et du bétail en Afrique de l’Ouest: utopie ou perspective de développement?

Treatment of woods with silanes

Using Layout Data for the Analysis of Scientific Literature

Mathiak, Brigitte

Beschreibung

Zusätzliche Informationen

Scheeder, Thomas

Finck, Matthias

Wolfgang Bayer

Donath, Steffen

Using Lay­out Data for the Ana­ly­sis of Sci­en­ti­fic Literature

Mathiak, Brigitte

Beschreibung

Zusätzliche Informationen

Ähnliche Produkte

Scheeder, Thomas

Der Eiben­freund 11/2004

Finck, Matthias

Usa­bi­li­ty-Engi­nee­ring in der Open-Source-Soft­ware­ent­wick­lung­Per­spek­ti­ven, Vor­ge­hens­wei­sen und Techniken

Wolfgang Bayer

Ges­ti­on com­plé­men­tai­re de la fau­ne sau­va­ge et du bétail en Afri­que de l’Ouest: uto­pie ou per­spec­ti­ve de développement?

Donath, Steffen

Tre­at­ment of woods with silanes

Cookie- und Datenschutzeinstellungen

Using Layout Data for the Analysis of Scientific Literature

Der Eibenfreund 11/2004

Usability-Engineering in der Open-Source-SoftwareentwicklungPerspektiven, Vorgehensweisen und Techniken

Gestion complémentaire de la faune sauvage et du bétail en Afrique de l’Ouest: utopie ou perspective de développement?

Treatment of woods with silanes