<?xml version="1.0" encoding="utf-8"?>
<lom xmlns="http://www.imsglobal.org/xsd/imsmd_v1p2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.imsglobal.org/xsd/imsmd_v1p2 http://www.imsglobal.org/xsd/imsmd_v1p2p4.xsd"><general><identifier>300</identifier><title><langstring xml:lang="en">Converting arXiv into XHTML+MathML - access to scientific papers</langstring></title><description><langstring xml:lang="en">&lt;p&gt;This is the presentation that Michael Kohlhase gave at the @Science conference &lt;a href="./conferences/milan" title="@Science conference in Milan web page"&gt; "Making Science Accessible"&lt;/a&gt;.&lt;br /&gt;
He explains what their work is about, namely translating the collection of scientific publications of the Cornell e-Print Archive (arXiv) using the LATEXtoXML converter, which is currently under development.&lt;br /&gt;
The main technical task of the arXMLiv project is to supply LaTeXML bindings for the (thousands of) LATEX classes and packages used in the arXiv collection. To this aim, they developed a distributed build system that reiteratively runs LaTeXML over the arXiv collection and collects statistics about, e.g., the most sorely missing LaTeXML bindings and clusters common error events. This creates valuable feedback to both the developers of the LaTeXML package and to binding implementers. &lt;/p&gt;
&lt;p&gt;The results of the conversion are impressive: the complete arXiv collection of more than 400,000 documents has been processed from 1993 until 2006 (one run is a processor-year-size undertaking) and the success rate is more than 56% (i.e., over 56% of the documents that are LATEX have been converted by LaTeXML without noticing an error and are available as XHTML+MathML documents). &lt;/p&gt;
&lt;p&gt;These documents are directly accessible by blind and partially sighted users, because of the availability of readers.&lt;/p&gt;
</langstring></description><coverage><langstring xml:lang="en"></langstring></coverage></general><lifecycle><version><langstring xml:lang="en"></langstring></version><contribute><role><source><langstring xml:lang="en">LOMv1.0</langstring></source><value><langstring xml:lang="en">author</langstring></value></role><centity><vcard>BEGIN:VCARD
FN:Michael Kohlhase
EMAIL;INTERNET:
ORG:Jacobs University, Bremen, Germany
END:VCARD</vcard></centity><date><datetime></datetime></date></contribute></lifecycle><technical><format>MP3, pdf</format><size>5638 KB, 1014 KB</size><location>n/a</location><requirement><name><source><langstring xml:lang="en">LOMv1.2</langstring></source><value><langstring xml:lang="en"></langstring></value></name></requirement><installationremarks><langstring xml:lang="en"></langstring></installationremarks><otherplatformrequirements><langstring xml:lang="en"></langstring></otherplatformrequirements><duration><datetime></datetime><description><langstring xml:lang="en"></langstring></description></duration></technical><educational><typicalagerange><langstring xml:lang="en"></langstring></typicalagerange></educational><rights><cost><source><langstring xml:lang="en">LOMv1.2</langstring></source><value><langstring xml:lang="en"></langstring></value></cost><copyrightandotherrestrictions><source><langstring xml:lang="en">LOMv1.2</langstring></source><value><langstring xml:lang="en"></langstring></value></copyrightandotherrestrictions><description><langstring xml:lang="en"></langstring></description></rights><classification/></lom>

