Copyright 2009. This text is freely available provided the text is distributed with the header information provided.
Les droits de reproduction des gravures ont été achetés de la Bibliothèque Nationale de France grâce à une subvention accordée par le Conseil de recherches en sciences humaines du Canada. Les autres éléments du projet (les contributions des éditeurs, les transcriptions des textes, l'encodage et le code) sont distribués sous les termes de cette licence: Creative Commons Paternité - Pas d'Utilisation Commerciale - Pas de Modification 2.5 Canada.
The document index provides access to plain-text versions of all the texts in the collection (in UTF-8 encoding) through simple URLs, so that it's possible to run text-analysis tools against them. Text-analysis across the whole collection, or subsets of the collection, will most likely be more interesting, however, and we will provide suitable links here in future. For the moment, plain text and XML versions of the whole collection are available:
In the case of all of the plain-text versions, all editorial content (annotations, metadata, notes, etc.) has been stripped out, leaving only the original text. In the case of marked-up images, this means only the text that appears as part of the engraving. The XML corpus is complete with all headers and editorial annotation.
A simple way to get started is to plug one of these URLs into the TAPoR Tools available here: