Agroecology archive of BSV (Bulletins de Santé du Végétal, plant health bulletins)

This dataset has been published on the initiative and under the responsibility of Nicolas Turenne
Published on November 30, 2016 and updated on December 1, 2016

Information

License
Creative Commons Attribution
Temporal coverage
1946/01 to 2016/11
Frequency
Biannual
Creation date
November 30, 2016
Modification date
December 1, 2016
Latest resource update
December 1, 2016
Territorial coverage granularity
French region
Territorial coverage
France

Extras

ID
583eac9ac751df6321c0bb7e

The corpus describes damage caused by insects and diseases to crops (wheat, grapevine, ...).
It contains about 41,000 documents, of which about 17,000 were published between 1960 and 2000 and have medium-quality text recognition.
Each file reports the level of risk to crops in one region of France. The texts are in French.

Size of the document corpus: 40,899 documents
Size of the document sample: 37 documents (from different regions of France, covering different crops)

Size of the corpus (txt format): 457 MB
Size of the corpus (pdf format): 37 GB

metadata for each file:

_id: name of the file
region: name of a French region (example: Alsace)
crops: list of crop names (example: wheat)
diseases: list of disease names (example: oidium)
insects: list of insect names (example: puceron noir)
risk: risk patterns (example: "12% of fields")
town: list of towns (example: Dijon)
date: publication date of the document
pesticides: list of pesticides (example: d.d.t.)
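
As an illustration, the metadata fields listed above could be held in a small Python record. This is only a sketch: the field names come from the list, but the class name `BsvRecord`, the helper `record_from_dict`, the field types (lists of strings, a plain string for the date) and the sample `_id` value are assumptions, because the actual layout of the extracted-entity files is not described here.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List

# Fields that the description presents as lists (assumption: lists of strings).
LIST_FIELDS = ("crops", "diseases", "insects", "risk", "town", "pesticides")


@dataclass
class BsvRecord:
    """Hypothetical container for the per-file metadata fields."""
    _id: str                      # name of the file
    region: str = ""              # name of a French region
    date: str = ""                # publication date; format not specified, kept as text
    crops: List[str] = field(default_factory=list)
    diseases: List[str] = field(default_factory=list)
    insects: List[str] = field(default_factory=list)
    risk: List[str] = field(default_factory=list)
    town: List[str] = field(default_factory=list)
    pesticides: List[str] = field(default_factory=list)


def record_from_dict(raw: Dict[str, Any]) -> BsvRecord:
    """Build a record from a plain dict, tolerating missing or scalar fields."""
    record = BsvRecord(
        _id=str(raw["_id"]),
        region=str(raw.get("region", "")),
        date=str(raw.get("date", "")),
    )
    for name in LIST_FIELDS:
        value = raw.get(name, [])
        if isinstance(value, str):
            value = [value]       # a single string becomes a one-item list
        setattr(record, name, list(value))
    return record


# Sample values are the examples given in the field list; "_id" is made up.
example = record_from_dict({
    "_id": "example_file",
    "region": "Alsace",
    "crops": ["wheat"],
    "diseases": ["oidium"],
    "insects": ["puceron noir"],
    "risk": ["12% of fields"],
    "town": ["Dijon"],
    "pesticides": "d.d.t.",
})
```

Keeping the date as text avoids guessing a date format that the description does not state.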

The database contains:
cited areas: 27
cited insects: 389
cited diseases: 279
cited pesticides: 727
cited crops: 122

Resources 3

8 downloads

Ecology Crop Disease Newsletter Corpus - PDF format

Available
zip (29.8 GB)

Type
Main file
MIME Type
cc
Created on
December 1, 2016
Modified on
December 1, 2016
Published on
December 1, 2016
1 downloads

EcologySample.rar

Available
rar (14.2 MB)

The archive contains 37 documents (each in both txt and pdf format)
and one file of extracted entities for each document.

Type
Main file
MIME Type
application/rar
sha1
e58702f1f8c03cda07bfc4ec10d1de8043920540
Created on
November 30, 2016
Modified on
November 30, 2016
Published on
November 30, 2016
