Deep Sequoia corpus - PARSEME-FR corpus - FrSemCor

PID

The Sequoia corpus is a set of 3,099 linguistically-annotated French sentences, originating from four sources (Europarl, European Agency Reports, French regional journal L'Est Républicain, and French wikipedia).

Several types of annotations were added over the years. The current release comprises: - parts-of-speech (SEQUOIA ANR-08-EMER-013 project) - syntactic dependency trees - deep syntactic dependency graphs (Deep sequoia project) - multi-word expressions and named entities (PARSEME COST project and PARSEME-FR ANR-14-CERA-0001 project) - coarse semantic tags for nouns (FrSemCor project)

See the deep sequoia page for a detailed description: https://deep-sequoia.inria.fr/

Identifier
PID http://hdl.handle.net/11234/1-3429
Related Identifier https://deep-sequoia.inria.fr/
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-3429
Provenance
Creator Barque, Lucie; Candito, Marie; Constant, Matthieu; Cordeiro, Silvio Ricardo; Crabbé, Benoît; Fort, Karën; Guillaume, Bruno; Haas, Pauline; Huyghe, Richard; Perrier, Guy; Ramisch, Carlos; Ribeyre, Corentin; Savary, Agata; Seddah, Djamé; Segonne, Vincent; Tribout, Delphine; Villemonte de la Clergerie, Eric; Parmentier, Yannick; Pasquer, Caroline; Antoine, Jean-Yves
Publisher ANR
Publication Year 2020
Rights Deep Sequoia Licence; https://lindat.mff.cuni.cz/repository/xmlui/page/deep-sequoia-licence; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language French
Resource Type corpus
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics