archive de youtube sur le lancement d'alerte

This dataset has been published on the initiative and under the responsibility of nicolas turenne
Published on 30 de noviembre de 2016 and updated on 30 de noviembre de 2016

nicolas turenne

Informations

Licencia
Creative Commons Attribution
Cobertura temporal
2007/01 to 2016/11
Frequency
Bianual
Fecha de creación
30 de noviembre de 2016
Modification date
30 de noviembre de 2016
Latest resource update
30 de noviembre de 2016

Extras

ID
583f0e5388ee38029fc65bb3
Fecha de creación
30 de noviembre de 2016
Modification date
30 de noviembre de 2016

Description of the corpus

The corpus describes videos about whistleblowing on the Youtube social media.
Goal of the corpus is the detect automatically new videos (persons or organizations) emitting whistleblowing.
The corpus aims at finding patterns for that purpose.

size of video corpus : 347,544
size of video sample : 22 (one for each topic class)

metadata :

_id : video id
title : video title
channelid : channel id
channeltitle : channel title
datepub : publication date of the video
description : description field of the video
tags : list of keywords for a video
kind : a type , for instance youtube#video
defaultaudiolang : default language of a video
viewcount : number of views for a video
likecount : number of users who likes a video
dislikecount : number of users who do not like a video
commentscount : number of comments
comments : a list of comments :
author : author of a comment
like : number of users who like the comment
message : content of a comment
transcription : video transcription in free text

Resources 2

See also: community resources
2 downloads

Sample Dataset about 22 Youtube transcription videos on whistleblowing

Disponible
txt (678.2Ko)

Description of the corpus

The corpus describes videos about whistleblowing on the Youtube social media.
Goal of the corpus is the detect automatically new videos (persons or organizations) emitting whistleblowing.
The corpus aims at finding patterns for that purpose.

size of video sample : 22 (one for each topic class)

metadata :

_id : video id
title : video title
channelid : channel id
channeltitle : channel title
datepub : publication date of the video
description : description field of the video
tags : list of keywords for a video
kind : a type , for instance youtube#video
defaultaudiolang : default language of a video
viewcount : number of views for a video
likecount : number of users who likes a video
dislikecount : number of users who do not like a video
commentscount : number of comments
comments : a list of comments :
author : author of a comment
like : number of users who like the comment
message : content of a comment
transcription : video transcription in free text

Tipo
Main file
MIME Type
text/plain
sha1
f8fd88899d291b8713f56245e7234bbf331a31b0
Created on
30 de noviembre de 2016
Modified on
30 de noviembre de 2016
Published on
30 de noviembre de 2016
5 downloads

Dataset about 347,000 Youtube transcription videos on whistleblowing

Disponible
zip (114.1Mo)

Description of the corpus

The corpus describes videos about whistleblowing on the Youtube social media.
Goal of the corpus is the detect automatically new videos (persons or organizations) emitting whistleblowing.
The corpus aims at finding patterns for that purpose.

size of video corpus : 347,544

metadata :

_id : video id
title : video title
channelid : channel id
channeltitle : channel title
datepub : publication date of the video
description : description field of the video
tags : list of keywords for a video
kind : a type , for instance youtube#video
defaultaudiolang : default language of a video
viewcount : number of views for a video
likecount : number of users who likes a video
dislikecount : number of users who do not like a video
commentscount : number of comments
comments : a list of comments :
author : author of a comment
like : number of users who like the comment
message : content of a comment
transcription : video transcription in free text

Tipo
Main file
MIME Type
application/zip
sha1
24dce9048ade6710d20aa613a0f805eed7c86295
Created on
30 de noviembre de 2016
Modified on
30 de noviembre de 2016
Published on
30 de noviembre de 2016

Embed

You can easily embed this dataset on your website by pasting this snippet in your html page.

Community resources 0

You have built a more comprehensive database than those presented here? This is the time to share it!

Reutilizaciones 0

You reused these data and published an article, a computer graphics, or an application? It's time to let you know! Reference your work in just a few clicks and increase your visibility.

Discussions 0

Discussion between the organization and the community about this dataset.