Government Of India
Bhasik Indian English Disfluency Corpus

Indian English disfluency corpora from the technical lecture domain

About Dataset

Indian English disfluency corpora: Human-annotated disfluency corpus (DASIE (H)) comprising over 240K words for the technical lecture domain


Sample Dataset:
english "['.' 'the' 'number' 'of' 'microorganisms' 'present' 'in' 'air' 'depends'
 'on' 'factors' 'such' 'as' ',' 'extent' 'of' 'movement' 'of' 'air' ','
 'sunshine' ',' 'humidity' ',' 'location' 'and' 'amount' 'of' 'suspended'
 'dust' 'in' 'the' 'air' '.' 'air' 'with' 'low' 'moisture' ',' 'dust'
 'content' 'and' 'high' 'temperature' 'characteristically' 'having' 'a'
 'low' 'microbial' 'load']" "[' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O'
 ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O'
 ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O'
 ' O' ' O' ' O' ' O' ' O' ' O' ' O' ' O']"
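
The sample row stores two space-separated, single-quoted lists: the tokens and one label per token (50 of each in this row). Below is a minimal sketch of how such a row might be parsed into (token, label) pairs. It assumes every item is wrapped in single quotes, as in the sample, so a token that itself contains an apostrophe would need extra handling. The function names `parse_array_string` and `align` are hypothetical, and the label set is an assumption too: the sample only shows `O`, which in common tagging schemes marks a fluent token, but the listing does not document the other labels.

```python
import re

# Matches one single-quoted item inside an array-style string, e.g. 'air' or ' O'.
# Assumption: items never contain a single quote themselves.
_ITEM = re.compile(r"'([^']*)'")


def parse_array_string(text):
    """Return the quoted items of a string such as "['a' 'b']", with padding stripped."""
    return [item.strip() for item in _ITEM.findall(text)]


def align(tokens_field, labels_field):
    """Pair each token with its label; fail loudly if the two lists differ in length."""
    tokens = parse_array_string(tokens_field)
    labels = parse_array_string(labels_field)
    if len(tokens) != len(labels):
        raise ValueError(
            f"token/label length mismatch: {len(tokens)} vs {len(labels)}"
        )
    return list(zip(tokens, labels))


# A shortened excerpt of the sample row shown above.
tokens_field = "['.' 'the' 'number' 'of' 'microorganisms']"
labels_field = "[' O' ' O' ' O' ' O' ' O']"

for token, label in align(tokens_field, labels_field):
    print(token, label)
```

The length check is there because a row whose token and label counts disagree cannot be aligned safely. Raising an error seems preferable to silently truncating with `zip`.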

Activity Overview

  • Downloads 1
  • Downloads 3
  • Views 40
  • File Size 555.72 KB

Tags

  • Indian English
  • Data-annotation
  • expert-annotated
  • english
  • language_creators:expert-generated
  • disfluency
  • corpora
  • 240k words

License Control

Attribution 4.0 International (CC BY 4.0)

bhasik_Indian_English_disfluency_corpus (1 directory)


Directory
bhasik_Indian_English_disfluency_corpus

2 files, 1 directory

Version Control

Version 1 (555.72 KB)
  • admin · 4 month(s) ago
    • Folder: bhasik_Indian_English_disfluency_corpus
      • Folder: bhasik_Indian_English_disfluency_corpus