A Preliminary Study for Building an Arabic Corpus of Pair Questions-Texts from the Web: AQA-Webcorp.
Wided BakariPatrice BellotMahmoud NejiPublished in: CoRR (2017)
Keyphrases
- website
- newspaper articles
- answering questions
- textual features
- open domain
- web applications
- question answer pairs
- content management
- question answer
- pairwise
- natural language text
- web pages
- specific domains
- user generated content
- semantic web
- web documents
- text corpus
- english words
- training corpus
- answer questions
- manually annotated
- web content
- question answering
- multiword
- plain text
- end users
- web technologies
- link analysis
- web resources
- natural language
- arabic language