ASHISH UPADHYAY a.upadhyay@rgu.ac.uk
Research Student
A case-based approach to data-to-text generation. [Software]
Upadhyay, Ashish; Massie, Stewart; Singh, Ritwik Kumar; Gupta, Garima; Ojha, Muneendra
Authors
Dr Stewart Massie s.massie@rgu.ac.uk
Associate Professor
Ritwik Kumar Singh
Garima Gupta
Muneendra Ojha
Abstract
Traditional Data-to-Text Generation (D2T) systems utilise carefully crafted domain specific rules and templates to generate high quality accurate texts. More recent approaches use neural systems to learn domain rules from the training data to produce very fluent and diverse texts. However, there is a trade-off with rule-based systems producing accurate text but that may lack variation, while learning-based systems produce more diverse texts but often with poorer accuracy. This code has been used to help propose a case-based approach for D2T, which mitigates the impact of this trade-off by dynamically selecting templates from the training corpora. In our approach we develop a novel case-alignment based, feature weighing method that is used to build an effective similarity measure. Extensive experimentation is performed on a sports domain dataset. Through Extractive Evaluation metrics, we demonstrate the benefit of the CBR system over a rule-based baseline and a neural benchmark. The file accompanying this OpenAIR record contains a link to where the code is held on GitHub. The GitHub repository also includes information on how to use the code.
Citation
UPADHYAY, A., MASSIE, S., SINGH, R.K., GUPTA, G. and OJHA, M. 2021. A case-based approach to data-to-text generation. [Software]. Hosted on GitHub [online]. Available from: https://github.com/ashishu007/data2text-cbr
Digital Artefact Type | Software |
---|---|
Deposit Date | Nov 2, 2021 |
Publicly Available Date | Nov 4, 2021 |
Keywords | Case-based reasoning (CBR); Natural language generation (NLG); Natural language processing; Data-to-text; Textual case-based reasoning (Textual CBR); Feature weighting |
Public URL | https://rgu-repository.worktribe.com/output/1512837 |
Publisher URL | https://github.com/ashishu007/data2text-cbr |
Related Public URLs | https://rgu-repository.worktribe.com/output/1482039 |
Files
UPADHYAY 2021 A case-based approach to data (SOFTWARE - LINK ONLY)
(3 Kb)
Other
Related Outputs
A case-based approach to data-to-text generation.
(2021)
Conference Proceeding
You might also like
Content type profiling of data-to-text generation datasets.
(2022)
Conference Proceeding
A case-based approach for content planning in data-to-text generation.
(2022)
Conference Proceeding
A case-based approach to data-to-text generation.
(2021)
Conference Proceeding
Visualisation to explain personal health trends in smart homes.
(2021)
Presentation / Conference
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search