Skip to main content

Research Repository

Advanced Search

All Outputs (10)

Context-aware data-to-text generation. (2024)
Thesis
UPADHYAY, A. 2024. Context-aware data-to-text generation. Robert Gordon University, PhD thesis. Hosted on OpenAIR [online]. Available from: https://doi.org/10.48526/rgu-wt-2571408

Data-to-Text Generation (D2T) is the subfield of Artificial Intelligence (AI) and Natural Language Processing (NLP) that aims to build systems capable of summarising nonlinguistic structured data into textual reports. D2T systems extract important in... Read More about Context-aware data-to-text generation..

A case-based approach to data-to-text generation. [Software]
Digital Artefact
UPADHYAY, A., MASSIE, S., SINGH, R.K., GUPTA, G. and OJHA, M. 2021. A case-based approach to data-to-text generation. [Software]. Hosted on GitHub [online]. Available from: https://github.com/ashishu007/data2text-cbr

Traditional Data-to-Text Generation (D2T) systems utilise carefully crafted domain specific rules and templates to generate high quality accurate texts. More recent approaches use neural systems to learn domain rules from the training data to produce... Read More about A case-based approach to data-to-text generation. [Software].

WEC: weighted ensemble of text classifiers.
Presentation / Conference Contribution
UPADHYAY, A., NGUYEN, T.T., MASSIE, S. and MCCALL, J. 2020. WEC: weighted ensemble of text classifiers. In Proceedings of 2020 Institute of Electrical and Electronics Engineers (IEEE) congress on evolutionary computation (IEEE CEC 2020), part of the 2020 (IEEE) World congress on computational intelligence (IEEE WCCI 2020) and co-located with the 2020 International joint conference on neural networks (IJCNN 2020) and the 2020 IEEE International fuzzy systems conference (FUZZ-IEEE 2020), 19-24 July 2020, Glasgow, UK [virtual conference]. Piscataway: IEEE [online], article ID 9185641. Available from: https://doi.org/10.1109/CEC48606.2020.9185641

Text classification is one of the most important tasks in the field of Natural Language Processing. There are many approaches that focus on two main aspects: generating an effective representation; and selecting and refining algorithms to build the c... Read More about WEC: weighted ensemble of text classifiers..

Case-based approach to automated natural language generation for obituaries.
Presentation / Conference Contribution
UPADHYAY, A., MASSIE, S. and CLOGHER, S. 2020. Case-based approach to automated natural language generation for obituaries. In Watson, I. and Weber, R. (eds.) Case-based reasoning research and development: proceedings of the 28th International conference on case-based reasoning research and development (ICCBR 2020), 8-12 June 2020, Salamanca, Spain [virtual conference]. Lecture notes in computer science, 12311. Cham: Springer [online], pages 279-294. Available from: https://doi.org/10.1007/978-3-030-58342-2_18

Automated generation of human readable text from structured information is challenging because grammatical rules are complex making good quality outputs difficult to achieve. Textual Case-Based Reasoning provides one approach in which the text from p... Read More about Case-based approach to automated natural language generation for obituaries..

GEMv2: multilingual NLG benchmarking in a single line of code.
Presentation / Conference Contribution
GEHRMANN, S., BHATTACHARJEE, A., MAHENDIRAN, A., WANG, A., PAPANGELIS, A., MADAAN, A., MCMILLAN-MAJOR, A., SHVETS, A., UPADHYAY, A. and BOHNET, B. 2022. GEMv2: multilingual NLG benchmarking in a single line of code. In Proceedings of the 2022 Conference on empirical methods in natural language processing: system demonstrations, 7-11 December 2022, Abu Dhabi, UAE. Stroudsburg: Association for Computational Linguistics [online], pages 266-281. Available from: https://aclanthology.org/2022.emnlp-demos.27/

Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favor of remaining compatible with prior work. The compatibility, often facilitated through leaderboards, thus leads to outdated but standardized evaluati... Read More about GEMv2: multilingual NLG benchmarking in a single line of code..

A case-based approach to data-to-text generation.
Presentation / Conference Contribution
UPADHYAY, A., MASSIE, S., SINGH, R.K., GUPTA, G. and OJHA, M. 2021. A case-based approach to data-to-text generation. In Sánchez-Ruiz, A.A. and Floyd, M.W. (eds.) Case-based reasoning research and development: proceedings of 29th International conference case-based reasoning 2021 (ICCBR 2021), 13-16 September 2021, Salamanca, Spain. Lecture notes in computer science (LNCS), 12877. Cham: Springer [online], pages 232-247. Available from: https://doi.org/10.1007/978-3-030-86957-1_16

Traditional Data-to-Text Generation (D2T) systems utilise carefully crafted domain specific rules and templates to generate high quality accurate texts. More recent approaches use neural systems to learn domain rules from the training data to produce... Read More about A case-based approach to data-to-text generation..

A case-based approach for content planning in data-to-text generation.
Presentation / Conference Contribution
UPADHYAY, A. and MASSIE, S. 2022. A case-based approach for content planning in data-to-text generation. In Keane, M.T. and Wiratunga, N. (eds.) Case-based reasoning research and development: proceedings of the 30th International conference on case-based reasoning (ICCBR 2022), 12-15 September 2022, Nancy, France. Lecture notes in computer science, 13405. Cham: Springer [online], pages 380-394. Available from: https://doi.org/10.1007/978-3-031-14923-8_25

The problem of Data-to-Text Generation (D2T) is usually solved using a modular approach by breaking the generation process into some variant of planning and realisation phases. Traditional methods have been very good at producing high quality texts b... Read More about A case-based approach for content planning in data-to-text generation..

Content type profiling of data-to-text generation datasets.
Presentation / Conference Contribution
UPADHYAY, A. and MASSIE, S. 2022. Content type profiling of data-to-text generation datasets. In N. Calzolari, C.-R. Huang, H. Kim. et al. (eds.) Proceedings of the 29th International conference on computational linguistics (COLING 2022), 12-17 October 2022, Gyeongju, Republic of Korea. Stroudsburg, PA: International Committee on Computational Linguistics [online], 29(1), pages 5770–5782. Available from: https://aclanthology.org/2022.coling-1.pdf

Data-to-Text Generation (D2T) problems can be considered as a stream of time-stamped events with a text summary being produced for each. The problem becomes more challenging when event summaries contain complex insights derived from multiple records... Read More about Content type profiling of data-to-text generation datasets..

Machine learning for risk stratification of diabetic foot ulcers using biomarkers.
Presentation / Conference Contribution
MARTIN, K., UPADHYAY, A., WIJEKOON, A., WIRATUNGA, N. and MASSIE, S. 2023. Machine learning for risk stratification of diabetic foot ulcers using biomarkers. In Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational science: proceedings of the 23rd International conference on computational science 2023 (ICCS 2023): computing at the cutting edge of science (ICCS 2023), 3-5 July 2023, Prague, Czech Republic: [virtual event]. Lecture notes in computer science, 14075. Cham: Springer [online], part III, pages 153-161. Available from: https://doi.org/10.1007/978-3-031-36024-4_11

Development of a Diabetic Foot Ulcer (DFU) causes a sharp decline in a patient's health and quality of life. The process of risk stratification is crucial for informing the care that a patient should receive to help manage their Diabetes before an ul... Read More about Machine learning for risk stratification of diabetic foot ulcers using biomarkers..

CBR assisted context-aware surface realisation for data-to-text generation.
Presentation / Conference Contribution
UPADHYAY, A. and MASSIE, S. 2023. CBR assisted context-aware surface realisation for data-to-text generation. In MASSIE, S. and CHAKRABORTI, S. (eds.) 2023. Case-based reasoning research and development: proceedings of the 31st International conference on case-based reasoning 2023, (ICCBR 2023), 17-20 July 2023, Aberdeen, UK. Lecture notes in computer science (LNCS), 14141. Cham: Springer [online], pages 34-49. Available from: https://doi.org/10.1007/978-3-031-40177-0_3

Current state-of-the-art neural systems for Data-to-Text Generation (D2T) struggle to generate content from past events with interesting insights. This is because these systems have limited access to historic data and can also hallucinate inaccurate... Read More about CBR assisted context-aware surface realisation for data-to-text generation..