publications
Overall, in the last 5 years I have authored 40 papers which have 279 citations according to Google Scholar (h-index 10). Of those 35 are published in peer-reviewed conferences and journals, of which six are published in exclusive CORE A* ranked venues and another four in A ranked venues.
2025
-
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic DialoguesIn AAAI, 2025
-
Learning to generate and evaluate fact-checking explanations with transformersEngineering Applications of Artificial Intelligence, 2025
2024
-
A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the CHATGPT Era and BeyondIn EACL, 2024
-
Enriching the Metadata of Community-Generated Digital Content through Entity Linking: An Evaluative Comparison of State-of-the-Art ModelsIn LaTeCH-CLfL @ EACL, 2024
-
Our Heritage, Our Stories: developing AI tools to link and support community-generated digital cultural heritageJournal of Documentation, 2024
-
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and EvaluationIn ACL Findings, 2024
-
M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question AnsweringIn ACL Findings, 2024
-
From Outputs to Insights: A Survey of Rationalisation Approaches for Explainable Text ClassificationFrontiers in Artificial Intelligence, 2024
-
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading ComprehensionarXiv preprint arXiv:2408.05023, 2024
-
uMedSum: A Unified Framework for Advancing Medical Abstractive SummarizationarXiv preprint arXiv:2408.12095, 2024
-
LLMs are not Zero-Shot Reasoners for Biomedical Information ExtractionarXiv preprint arXiv:2408.12249, 2024
-
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers?In EMNLP, 2024
-
Representation Learning of Structured Data for Medical Foundation ModelsIn UniReps: 2nd Edition of the Workshop on Unifying Representations in Neural Models, 2024
-
Towards Explainable Multi-Label Text Classification: A Multi-Task Rationalisation Framework for Identifying Indicators of Forced LabourIn NLP4PI @ EMNLP, 2024
2023
-
Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel ClassificationarXiv preprint arXiv:2304.13998, 2023
-
Do You Hear The People Sing? Key Point Analysis via Iterative Clustering and Abstractive SummarisationIn ACL, 2023
-
A Two-Stage Decoder for Efficient ICD CodingIn ACL Findings, 2023
-
Global information-aware argument mining based on a top-down multi-turn QA modelInformation Processing & Management, 2023
-
Entity Coreference and Co-occurrence Aware Argument Mining from Biomedical LiteratureIn CODI @ ACL, 2023
-
Team: PULSAR at ProbSum 2023: PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients’ Problems and Data Augmentation with Black-box Large Language ModelsIn BioNLP @ ACL, 2023
-
Few-shot entity linking of food namesInformation Processing & Management, 2023
-
Are Machine Reading Comprehension Systems Robust to Context Paraphrasing?In AACL, 2023
-
Mmt’s submission for the wmt 2023 quality estimation shared taskIn WMT @ EMNLP, 2023
-
Argument mining as a multi-hop generative machine reading comprehension taskIn EMNLP Findings, 2023
-
Automated Clinical Coding for Outpatient DepartmentsarXiv preprint arXiv:2312.13533, 2023
-
Pulsar at mediqa-sum 2023: Large language models augmented by synthetic dialogue convert patient dialogues to medical recordsIn ImageClef @ CLEF, 2023
2022
-
WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign LanguageIn ACL, 2022
-
A survey of methods for revealing and overcoming weaknesses of data-driven Natural Language UnderstandingNatural Language Engineering, 2022
-
RaFoLa: A Rationale-Annotated Corpus for Detecting Indicators of Forced LabourIn LREC, 2022
-
Incorporating Zoning Information into Argument Mining from Biomedical LiteratureIn LREC, 2022
-
‘Am I the Bad One’? Predicting the Moral Judgement of the Crowd Using Pre–trained Language ModelsIn LREC, 2022
-
Can Transformers Reason in Fragments of Natural Language?In EMNLP, 2022
-
Towards Human-Centred Explainability Benchmarks For Text ClassificationNEATCLasS @ ICWSM, 2022
2021
-
Semantics Altering Modifications for Evaluating Comprehension in Machine ReadingIn AAAI, 2021
-
Is the Understanding of Explicit Discourse Relations Required in Machine Reading Comprehension?In EACL, 2021
-
Emerging evaluation paradigms in natural language understanding: a case study in machine reading comprehension2021
2020
-
A framework for evaluation of machine reading comprehension gold standardsIn LREC, 2020
2019
-
Vajra: step-by-step programming with natural languageIn IUI, 2019
-
Identifying Supporting Facts for Multi-hop Question Answering with Document Graph NetworksIn TextGraphs-13 @ EMNLP, 2019
-
DBee: A Database for Creating and Managing Knowledge Graphs and EmbeddingsIn TextGraphs-13 @ EMNLP, 2019