Search Results for author: Natraj Raman

Found 10 papers, 0 papers with code

Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

no code implementations9 Apr 2024 Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

We propose a computational framework for characterizing multimodal long-form summarization and investigate the behavior of Claude 2. 0/2. 1, GPT-4/3. 5, and Command.

Hallucination Position +1

DocLLM: A layout-aware generative language model for multimodal document understanding

no code implementations31 Dec 2023 Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities.

document understanding Language Modelling

Synthetic Text Generation using Hypergraph Representations

no code implementations6 Sep 2023 Natraj Raman, Sameena Shah

Generating synthetic variants of a document is often posed as text-to-text transformation.

Hypergraph representations Text Generation

Bayesian Hierarchical Models for Counterfactual Estimation

no code implementations21 Jan 2023 Natraj Raman, Daniele Magazzeni, Sameena Shah

Counterfactual explanations utilize feature perturbations to analyze the outcome of an original decision and recommend an actionable recourse.

counterfactual Fairness +1

WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain

no code implementations31 Oct 2022 Raj Sanjay Shah, Kunal Chawla, Dheeraj Eidnani, Agam Shah, Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang

To this end, we contribute the Financial Language Understanding Evaluation (FLUE), an open-source comprehensive suite of benchmarks for the financial domain.

FLUE Language Modelling

Structure and Semantics Preserving Document Representations

no code implementations11 Jan 2022 Natraj Raman, Sameena Shah, Manuela Veloso

Retrieving relevant documents from a corpus is typically based on the semantic similarity between the document content and query text.

Metric Learning Retrieval +2

Synthetic Document Generator for Annotation-free Layout Recognition

no code implementations11 Nov 2021 Natraj Raman, Sameena Shah, Manuela Veloso

Analyzing the layout of a document to identify headers, sections, tables, figures etc.

Robust Document Representations using Latent Topics and Metadata

no code implementations23 Oct 2020 Natraj Raman, Armineh Nourbakhsh, Sameena Shah, Manuela Veloso

Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems.

Document Classification Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.