Search Results for author: Andriy Mulyar

Found 5 papers, 4 papers with code

Nomic Embed: Training a Reproducible Long Context Text Embedder

1 code implementation2 Feb 2024 Zach Nussbaum, John X. Morris, Brandon Duderstadt, Andriy Mulyar

This technical report describes the training of nomic-embed-text-v1, the first fully reproducible, open-source, open-weights, open-data, 8192 context length English text embedding model that outperforms both OpenAI Ada-002 and OpenAI text-embedding-3-small on short and long-context tasks.

GPT4All: An Ecosystem of Open Source Compressed Language Models

1 code implementation6 Nov 2023 Yuvanesh Anand, Zach Nussbaum, Adam Treat, Aaron Miller, Richard Guo, Ben Schmidt, GPT4All Community, Brandon Duderstadt, Andriy Mulyar

It is our hope that this paper acts as both a technical overview of the original GPT4All models as well as a case study on the subsequent growth of the GPT4All open source ecosystem.

Clinical Concept Linking with Contextualized Neural Representations

no code implementations ACL 2020 Elliot Schumacher, Andriy Mulyar, Mark Dredze

We propose an approach to concept linking that leverages recent work in contextualized neural models, such as ELMo (Peters et al. 2018), which create a token representation that integrates the surrounding context of the mention and concept name.

Entity Linking

MT-Clinical BERT: Scaling Clinical Information Extraction with Multitask Learning

2 code implementations21 Apr 2020 Andriy Mulyar, Bridget T. McInnes

Clinical notes contain an abundance of important but not-readily accessible information about patients.

Entity Extraction using GAN

Cannot find the paper you are looking for? You can Submit a new open access paper.