no code implementations • 26 Oct 2023 • Farima Fatahi Bayat, Kun Qian, Benjamin Han, Yisi Sang, Anton Belyi, Samira Khorshidi, Fei Wu, Ihab F. Ilyas, Yunyao Li
Detecting factual errors in textual information, whether generated by large language models (LLM) or curated by humans, is crucial for making informed decisions.
no code implementations • 16 May 2023 • Ihab F. Ilyas, JP Lacerda, Yunyao Li, Umar Farooq Minhas, Ali Mousavi, Jeffrey Pound, Theodoros Rekatsinas, Chiraag Sumanth
We then describe how our platform, including graph embeddings, can be leveraged to create a Semantic Annotation service that links unstructured Web documents to entities in our KG.
no code implementations • 4 Apr 2023 • Jason Mohoney, Anil Pacaci, Shihabur Rahman Chowdhury, Ali Mousavi, Ihab F. Ilyas, Umar Farooq Minhas, Jeffrey Pound, Theodoros Rekatsinas
Motivated by the tasks of finding related KG queries and entities for past KG query workloads, we focus on hybrid vector similarity search (hybrid queries for short) where part of the query corresponds to vector similarity search and part of the query corresponds to predicates over relational attributes associated with the underlying data vectors.
no code implementations • 15 Apr 2022 • Ihab F. Ilyas, Theodoros Rekatsinas, Vishnu Konda, Jeffrey Pound, Xiaoguang Qi, Mohamed Soliman
We introduce Saga, a next-generation knowledge construction and serving platform for powering knowledge-based applications at industrial scale.
1 code implementation • 2 Jun 2021 • Sahaana Suri, Ihab F. Ilyas, Christopher Ré, Theodoros Rekatsinas
Context enrichment, or rebuilding fragmented context, using keyless joins is an implicit or explicit step in machine learning (ML) pipelines over structured data sources.
1 code implementation • 31 Dec 2020 • Chang Ge, Shubhankar Mohapatra, Xi He, Ihab F. Ilyas
Existing differentially private data synthesis methods aim to generate useful data based on applications, but they fail in keeping one of the most fundamental data properties of the structured data -- the underlying correlations and dependencies among tuples and attributes (i. e., the structure of the data).
Databases Cryptography and Security
no code implementations • 9 Nov 2020 • Paulo H. Oliveira, Daniel S. Kaster, Caetano Traina-Jr., Ihab F. Ilyas
Lack of data and data quality issues are among the main bottlenecks that prevent further artificial intelligence adoption within many organizations, pushing data scientists to spend most of their time cleaning data before being able to answer analytical questions.
no code implementations • 24 Aug 2020 • Alireza Heidari, Shrinu Kushagra, Ihab F. Ilyas
Our goal is to develop a procedure that samples uniformly from the set of entities present in the database in the presence of duplicates.
no code implementations • 18 Jun 2020 • Alireza Heidari, George Michalopoulos, Shrinu Kushagra, Ihab F. Ilyas, Theodoros Rekatsinas
We use this feature vector alongwith the ground-truth information to learn a classifier for each of the attributes of the database.
no code implementations • WS 2019 • Ryan Clancy, Ihab F. Ilyas, Jimmy Lin
We present a scalable, open-source platform that {``}distills{''} a potentially large text collection into a knowledge graph.
no code implementations • 29 Jun 2019 • Alireza Heidari, Ihab F. Ilyas, Theodoros Rekatsinas
We study the problem of recovering the latent ground truth labeling of a structured instance with categorical random variables in the presence of noisy observations.
1 code implementation • 15 Mar 2019 • Michael Azmy, Peng Shi, Jimmy Lin, Ihab F. Ilyas
This paper explores the problem of matching entities across different knowledge graphs.
1 code implementation • 29 Dec 2017 • Chang Ge, Xi He, Ihab F. Ilyas, Ashwin Machanavajjhala
Organizations are increasingly interested in allowing external data scientists to explore their sensitive datasets.
Databases