no code implementations • 5 Dec 2023 • Tejit Pabari, Beth Tellman, Giannis Karamanolakis, Mitchell Thomas, Max Mauerman, Eugene Wu, Upmanu Lall, Marco Tedesco, Michael S Steckler, Paolo Colosio, Daniel E Osgood, Melody Braun, Jens de Bruijn, Shammun Islam
In this work, we explore a novel approach for supporting satellite-based flood index insurance by extracting high-resolution spatio-temporal information from news media.
no code implementations • 1 Jul 2023 • Zezhou Huang, Rathijit Sen, Jiaxiang Liu, Eugene Wu
Although dominant for tabular data, ML libraries that train tree models over normalized databases (e. g., LightGBM, XGBoost) require the data to be denormalized as a single table, materialized, and exported.
no code implementations • 19 Sep 2022 • Yiru Chen, Ryan Li, Austin Mac, Tianbao Xie, Tao Yu, Eugene Wu
We develop NL2INTERFACE to explore the potential of generating usable interactive multi-visualization interfaces from natural language queries.
1 code implementation • 31 Dec 2021 • Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang
We automatically synthesize programs using few-shot learning and OpenAI's Codex transformer and execute them to solve course problems at 81% automatic accuracy.
no code implementations • 26 Aug 2021 • Yejia Liu, Weiyuan Wu, Lampros Flokas, Jiannan Wang, Eugene Wu
The SQL-based training data debugging framework has proved effective to fix this kind of issue in a non-federated learning setting.
1 code implementation • 10 Feb 2021 • Brandon Lockhart, Jinglin Peng, Weiyuan Wu, Jiannan Wang, Eugene Wu
BO - a technique for finding the global optimum of a black-box function - is used to find the best predicate.
1 code implementation • 15 Jul 2020 • Haneen Mohammed, Ziyun Wei, Eugene Wu, Ravi Netravali
Interactive data visualization and exploration (DVE) applications are often network-bottlenecked due to bursty request patterns, large response sizes, and heterogeneous deployments over a range of networks and devices.
Databases
1 code implementation • 12 Apr 2020 • Weiyuan Wu, Lampros Flokas, Eugene Wu, Jiannan Wang
As the need for machine learning (ML) increases rapidly across all industry sectors, there is a significant interest among commercial database providers to support "Query 2. 0", which integrates model inference into SQL queries.
no code implementations • 7 Jan 2020 • Yiru Chen, Eugene Wu
Interactive tools like user interfaces help democratize data access for end-users by hiding underlying programming details and exposing the necessary widget interface to users.
1 code implementation • 26 Apr 2019 • Sanjay Krishnan, Eugene Wu
The analyst effort in data cleaning is gradually shifting away from the design of hand-written scripts to building and tuning complex pipelines of automated data cleaning libraries.
Databases
no code implementations • 22 Jan 2018 • Fotis Psallidas, Eugene Wu
Our experiments on real-world applications highlight that Smoke can meet the latency requirements of interactive visualizations (e. g., <150ms) and outperform hand-written implementations of data profiling primitives.
Databases
no code implementations • 15 Jan 2016 • Sanjay Krishnan, Jiannan Wang, Eugene Wu, Michael J. Franklin, Ken Goldberg
Data cleaning is often an important step to ensure that predictive models, such as regression and classification, are not affected by systematic errors such as inconsistent, out-of-date, or outlier data.
no code implementations • 15 Aug 2014 • Leilani Battle, Edward Benson, Aditya Parameswaran, Eugene Wu
We develop algorithms and indexes to support cost-sensitive prediction, i. e., making decisions using machine learning models taking feature evaluation cost into account.