Search Results for author: Eugene Wu

Found 13 papers, 5 papers with code

JoinBoost: Grow Trees Over Normalized Data Using Only SQL

no code implementations1 Jul 2023 Zezhou Huang, Rathijit Sen, Jiaxiang Liu, Eugene Wu

Although dominant for tabular data, ML libraries that train tree models over normalized databases (e. g., LightGBM, XGBoost) require the data to be denormalized as a single table, materialized, and exported.

NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries

no code implementations19 Sep 2022 Yiru Chen, Ryan Li, Austin Mac, Tianbao Xie, Tao Yu, Eugene Wu

We develop NL2INTERFACE to explore the potential of generating usable interactive multi-visualization interfaces from natural language queries.

Natural Language Queries

Enabling SQL-based Training Data Debugging for Federated Learning

no code implementations26 Aug 2021 Yejia Liu, Weiyuan Wu, Lampros Flokas, Jiannan Wang, Eugene Wu

The SQL-based training data debugging framework has proved effective to fix this kind of issue in a non-federated learning setting.

Federated Learning

Explaining Inference Queries with Bayesian Optimization

1 code implementation10 Feb 2021 Brandon Lockhart, Jinglin Peng, Weiyuan Wu, Jiannan Wang, Eugene Wu

BO - a technique for finding the global optimum of a black-box function - is used to find the best predicate.

Bayesian Optimization

Continuous Prefetch for Interactive Data Applications

1 code implementation15 Jul 2020 Haneen Mohammed, Ziyun Wei, Eugene Wu, Ravi Netravali

Interactive data visualization and exploration (DVE) applications are often network-bottlenecked due to bursty request patterns, large response sizes, and heterogeneous deployments over a range of networks and devices.

Databases

Complaint-driven Training Data Debugging for Query 2.0

1 code implementation12 Apr 2020 Weiyuan Wu, Lampros Flokas, Eugene Wu, Jiannan Wang

As the need for machine learning (ML) increases rapidly across all industry sectors, there is a significant interest among commercial database providers to support "Query 2. 0", which integrates model inference into SQL queries.

Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces

no code implementations7 Jan 2020 Yiru Chen, Eugene Wu

Interactive tools like user interfaces help democratize data access for end-users by hiding underlying programming details and exposing the necessary widget interface to users.

AlphaClean: Automatic Generation of Data Cleaning Pipelines

1 code implementation26 Apr 2019 Sanjay Krishnan, Eugene Wu

The analyst effort in data cleaning is gradually shifting away from the design of hand-written scripts to building and tuning complex pipelines of automated data cleaning libraries.

Databases

Smoke: Fine-grained Lineage at Interactive Speed

no code implementations22 Jan 2018 Fotis Psallidas, Eugene Wu

Our experiments on real-world applications highlight that Smoke can meet the latency requirements of interactive visualizations (e. g., <150ms) and outperform hand-written implementations of data profiling primitives.

Databases

ActiveClean: Interactive Data Cleaning While Learning Convex Loss Models

no code implementations15 Jan 2016 Sanjay Krishnan, Jiannan Wang, Eugene Wu, Michael J. Franklin, Ken Goldberg

Data cleaning is often an important step to ensure that predictive models, such as regression and classification, are not affected by systematic errors such as inconsistent, out-of-date, or outlier data.

Active Learning EEG +1

Indexing Cost Sensitive Prediction

no code implementations15 Aug 2014 Leilani Battle, Edward Benson, Aditya Parameswaran, Eugene Wu

We develop algorithms and indexes to support cost-sensitive prediction, i. e., making decisions using machine learning models taking feature evaluation cost into account.

BIG-bench Machine Learning Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.