Search Results

Data Interpreter: An LLM Agent For Data Science

1 code implementation28 Feb 2024

Large Language Model (LLM)-based agents have demonstrated remarkable effectiveness.

Language Modelling Large Language Model +1

Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science

3 code implementations20 Mar 2016

As the field of data science continues to grow, there will be an ever-increasing demand for tools that make machine learning accessible to non-experts.

Automated Feature Engineering BIG-bench Machine Learning +2

Automating biomedical data science through tree-based pipeline optimization

1 code implementation28 Jan 2016

Over the past decade, data science and machine learning has grown from a mysterious art form to a staple tool across a variety of fields in academia, business, and government.

BIG-bench Machine Learning General Classification +1

Identifying and Harnessing the Building Blocks of Machine Learning Pipelines for Sensible Initialization of a Data Science Automation Tool

1 code implementation29 Jul 2016

In this chapter, we present a genetic programming-based AutoML system called TPOT that optimizes a series of feature preprocessors and machine learning models with the goal of maximizing classification accuracy on a supervised classification problem.

AutoML BIG-bench Machine Learning +2

Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows

1 code implementation30 Apr 2021

Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data.

Databases Human-Computer Interaction

Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence

2 code implementations12 Feb 2020

Smarter applications are making better use of the insights gleaned from data, having an impact on every industry and research discipline.

BIG-bench Machine Learning

Xorbits: Automating Operator Tiling for Distributed Data Science

1 code implementation29 Dec 2023

However, existing systems often struggle with processing large datasets due to Out-of-Memory (OOM) problems caused by poor data partitioning.

Distributed, Parallel, and Cluster Computing

Syft 0.5: A Platform for Universally Deployable Structured Transparency

1 code implementation26 Apr 2021

We present Syft 0. 5, a general-purpose framework that combines a core group of privacy-enhancing technologies that facilitate a universal set of structured transparency systems.

Privacy Preserving

A generic framework for privacy preserving deep learning

4 code implementations9 Nov 2018

We detail a new framework for privacy preserving deep learning and discuss its assets.

Federated Learning Privacy Preserving +1