Feature Engineering

390 papers with code • 1 benchmarks • 5 datasets

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Benchmarks

Add a Result

These leaderboards are used to track progress in Feature Engineering

Trend	Dataset	Best Model	Paper	Code	Compare
	2019_test set	CNN			See all

Libraries

Use these libraries to find Feature Engineering models and implementations

shenweichen/DeepCTR

6 papers

7,338

xue-pai/FuxiCTR

6 papers

773

UlionTse/mlgb

6 papers

305

DataCanvasIO/DeepTables

4 papers

635

See all 12 libraries.

Datasets

Subtasks

Imputation

Latest papers

Most implemented Social Latest No code

Large-Scale Multi-Domain Recommendation: an Automatic Domain Feature Extraction and Personalized Integration Framework

faceonlive/ai-research • 12 Apr 2024

Besides, by personalized integration of domain features from other domains for each user and the innovation in the training mode, the DFEI framework can yield more accurate conversion identification.

131

12 Apr 2024

Paper
Code

Predicting Mergers and Acquisitions: Temporal Dynamic Industry Networks

dayuyang1999/merger_acquisition_prediction • • 10 Apr 2024

M&A activities are pivotal for market consolidation, enabling firms to augment market power through strategic complementarities.

10 Apr 2024

Paper
Code

Leveraging Latents for Efficient Thermography Classification and Segmentation

faceonlive/ai-research • 9 Apr 2024

In this work, we present a novel algorithm for both breast cancer classification and segmentation.

131

09 Apr 2024

Paper
Code

A Two Dimensional Feature Engineering Method for Relation Extraction

faceonlive/ai-research • 7 Apr 2024

The results indicate that two-dimensional feature engineering can take advantage of a two-dimensional sentence representation and make full use of prior knowledge in traditional feature engineering.

131

07 Apr 2024

Paper
Code

Predictive Analytics of Varieties of Potatoes

fabstat/burbank • 4 Apr 2024

We explore the application of machine learning algorithms to predict the suitability of Russet potato clones for advancement in breeding trials.

04 Apr 2024

Paper
Code

Iterative Feature Boosting for Explainable Speech Emotion Recognition

alaaNfissi/Iterative-Feature-Boosting-for-Explainable-Speech-Emotion-Recognition • International Conference on Machine Learning and Applications (ICMLA) 2024

In speech emotion recognition (SER), using pre- defined features without considering their practical importance may lead to high dimensional datasets, including redundant and irrelevant information.

19 Mar 2024

Paper
Code

Machine Learning-Based Completions Sequencing for Well Performance Optimization

anjieliu121/well_performance_optimization • 23 Feb 2024

Establishing accurate field development parameters to optimize long-term oil production takes time and effort due to the complexity of oil well development, and the uncertainty in estimating long-term well production.

23 Feb 2024

Paper
Code

Descriptive Kernel Convolution Network with Improved Random Walk Kernel

mengchillee/rwk_plus • • 8 Feb 2024

In this paper, we first revisit the RWK and its current usage in KCNs, revealing several shortcomings of the existing designs, and propose an improved graph kernel RWK+, by introducing color-matching random walks and deriving its efficient computation.

08 Feb 2024

Paper
Code