Feature Engineering

392 papers with code • 1 benchmarks • 5 datasets

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Benchmarks

Add a Result

These leaderboards are used to track progress in Feature Engineering

Trend	Dataset	Best Model	Paper	Code	Compare
	2019_test set	CNN			See all

Libraries

Use these libraries to find Feature Engineering models and implementations

shenweichen/DeepCTR

6 papers

7,346

xue-pai/FuxiCTR

6 papers

781

UlionTse/mlgb

6 papers

310

DataCanvasIO/DeepTables

4 papers

635

See all 12 libraries.

Datasets

Subtasks

Imputation

Most implemented papers

Most implemented Social Latest No code

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

qianguih/voxelnet • • CVPR 2018

Accurate detection of objects in 3D point clouds is a central problem in many applications, such as autonomous navigation, housekeeping robots, and augmented/virtual reality.

Paper
Code

Wide & Deep Learning for Recommender Systems

microsoft/recommenders • • 24 Jun 2016

Memorization of feature interactions through a wide set of cross-product feature transformations are effective and interpretable, while generalization requires more feature engineering effort.

Paper
Code

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

guillaumegenthial/sequence_tagging • • ACL 2016

State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing.

Paper
Code

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

xue-pai/FuxiCTR • • 13 Mar 2017

Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems.

Paper
Code

Deep & Cross Network for Ad Click Predictions

shenweichen/DeepCTR • • 17 Aug 2017

Feature engineering has been the key to the success of many prediction models.

Paper
Code

Named Entity Recognition with Bidirectional LSTM-CNNs

flairNLP/flair • • TACL 2016

Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance.

Paper
Code

DeepFM: An End-to-End Wide & Deep Learning Framework for CTR Prediction

shenweichen/DeepCTR • • 12 Apr 2018

In this paper, we study two instances of DeepFM where its "deep" component is DNN and PNN respectively, for which we denote as DeepFM-D and DeepFM-P. Comprehensive experiments are conducted to demonstrate the effectiveness of DeepFM-D and DeepFM-P over the existing models for CTR prediction, on both benchmark data and commercial data.

Paper
Code

Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data

Atomu2014/product-nets-distributed • • 1 Jul 2018

User response prediction is a crucial component for personalized information retrieval and filtering scenarios, such as recommender system and web search.

Paper
Code