Feature Engineering
392 papers with code • 1 benchmark • 5 datasets
Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table, with observations in the rows and features in the columns.
The traditional approach is to build features one at a time using domain knowledge — a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be rewritten for each new dataset.
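The workflow above — gathering multiple tables into one observation-level feature table — can be sketched with pandas. The table names and columns here (customers, transactions) are hypothetical, chosen only to illustrate the pattern:

```python
import pandas as pd

# Parent table: one row per observation (the entity we predict on).
customers = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "age": [34, 45, 23],
})

# Child table: many rows per customer, spread across a second table.
transactions = pd.DataFrame({
    "customer_id": [1, 1, 2, 3, 3, 3],
    "amount": [20.0, 35.0, 10.0, 5.0, 15.0, 25.0],
})

# Manual feature engineering: aggregate the child table down to
# one row per customer, naming each constructed feature explicitly.
agg = (
    transactions.groupby("customer_id")["amount"]
    .agg(txn_count="count", txn_total="sum", txn_mean="mean")
    .reset_index()
)

# Gather everything into a single feature table for model training.
features = customers.merge(agg, on="customer_id", how="left")
```

Each aggregate (count, sum, mean) is a hand-chosen feature; this per-dataset choice of aggregations and joins is exactly the code that must be rewritten for every new problem.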
Latest papers
Deep Learning Applications for Intrusion Detection in Network Traffic
The CNN-BiLSTM neural network is synthesized to assess the applicability of deep learning methods for intrusion detection.
Universal Time-Series Representation Learning: A Survey
Time-series data exists in every corner of real-world systems and services, ranging from satellites in the sky to wearable devices on human bodies.
TSPP: A Unified Benchmarking Tool for Time-series Forecasting
While machine learning has witnessed significant advancements, the emphasis has largely been on data acquisition and model creation.
Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation
The proposed architecture, Dual Attentive U-Net with Feature Infusion (DAU-FI Net), addresses challenges in semantic segmentation, particularly on multiclass imbalanced datasets with limited samples.
Graph Coordinates and Conventional Neural Networks -- An Alternative for Graph Neural Networks
We propose Topology Coordinate Neural Network (TCNN) and Directional Virtual Coordinate Neural Network (DVCNN) as novel and efficient alternatives to message-passing GNNs that directly leverage the graph's topology, sidestepping the computational challenges presented by competing algorithms.
Understanding learning from EEG data: Combining machine learning and feature engineering based on hidden Markov models and mixed models
Our findings suggest that standardising the theta EEG data and using deep neural networks enhances the classification of learner and non-learner subjects in a spatial learning task.
Auto deep learning for bioacoustic signals
This study investigates the potential of automated deep learning to enhance the accuracy and efficiency of multi-class classification of bird vocalizations, compared against traditional manually designed deep learning models.
Classification of Various Types of Damages in Honeycomb Composite Sandwich Structures using Guided Wave Structural Health Monitoring
We believe that we are the first to report numerical models for four types of damages in HCSS, which is followed up with experimental validation.
Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series
The key points of our methodology are: (a) transform the task to regression on sales for a single day; (b) information-rich feature engineering; (c) create a diverse set of state-of-the-art machine learning models; and (d) carefully construct validation sets for model tuning.
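Steps (c) and (d) — fitting diverse models and blending them on a held-out validation set — can be sketched in a few lines. This is a hedged illustration on synthetic data, not the paper's actual pipeline; the models, features, and blend weight are all hypothetical:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.neural_network import MLPRegressor

# Synthetic regression data standing in for engineered sales features.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = 2.0 * X[:, 0] + np.sin(X[:, 1]) + rng.normal(scale=0.1, size=300)

# Carefully separated validation set (step d).
X_train, y_train = X[:200], y[:200]
X_val, y_val = X[200:], y[200:]

# Two diverse model families (step c): boosted trees and a neural network.
gbt = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)
nn = MLPRegressor(hidden_layer_sizes=(32,), max_iter=1000,
                  random_state=0).fit(X_train, y_train)

# Simple convex blend of the point forecasts; in practice the
# weight w would itself be tuned on the validation set.
w = 0.5
blend = w * gbt.predict(X_val) + (1 - w) * nn.predict(X_val)
```

Blending works because tree ensembles and neural networks tend to make partially uncorrelated errors, so their average is often more accurate than either model alone.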
FASER: Binary Code Similarity Search through the use of Intermediate Representations
Being able to identify functions of interest in cross-architecture software is useful whether you are analysing for malware, securing the software supply chain or conducting vulnerability research.