Feature Engineering

392 papers with code • 1 benchmarks • 5 datasets

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Libraries

Use these libraries to find Feature Engineering models and implementations
6 papers
7,345
6 papers
780
6 papers
309
See all 12 libraries.

Subtasks


Universal Time-Series Representation Learning: A Survey

itouchz/awesome-deep-time-series-representations 8 Jan 2024

Time-series data exists in every corner of real-world systems and services, ranging from satellites in the sky to wearable devices on human bodies.

67
08 Jan 2024

TSPP: A Unified Benchmarking Tool for Time-series Forecasting

NVIDIA/DeepLearningExamples 28 Dec 2023

While machine learning has witnessed significant advancements, the emphasis has largely been on data acquisition and model creation.

12,599
28 Dec 2023

Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation

rashaalshawi/dual-attention-u-net-with-feature-infusion-pushing-the-boundaries-of-multiclass-defect-segmentation 21 Dec 2023

The proposed architecture, Dual Attentive U-Net with Feature Infusion (DAU-FI Net), addresses challenges in semantic segmentation, particularly on multiclass imbalanced datasets with limited samples.

9
21 Dec 2023

Graph Coordinates and Conventional Neural Networks -- An Alternative for Graph Neural Networks

i721/GraphCoordinates 3 Dec 2023

We propose Topology Coordinate Neural Network (TCNN) and Directional Virtual Coordinate Neural Network (DVCNN) as novel and efficient alternatives to message passing GNNs, that directly leverage the graph's topology, sidestepping the computational challenges presented by competing algorithms.

0
03 Dec 2023

Understanding learning from EEG data: Combining machine learning and feature engineering based on hidden Markov models and mixed models

gabrielrpalma/understandinglearningwithml 14 Nov 2023

Our findings suggest that standardising the theta EEG data and using deep neural networks enhances the classification of learner and non-learner subjects in a spatial learning task.

0
14 Nov 2023

Auto deep learning for bioacoustic signals

giuliotosato/autokeras-bioacustic 8 Nov 2023

This study investigates the potential of automated deep learning to enhance the accuracy and efficiency of multi-class classification of bird vocalizations, compared against traditional manually-designed deep learning models.

4
08 Nov 2023

Classification of Various Types of Damages in Honeycomb Composite Sandwich Structures using Guided Wave Structural Health Monitoring

shrutisawant099/damage-classification-using-feature-engineering 7 Nov 2023

We believe that we are the first to report numerical models for four types of damages in HCSS, which is followed up with experimental validation.

0
07 Nov 2023

Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series

IoannisNasios/M5_Uncertainty_3rd_place International Journal of Forecasting 2022

The keypoints of our methodology are: a) transform the task to regression on sales for a single day b) information rich feature engineering c) create a diverse set of state-of-the-art machine learning models and d) carefully construct validation sets for model tuning.

35
19 Oct 2023

FASER: Binary Code Similarity Search through the use of Intermediate Representations

br0kej/FASER 5 Oct 2023

Being able to identify functions of interest in cross-architecture software is useful whether you are analysing for malware, securing the software supply chain or conducting vulnerability research.

9
05 Oct 2023

Feature Interaction Aware Automated Data Representation Transformation

ehtesam3154/inhrecon 29 Sep 2023

Creating an effective representation space is crucial for mitigating the curse of dimensionality, enhancing model generalization, addressing data sparsity, and leveraging classical models more effectively.

0
29 Sep 2023