Feature Engineering

392 papers with code • 1 benchmarks • 5 datasets

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Benchmarks

Add a Result

These leaderboards are used to track progress in Feature Engineering

Trend	Dataset	Best Model	Paper	Code	Compare
	2019_test set	CNN			See all

Libraries

Use these libraries to find Feature Engineering models and implementations

shenweichen/DeepCTR

6 papers

7,346

xue-pai/FuxiCTR

6 papers

781

UlionTse/mlgb

6 papers

310

DataCanvasIO/DeepTables

4 papers

635

See all 12 libraries.

Datasets

Subtasks

Imputation

Latest papers with no code

Most implemented Social Latest No code

Large Language Models for Networking: Workflow, Advances and Challenges

no code yet • 19 Apr 2024

The networking field is characterized by its high complexity and rapid iteration, requiring extensive expertise to accomplish network tasks, ranging from network design, diagnosis, configuration and security.

Paper
Add Code

TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches

no code yet • 18 Apr 2024

This study employs deep learning techniques to explore four speaker profiling tasks on the TIMIT dataset, namely gender classification, accent classification, age estimation, and speaker identification, highlighting the potential and challenges of multi-task learning versus single-task models.

Paper
Add Code

PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network

no code yet • 16 Apr 2024

In this study, we propose PreGSU, a generalized pre-trained scene understanding model based on graph attention network to learn the universal interaction and reasoning of traffic scenes to support various downstream tasks.

Paper
Add Code

Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification

no code yet • 16 Apr 2024

Light curves serve as a valuable source of information on stellar formation and evolution.

Paper
Add Code

Survey on Embedding Models for Knowledge Graph and its Applications

no code yet • 14 Apr 2024

Knowledge Graph (KG) is a graph based data structure to represent facts of the world where nodes represent real world entities or abstract concept and edges represent relation between the entities.

Paper
Add Code

Transform then Explore: a Simple and Effective Technique for Exploratory Combinatorial Optimization with Reinforcement Learning

no code yet • 6 Apr 2024

The technique is originated from physics, but is very effective in enabling RL agents to explore to continuously improve the solutions during test.

Paper
Add Code

Sentiment analysis and random forest to classify LLM versus human source applied to Scientific Texts

no code yet • 5 Apr 2024

After the launch of ChatGPT v. 4 there has been a global vivid discussion on the ability of this artificial intelligence powered platform and some other similar ones for the automatic production of all kinds of texts, including scientific and technical texts.

Paper
Add Code

The Death of Feature Engineering? BERT with Linguistic Features on SQuAD 2.0

no code yet • 4 Apr 2024

We conclude that the BERT base model will be improved by incorporating the features.

Paper
Add Code

AI WALKUP: A Computer-Vision Approach to Quantifying MDS-UPDRS in Parkinson's Disease

no code yet • 2 Apr 2024

Parkinson's Disease (PD) is the second most common neurodegenerative disorder.

Paper
Add Code

Leveraging Machine Learning for Early Autism Detection via INDT-ASD Indian Database

no code yet • 2 Apr 2024

Using the proposed model, we succeeded in predicting ASD using a minimized set of 20 questions rather than the 28 questions presented in AMI with promising accuracy.

Paper
Add Code

Feature Engineering

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result