• Home
  • Categories
  • Pricing
  • Submit
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Ever. All rights reserved.·Terms of Service·Privacy Policy·Cookies
    Decorative pattern
    Decorative pattern
    1. Home
    2. Data Science
    3. Awesome Feature Engineering

    Awesome Feature Engineering

    A curated list of resources dedicated to feature engineering techniques for machine learning, covering methods to extract and transform features for improved model performance.

    Surveys

    Loading more......

    Information

    Websitegithub.com
    PublishedMar 18, 2026

    Categories

    1 Item
    Data Science

    Tags

    3 Items
    #feature-engineering#machine-learning#data-science

    Overview

    Awesome Feature Engineering provides resources for creating better features from raw data, a critical step that often determines machine learning model success.

    What is Feature Engineering?

    The process of using domain knowledge to extract features from raw data via data mining techniques to improve machine learning algorithm performance.

    Core Techniques

    Numerical Features

    • Scaling and normalization
    • Binning/discretization
    • Polynomial features
    • Log transformations
    • Power transformations

    Categorical Features

    • One-hot encoding
    • Label encoding
    • Target encoding
    • Frequency encoding
    • Binary encoding

    Datetime Features

    • Extracting components (year, month, day)
    • Cyclical encoding (sin/cos)
    • Time since events
    • Holiday indicators

    Text Features

    • TF-IDF
    • Word embeddings
    • N-grams
    • Count vectorization
    • Sentiment scores

    Image Features

    • Color histograms
    • Edge detection
    • SIFT/SURF features
    • Deep learning embeddings

    Advanced Methods

    Automated Feature Engineering

    • Featuretools
    • tsfresh (time series)
    • AutoFeat
    • Feature-engine

    Feature Selection

    • Filter methods (correlation, mutual information)
    • Wrapper methods (RFE)
    • Embedded methods (L1 regularization)
    • Importance-based selection

    Feature Interactions

    • Polynomial interactions
    • Ratio features
    • Difference features
    • Product features

    Domain-Specific

    Time Series

    • Lag features
    • Rolling statistics
    • Seasonal decomposition
    • Fourier features

    Geospatial

    • Distance calculations
    • Spatial aggregations
    • Coordinate transformations
    • Clustering-based features

    Tools & Libraries

    • scikit-learn preprocessing
    • pandas for data manipulation
    • Featuretools for automated generation
    • category_encoders for categorical encoding
    • tsfresh for time series

    Best Practices

    • Understand your data first
    • Domain knowledge is crucial
    • Avoid data leakage
    • Cross-validate feature engineering
    • Document transformations
    • Monitor feature drift

    Evaluation

    • Feature importance analysis
    • A/B testing with/without features
    • Learning curves
    • Validation set performance