#
categorical-features
Here are 43 public repositories matching this topic...
-
Updated
Jan 21, 2020 - Jupyter Notebook
Tensorflow implementation of Product-based Neural Networks. An extended version is at https://github.com/Atomu2014/product-nets-distributed.
-
Updated
Jan 30, 2020 - Python
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
python
data-science
machine-learning
data-mining
random-forest
kaggle
id3
gbdt
gbm
gbrt
gradient-boosting-machine
cart
adaboost
decision-trees
gradient-boosting
c45-trees
categorical-features
gradient-boosting-machines
regression-tree
-
Updated
Jun 29, 2021 - Python
This repository contains a notebook demonstrating a practical implementation of the so-called Entity Embedding for Encoding Categorical Features for Training a Neural Network.
-
Updated
Feb 23, 2019 - Jupyter Notebook
Open
add pkgdown site
1
bfgray3
opened
Jan 19, 2019
Interactive ML Toolset
python
data-science
machine-learning
tutorial
binder
pipeline
notebook
parallel-computing
kaggle
feature-selection
feature-extraction
ensemble-learning
nbviewer
feature-engineering
reproducibility
categorical-features
-
Updated
Apr 14, 2020 - Python
vc1492a
commented
Sep 18, 2018
As to generalize the terminology away from form population and more towards providing recommendations generally.
glmdisc Python package: discretization, factor level grouping, interaction discovery for logistic regression
-
Updated
Nov 30, 2020 - Python
Predicting the ideological direction of Supreme Court decisions: ensemble vs. unified case-based model
pipeline-framework
feature-importance
classification-model
categorical-features
ensemble-machine-learning
curse-of-dimensionality-solution
data-leakage
-
Updated
Oct 14, 2018 - Jupyter Notebook
Multimodal deep learning package that uses both categorical and text-based features in a single deep architecture for regression and binary classification use cases.
deep-learning
wide-and-deep
factorization-machine
neural-factorization-machines
categorical-features
deepfm
multimodal
multimodal-deep-learning
deep-and-cross
-
Updated
Jul 23, 2020 - Python
A small tutorial to demonstrate the power of CatBoost Algorithm
data-science
machine-learning
tutorial
gpu
gpu-computing
decision-trees
gradient-boosting
catboost
categorical-features
catboost-algorithm
catboost-tutorial
-
Updated
Jun 1, 2021 - Jupyter Notebook
-
Updated
Apr 9, 2021 - Jupyter Notebook
Kaggle Categorical Feature Encoding Challenge II, private score 0.78795 (110 place)
-
Updated
Apr 20, 2020 - Jupyter Notebook
Generic encoding of record types
data-science
machine-learning
data-mining
generic-programming
data-analysis
preprocessing
one-hot-encode
categorical-data
categorical-features
-
Updated
Jan 27, 2019 - Haskell
Supervised Learning Problem. In this categorizing the customers in four groups, as follows: 1- Basic Service 2- E-Service 3- Plus Service 4- Total Service.
-
Updated
Jan 26, 2020 - Jupyter Notebook
A python package to compute pairwise Euclidean distances on datasets with categorical features in little time
-
Updated
Aug 8, 2020 - Python
This study creates machine learning models to predict the seriousness of car crashes using 2019 and 2020 crash reports from the publicly accessable database maintained by the Chicago Police Department. A car crash is considered serious if the crash results in an injury or the car is towed due to the crash. Models use categorical features that describe conditions at the time of the crash and crash causes to predict the required target. The current focus is to classify whether a crash results in an injury. All machine learning models are trained, validated, and tested on randomly split 2019 crash reports. The best model (along with all others) are then tested using the full set of 2020 crash reports.
data-science
machine-learning
study
analysis
crash-reports
machinelearning
chicago
injury
car-crashes
categorical-features
chicago-data-portal
chicago-police-department
traffic-crashes
-
Updated
Mar 31, 2021 - Jupyter Notebook
Kaggle Competition (Encoding categorical variables)
-
Updated
Apr 3, 2020 - Jupyter Notebook
A set of tools for machine learning (for the current day, there are active learning utilities and implementations of some stacking-based techniques).
-
Updated
Dec 31, 2018 - Python
Category encoders integrated with Fast.ai
python
pytorch
notebooks
data-preprocessing
data-preparation
fastai
categorical-features
fasttext-embeddings
fastai-category-encoders
-
Updated
Jan 25, 2021 - Python
Explore various natural language processing models using Python.
nlp
natural-language-processing
spark
sentiment-analysis
pipeline
nltk
bag-of-words
feature-vector
spam-detection
categorical-features
pyspark-mllib
python-nlp-libraries
-
Updated
Nov 22, 2020 - Jupyter Notebook
Feature Importance of categorical variables by converting them into dummy variables (One-hot-encoding) can skewed or hard to interpret results. Here I present a method to get around this problem using H2O.
-
Updated
Jun 10, 2019 - Jupyter Notebook
Why data analysis? , How to understand the problem, what to do for data analysis, and how clean the data for building Machine Learning models
python
correlation
exploratory-data-analysis
binning
datawrangling
descriptive-statistics
normalization
grouping
dataanalysis
categorical-features
missing-values
anova-test
pre-processing-data
-
Updated
Feb 20, 2021 - Jupyter Notebook
-
Updated
Jul 19, 2018 - Python
My solution for Kaggle competition "categorical feature encoding challenge II" with public and private score of 0.783.
machine-learning
kaggle
kaggle-competition
ensemble-model
gradient-boosting-classifier
kaggle-dataset
ensemble-classifier
categorical-data
categorical-features
kaggle-solution
-
Updated
Jul 27, 2020 - Jupyter Notebook
Binary classification, with every feature as categoricals
machine-learning
random-forest
exploratory-data-analysis
jupyter-notebook
python3
kaggle-competition
xgboost
class-imbalance
categorical-features
-
Updated
Nov 12, 2019 - Jupyter Notebook
Medium Post: some techniques useful to deal with missing values of Categorical Features
-
Updated
Nov 17, 2020 - Jupyter Notebook
-
Updated
Nov 16, 2020 - Jupyter Notebook
Improve this page
Add a description, image, and links to the categorical-features topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the categorical-features topic, visit your repo's landing page and select "manage topics."
It would be great to have FBeta, F2, or F0.5 metrics to be implemented without the need for a custom metric class defined by user.
catboost version: 0.26