GitHub - amueller/dabl: Data Analysis Baseline Library

 5 years ago
source link: https://github.com/amueller/dabl
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.



The data analysis baseline library.

  • "Mr Sanchez, are you a data scientist?"
  • "I dabl, Mr president."


This is pre-alpha software and is still very-much in flux.

Current scope and upcoming features

This library is very much still under development. Current code focusses mostly on exploratory visualiation and preprocessing. There are also drop-in replacements for GridSearchCV and RandomizedSearchCV using successive halfing. The next step in the development will be adding portfolios in the style of POSH auto-sklearn to find strong models quickly. In essence that boils down to a quick search over different gradient boosting models and other tree ensembles and potentially kernel methods.

Stay Tuned!

About Joyk

Aggregate valuable and interesting links.
Joyk means Joy of geeK