datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
datacleaner is not magic
datacleaner works with data in pandas DataFrames.
datacleaner is not magic, and it won't take an unorganized blob of text and automagically parse it out for you.
What datacleaner will do is save you a ton of time encoding and cleaning your data once it's already in a format that pandas DataFrames can handle.
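To give a feel for the kind of encoding and cleaning work meant here, the following is a minimal sketch written in plain pandas. It is not datacleaner's implementation or API: the helper name clean_frame, the choice of median and most-frequent-value imputation, and the sample columns are all assumptions made for illustration. It also assumes every column has at least one non-missing value.

```python
import pandas as pd


def clean_frame(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper (not part of datacleaner): impute and encode a DataFrame."""
    cleaned = df.copy()  # leave the caller's DataFrame untouched
    for column in cleaned.columns:
        series = cleaned[column]
        if pd.api.types.is_numeric_dtype(series):
            # Numeric column: replace missing values with the column median.
            cleaned[column] = series.fillna(series.median())
        else:
            # Non-numeric column: replace missing values with the most frequent
            # value, then map each distinct value to an integer code.
            filled = series.fillna(series.mode().iloc[0])
            cleaned[column] = pd.factorize(filled, sort=True)[0]
    return cleaned


# Illustrative data only; the column names are made up for this example.
raw = pd.DataFrame({
    "age": [25.0, None, 40.0],
    "city": ["Austin", "Boston", None],
})
cleaned = clean_frame(raw)
print(cleaned)
```

The result is a DataFrame with no missing values and only numeric columns, which is the shape most analysis and modeling libraries expect as input.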
License
Please see the repository license for the licensing and usage information for datacleaner.
Generally, we have licensed datacleaner to make it as widely usable as possible.
Installation
[empty for now]
Usage
datacleaner can be used on the command line. Use --help to see its usage instructions.
[empty for now]
Contributing to datacleaner
We welcome you to check the existing issues for bugs or enhancements to work on. If you have an idea for an extension to datacleaner, please file a new issue so we can discuss it.