-
Notifications
You must be signed in to change notification settings - Fork 170
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] summarize
function
#1230
[WIP] summarize
function
#1230
Conversation
🚀 Deployed on https://deploy-preview-1230--pyjanitor.netlify.app |
Codecov Report
@@ Coverage Diff @@
## dev #1230 +/- ##
===========================================
- Coverage 97.71% 83.06% -14.66%
===========================================
Files 78 79 +1
Lines 3770 3886 +116
===========================================
- Hits 3684 3228 -456
- Misses 86 658 +572 |
31debc3
to
d38a0ad
Compare
6bf140c
to
a8d9c0a
Compare
ecde4a2
to
3fceb3a
Compare
@samukweku apologies for the delay in reviewing. I was originally planning to get to it after this week’s storm of events is over. Was there a reason for closing? |
haha ... not at all @ericmjl this was a draft ... I'll resurrect it when I feel I have gotten the logic right ... truth be told i'm wary of introducing a function that mimics what Pandas already does (in this case |
PR Description
Please describe the changes proposed in the pull request:
col
classAt its core, it is nothing more than a for loop. All the hard work is passed on to Pandas. It does not supplant
agg
- users should reach forsummarize
only ifagg
does not do the job - it's major addition is for grouping flexibly on multiple columns , via thecol
class - thanks to theselect_columns
syntax.**This PR resolves #1225 **
Examples:
summarize
can be a useful abstraction for scenarios whereagg
doesnt quite do the job easily - an example is from this blogpost:PR Checklist
Please ensure that you have done the following:
<your_username>
:dev
, but rather from<your_username>
:<feature-branch_name>
.AUTHORS.md
.CHANGELOG.md
under the latest version header (i.e. the one that is "on deck") describing the contribution.Automatic checks
There will be automatic checks run on the PR. These include:
Relevant Reviewers
Please tag maintainers to review.