additional options for pivot_wider and pivot_longer #10

rdboyes · 2023-03-15T13:49:10Z

As implemented, @pivot_wider and @pivot_longer support only a few of the options offered by their tidyverse counterparts, pivot_wider and pivot_longer. The remaining options are ones that I personally rarely use and/or that don't have clear analogs in stack() and unstack(). I'm interested to hear whether there are specific options that others use heavily and that should be prioritized.


kdpsingh · 2023-03-15T20:21:11Z

As a general philosophy, we shouldn't go out of our way to support functionality that is neither built directly into DataFrames.jl nor trivial to add on top of it. If something is missing in that package, I would recommend we file an issue there.

The main functionality I'd like to see is the ability to specify more than one selection separated by commas for pivot_longer().
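For reference, stack() in DataFrames.jl already accepts several measure columns at once. A minimal sketch, assuming a toy table whose column names (id, x, y, z) are made up for illustration; how @pivot_longer would translate a comma-separated selection into this call is an assumption, not a description of the current implementation:

```julia
using DataFrames

# Hypothetical toy data; the column names are illustrative only.
df = DataFrame(id = 1:2, x = [10, 20], y = [30, 40], z = [50, 60])

# stack() takes a vector of measure columns. Columns that are not listed
# (here id and z) are kept as identifier columns by default.
long = stack(df, [:x, :y]; variable_name = :variable, value_name = :value)
```

The result has one row per (id, measure column) pair, with the original column name in variable and its value in value.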

kdpsingh · 2023-03-22T18:53:43Z

No urgency on this @rdboyes, but I just want to figure out if we should close this issue.

Should we work on adding the ability to select multiple columns separated by commas? Anything else from your end that we should prioritize here?

I just want to make sure that, if no changes are planned, we close this. But I do think there's room for that one tiny improvement before closing. Happy to help with implementation if you'd like me to take a look.

DoktorMike · 2025-01-14T20:55:42Z

Something I specifically miss is the ability to use names_from=[VarA, VarB] as I quite often end up with several columns I want to turn into wide format.

kdpsingh · 2025-01-15T02:57:28Z

Do you happen to know if there is a syntax to achieve this in DataFrames.jl? Need to explore a bit, so a minimal DF example would be helpful.

DoktorMike · 2025-01-17T12:49:03Z

> Do you happen to know if there is a syntax to achieve this in DataFrames.jl? Need to explore a bit, so a minimal DF example would be helpful.

Here's an example I made, which, as you can see, is rather contrived. I used an old solution from bkamin to make it work.
This Stack Overflow question also discusses it: https://stackoverflow.com/questions/64738653/pivottable-in-over-multiple-columns-in-julia

using DataFrames, Random

Random.seed!(1);

df = DataFrame(
    Sales = rand(1:3, 15) |> sort,
    Label1 = rand('A':'B', 15) .|> Symbol,
    Label2 = rand('Q':'R', 15) .|> Symbol,
    Label3 = rand('E':'F', 15) .|> Symbol
)

# Count rows per combined label, keeping Sales as the row key
unstack(
    combine(
        groupby(
            select(df, :Sales, [:Label1, :Label2, :Label3] => ByRow(Symbol) => :Label),
            [:Sales, :Label]
        ), nrow
    ), :Label, :nrow
)

# Row │ Sales  BQE      AQE      ARF      ARE      AQF     BRF      BQF
#     │ Int64  Int64?   Int64?   Int64?   Int64?   Int64?  Int64?   Int64?
# ────┼─────────────────────────────────────────────────────────────────────
#   1 │     1        1        1        1        1       1  missing  missing
#   2 │     2        2  missing        1  missing       1        1  missing
#   3 │     3  missing  missing  missing  missing       1        2        2


# Sum Sales per combined label
unstack(
    combine(
        groupby(
            select(df, :Sales, [:Label1, :Label2, :Label3] => ByRow(Symbol) => :Label),
            [:Label]
        ), :Sales => sum
    ), :Label, :Sales_sum
)

#  Row │ BQE     AQE     ARF     ARE     AQF     BRF     BQF
#      │ Int64?  Int64?  Int64?  Int64?  Int64?  Int64?  Int64?
# ─────┼────────────────────────────────────────────────────────
#    1 │      5       1       3       1       6       8       6
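The workaround above could be wrapped in a small helper that mimics names_from with several columns. This is only a sketch: unstack_multi, its sep keyword, and the temporary column name __key are hypothetical and not part of DataFrames.jl or TidierData.jl. It assumes each combination of row keys and column keys occurs at most once, so aggregate first (as in the groupby/combine calls above) if that does not hold.

```julia
using DataFrames

# Hypothetical helper: join several key columns into one temporary key
# column, then hand that single column to unstack().
function unstack_multi(df, rowkeys, colkeys, value; sep = "_")
    tmp = select(df, rowkeys, value,
                 colkeys => ByRow((xs...) -> join(xs, sep)) => :__key)
    return unstack(tmp, rowkeys, :__key, value)
end

# Illustrative data with unique (id, a, b) combinations.
df = DataFrame(id = [1, 1, 2, 2],
               a  = ["A", "B", "A", "B"],
               b  = ["Q", "Q", "R", "R"],
               v  = [1, 2, 3, 4])

wide = unstack_multi(df, [:id], [:a, :b], :v)
```

Each distinct pair of a and b becomes one column (for example A_Q), and combinations that never occur for a given id are filled with missing.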

kdpsingh transferred this issue from TidierOrg/Tidier.jl Jul 31, 2023

cnrrobertson mentioned this issue May 23, 2024

Ability to specify lists, Not lists, colon, or nothing for @pivot_longer #104

Merged
