Skip to content

Commit d231fad

Browse files
committed
null处理新方案
1 parent 2f0e3e0 commit d231fad

File tree

3 files changed

+30
-8
lines changed

3 files changed

+30
-8
lines changed

README.md

+21-6
Original file line numberDiff line numberDiff line change
@@ -135,6 +135,21 @@ df = codegen_exec(df.lazy(), _code_block_1, _code_block_2).collect(engine="gpu")
135135
1. 根据算子前缀分类(`get_current_by_prefix`),限制算子必需以`ts_``cs_``gp_`开头
136136
2. 根据算子全名分类(`get_current_by_name`), 不再限制算子名。比如`cs_rank`可以叫`rank`
137137

138+
## Null处理/停牌处理
139+
140+
https://github.com/pola-rs/polars/issues/12925#issuecomment-2552764629
141+
非常棒的点子,总结下来有两种实现方式:
142+
143+
1.`null`分成一组,`not_null`分成另一组,要计算两次
144+
2. 仅一组,但复合排序,将`null`排在前面,`not_null`排后面,只要计算一次
145+
146+
```python
147+
X1 = (ts_returns(CLOSE, 3)).over(CLOSE.is_not_null(), _ASSET_, order_by=_DATE_),
148+
X2 = (ts_returns(CLOSE, 3)).over(_ASSET_, order_by=[CLOSE.is_not_null(), _DATE_]),
149+
```
150+
151+
目前使用的是第2种
152+
138153
## 二次开发
139154

140155
1. 备份后编辑`demo_express.py`, `import`需要引入的函数
@@ -161,12 +176,12 @@ df = codegen_exec(df.lazy(), _code_block_1, _code_block_2).collect(engine="gpu")
161176
9. `gp_`开头的函数都会返回对应的`cs_`函数。如`gp_func(A,B,C)`会替换成`cs_func(B,C)`,其中`A`用在了`groupby([date, A])`
162177
10. 支持`A,B,C=MACD()`元组解包,在底层会替换成
163178

164-
```python
165-
_x_0 = MACD()
166-
A = unpack(_x_0, 0)
167-
B = unpack(_x_0, 1)
168-
C = unpack(_x_0, 2)
169-
```
179+
```python
180+
_x_0 = MACD()
181+
A = unpack(_x_0, 0)
182+
B = unpack(_x_0, 1)
183+
C = unpack(_x_0, 2)
184+
```
170185

171186
## 下划线开头的变量
172187

expr_codegen/_version.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = "0.10.5"
1+
__version__ = "0.10.6"

expr_codegen/polars_over/code.py

+8-1
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,14 @@ def codegen(exprs_ldl: ListDictList, exprs_src, syms_dst,
7676
# 不想等,打印注释,显示会更直观察
7777
func_code.append(f"# {va} = {s1}")
7878
if k[0] == TS:
79-
func_code.append(f"{va}=({s2}).over(_ASSET_, order_by=_DATE_),")
79+
# https://github.com/pola-rs/polars/issues/12925#issuecomment-2552764629
80+
_sym = [f"{s}.is_not_null()" for s in set(sym)]
81+
if len(_sym) > 1:
82+
_sym = f"pl.all_horizontal({','.join(_sym)})"
83+
else:
84+
_sym = ','.join(_sym)
85+
# func_code.append(f"{va}=({s2}).over({_sym}, _ASSET_, order_by=_DATE_),")
86+
func_code.append(f"{va}=({s2}).over(_ASSET_, order_by=[{_sym}, _DATE_]),")
8087
elif k[0] == CS:
8188
func_code.append(f"{va}=({s2}).over(_DATE_),")
8289
elif k[0] == GP:

0 commit comments

Comments
 (0)