Skip to content
Commit d39d6169 authored by Douglas RAILLARD's avatar Douglas RAILLARD Committed by Douglas Raillard
Browse files

lisa.datautils: Speed up the conversion from strings to bytes

Use pandas categorical dtype as an intermediate type to deduplicate the
strings to decode. Since decoding is expensive, this is faster, and the
result is also less memory hungry since strings are deduplicated.
parent 15c881c8
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment