I have a large dataset and want to extract a subset of it based on the values in some columns. I then want to create another database from this subset. How would I do that with sqlite3 in Python?
The column I want to filter on contains dates in the format YYYYMMDD, e.g. 20120429.
I want to get the observations before a certain date.
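To make this concrete, here is a rough sketch of the kind of thing I have in mind, not a tested solution. The table name `observations`, the column name `date`, and the helper `copy_rows_before` are placeholders I made up for illustration. It relies on the fact that YYYYMMDD values sort in chronological order, so a plain `<` comparison should be enough, and it attaches a second database file to copy the matching rows across.

```python
import os
import sqlite3
import tempfile


def copy_rows_before(src_path, dst_path, cutoff,
                     table="observations", date_col="date"):
    """Copy rows whose date_col is earlier than cutoff into a new database file.

    table and date_col are hypothetical names; they are interpolated into the
    SQL text (identifiers cannot be bound as parameters), so they must be trusted.
    """
    conn = sqlite3.connect(src_path)
    try:
        # Make the destination file visible as schema "subset_db".
        conn.execute("ATTACH DATABASE ? AS subset_db", (dst_path,))
        # Create an empty table with the same column names as the source.
        conn.execute(
            f'CREATE TABLE subset_db."{table}" AS '
            f'SELECT * FROM main."{table}" WHERE 0'
        )
        # YYYYMMDD sorts chronologically, so "<" selects earlier dates.
        conn.execute(
            f'INSERT INTO subset_db."{table}" '
            f'SELECT * FROM main."{table}" WHERE "{date_col}" < ?',
            (cutoff,),
        )
        conn.commit()
    finally:
        conn.close()


if __name__ == "__main__":
    tmp = tempfile.mkdtemp()
    src = os.path.join(tmp, "full.db")
    dst = os.path.join(tmp, "subset.db")

    # Build a tiny source database to try the function on.
    conn = sqlite3.connect(src)
    conn.execute("CREATE TABLE observations (date INTEGER, value REAL)")
    conn.executemany(
        "INSERT INTO observations VALUES (?, ?)",
        [(20120428, 1.0), (20120429, 2.0), (20120430, 3.0)],
    )
    conn.commit()
    conn.close()

    copy_rows_before(src, dst, 20120429)

    out = sqlite3.connect(dst)
    print(out.execute("SELECT date, value FROM observations").fetchall())
    out.close()
```

Note that a table created this way keeps the column names but not indexes or constraints, so those would have to be recreated separately if they matter.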
Also, how can I winsorise and calculate the average value of some columns based on another column's value?
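As far as I know SQLite has no built-in winsorising function, so I assume this part has to happen in Python. Below is a minimal sketch of what I mean, under several assumptions of my own: the helper names `winsorise` and `winsorised_group_means` are made up, the cut-offs are the 5th and 95th percentiles, percentiles use the nearest-rank definition, and the clipping is done over the whole column before averaging per group (clipping within each group would be a different choice).

```python
import math
import sqlite3
from collections import defaultdict


def winsorise(values, lower=0.05, upper=0.95):
    """Clip values to the lower and upper nearest-rank percentiles."""
    if not values:
        return []
    ordered = sorted(values)
    n = len(ordered)
    # Nearest-rank percentile: the value at position ceil(p * n), 1-based.
    lo = ordered[max(0, math.ceil(lower * n) - 1)]
    hi = ordered[min(n - 1, math.ceil(upper * n) - 1)]
    return [min(max(v, lo), hi) for v in values]


def winsorised_group_means(conn, table, group_col, value_col,
                           lower=0.05, upper=0.95):
    """Return {group: mean of winsorised value_col}, ignoring NULL values.

    table, group_col and value_col are placeholder names and are interpolated
    into the SQL text, so they must come from a trusted source.
    """
    rows = conn.execute(
        f'SELECT "{group_col}", "{value_col}" FROM "{table}" '
        f'WHERE "{value_col}" IS NOT NULL'
    ).fetchall()
    # Clip over the whole column, then split the clipped values by group.
    clipped = winsorise([value for _, value in rows], lower, upper)
    buckets = defaultdict(list)
    for (group, _), value in zip(rows, clipped):
        buckets[group].append(value)
    return {group: sum(vals) / len(vals) for group, vals in buckets.items()}


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE observations (grp TEXT, value REAL)")
    data = [("a", v) for v in (1, 2, 3, 4, 5)] + \
           [("b", v) for v in (6, 7, 8, 9, 100)]
    conn.executemany("INSERT INTO observations VALUES (?, ?)", data)
    conn.commit()
    print(winsorised_group_means(conn, "observations", "grp", "value", 0.1, 0.9))
    conn.close()
```

If the whole column does not fit in memory, the two cut-off values could presumably be found with an `ORDER BY ... LIMIT 1 OFFSET ...` query instead, but I have not tried that.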
Thanks