I have a research hypothesis and now I'am trying to look at it from different angles.Now I am a bit puzzled.Maybe someone is also interested in machine learning application(especially clustering) in financial area and can advise something.
There are many portfolio theories, for example, Markowitz portfolio theory, that assign weights for every share in portfolio.Imagine, that at the end of a certain year I am estimating results and planning my asset portfolio for the next year. Shares, that perfomed good, can fall during the next year and vice versa. So, it seems reasonable not to rely entirely on the yield and volatility of a security's price over the year.
So, my hypothesis is that we can analyse annual financial reports of stock issuers. For example, choose group of financial multipliers (P/B,P/E,ROE) also maybe compute some annual metrics such as beta coefficient, average price change over the year (in %) and group shares into different segments. After that, we can pick elements from all clusters and only after that assign weights using traditional theories.Ideally, we will get a group of stocks with different performance and seems that it could have a positive impact on portfolio sustainability.
What do you think? Does it sound adequately? Are there any pitfalls in my theory? What to look for when selecting features for clustering?
Thanks everyone in advance!