Given a sample of observations $X$, by changing a parameter $p$ we can divide $X$ into two subsamples $X_1$ and $X_2$ (this division is done in a non-trivial way which is nonetheless irrelevant to the problem at hand). We would like to find a value of $p$ for which subsamples $X_1$ and $X_2$ are as much as possible "distinct". To this purpose, we can consider how "far" apart the empirical distribution functions of $X_1$ and $X_2$ are. I am deciding between two objective functions:
1- $max_p \{max_x (\mid F_{X_1}(x)- F_{X_2}(x)\mid)\}$
2- $max_p \{\sum_x (\mid F_{X_1}(x) - F_{X_2}(x)\mid)\}$,
i.e., the first maximization problem searches for a value of $p$ that maximizes the biggest absolute difference between the EDFs, while the second one chooses a value of $p$ for which the area between the graphs of EDFs is maximized.
I acknowledge that the notion of distinctness might be ambiguous, however, I am looking for pros and cons of each objective functions and if there is any similar problem studied in the literature.
