23.1.12

Acquiring Another Dataset Pt. II

From the portugal paper --"the test set adopted for the qualitative evaluation of the proposed method is the one presented in (Dalitz et al., 2008) and already described."

Dalitz raises and answers some questions in his 2008 paper about a dataset:

"How do we measure the distance of a given segmentation from a perfect 'ground truth' segmentation, and how do we obtain the ground truthing data?"

"Even though the labeling of the ground-truth data could be done manually, this is very time consuming and has the disadvantage of an ad-hoc classification of dubious pixels belonging both to a staffline and a crossing symbol. Therefore, we generate our music images from postscript images created with music typesetting software, which allows for “perfect” staff removal."

Dalitz's data set is available over here (along with another handwritten data set).

No comments:

Post a Comment