They did use observed data.
The model is sampled from the observations - blurring (to reduced detail), then PCA.
MCMC(Markov Chain Monte Carlo) and sampling the posterior distribution is essential to show they didn't "add" anything that wasn't already there.
It's a model of physically observed components (dictionary lookup) that are combined into images that are consistent with the original observations.