Abstract : We propose in this paper a new, alternative approach for the problem of ﬁnding a set of representative objects in large datasets. To do so, we ﬁrst formulate the general Instance Selection Problem (ISP) and then study three variants of that in order to select instances from dierent regions of the data.These variants aim at finding the objects located in three very different locations of the data: the inner frontier, the central area and the outer frontier. Solutions to these problems have been discussed and their complexities have been studied.To illustrate the effectiveness of the proposed techniques, we ﬁrst use a small, synthetic dataset for visualization purpose. We then study them on the Reuters dataset and show that the integration of instances selected by the ISP techniques is able to provide a good representation of the data and can be considered as a complementary approach for the state-of-the-art methods. Finally, we examine the quality of the selected objects by applying a topic-based analysis in order to show how well the selected documents cover the topics in the Reuters dataset.