Motion capture data can be characterized as a series of multi-dimensional spatio-temporal data, which is recorded by tracking the number of key points in space over time with a 3-dimensioanl representation. Such complex characteristics make the processing of motion capture data a non-trivial task. Hence, techniques that can provide an approximated, less complicated representation of such data are highly desirable. In this paper, we propose a novel technique that uses temporal clustering to generate an approximate representation of motion capture data. First, we segment the motion in the time domain with an optimal partition algorithm so that the within-segment sum of squared error (WSSSE) is minimized. Then, we represent the motion capture data as the averages taken over all the segments, resulting in a representation of much lower complexity. Experimental results suggest that comparing with the state-of-the-art methods, our proposed representation technique can better approximate the motion capture data.