June 8, 2004

Learning Models of Human Behavior with Sequential Patterns

graph image

This paper was further reading along the lines of model building for the anesthesiology application. It was presented at a AAAI workshop in 2002 (which I actually attended). The problem that the authors were addressing was that of trying to pull statistically significantly interesting patterns out of a stream of sensor data.

Their data came from a handful of households that had been instrumented with simple sensors along the lines of Intille's work. The sensors were primarily motion sensors set up in various rooms of four real households. There were between 10 and 20 sensors installed that collected data for between 63 and 123 days about 60% of which turned out to be useful data

From this data, they applied some sequential pattern discovery algorithms to try and find sequences of sensor firings which were well supported by the data. The results were swamped with noise. The basically ended up finding support for many patterns. So many patterns in fact that the problem of interpreting the raw data became the problem of interpreting the patterns that they discovered. As a result they resorted to putting a number of heuristic filters into the system. The filters made sense but were unsatisfying from a design point of view. The treated events close in time as likely being from the same activity, they "manually" removed subpatterns which were subsumed by others and the wrote some filters to process individual sensors whose design didn't match well with their application.

I agree with their own critique of their work in which they say that they didn't do any sort of analysis to determine novelty of a pattern. Comparing what they saw to a background model would have caused the longer sequences to appear more unusual in a natural way without adding hand tuned rules.

I felt like both their sensors and their patterns lacked anything in the way of semantics. It was impossible to draw any conclusion about what was going on in the patterns, primarily because the sensors themselves lacked any type of semantics. It's hard to interpret a series of motion sensors firing even if it happens regularly.

They also had the real world problem of strange user routines, differing patterns on weekdays, pets messing up the data, sensors failing and vacations, which we, as researchers, tend to write off as anomalies, but are probably more likely the norm in a household.

I appreciated this paper because it helped to clarify what our anesthesiology work is not doing. In particular, we are not trying to pull signal out of a lot of noise. We are going to be doing more of a model merging operation than a model discovery operation. Our models are clearly delineated in time and have a semantics associated with them that is provided by the person who is recording the data stream.

This paper had some good data and seemed a little bit like computational biology work in which computers are put to the task of finding conserved sequences in genomes.

Posted by djp3 at June 8, 2004 9:43 AM | TrackBack (0)
Comments
Post a comment

Post a comment