Documente Academic
Documente Profesional
Documente Cultură
Used the historical Motor Vehicle Collision data collected over the
past five years from NYC Open Data, which tracks details of
accidentsincluding time, location, area and the accident contribution
factors.
DATE
TIME
NUM INJURED
NUM KILLED
pre-LDA: clean
Tokenization Stop words Stemming
Prior:
Doc: topic distribution
Tpc: word distribution
Posterior:
p(word|Tpc)
p(Tpc|Doc)
Word Topic Document
Clustering Result
-Heat map by week
Normalization
No remarkable difference on
week between three clusters
Clustering Result
-Heat map by month
Normalization
The frequency of month 8-12
in cluster 3 is high
The frequency of month 5-8 in
cluster 1 and cluster 2 is high
Clustering Result
-Moving average by month