potential when analyzing categorical datasets

• Notepad++: Edited Python files in the GALILEO program
• Jupyter Notebook: Ran GALILEO files on many data sets
• Scikit-learn: Made confusion matrices for data sets
• Matplotlib: Plotted graphs showing AIC/BIC/DIC
• The confusion matrix shows a few main concentrations with little spread, pointing towards precise lines around the synthesized clusters

Conclusions
• Too many unique values or too few data instances cause under-fitting problems for GALILEO
• Only works well with datasets that have enough samples for their attribute space
• Provides new ways to view a data set
• Needs more work and testing to identify weaknesses
• Can have a huge impact on the digital world of data
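
The confusion matrices mentioned above can be built with scikit-learn along the following lines. This is only a minimal sketch, not the GALILEO code: the label arrays `true_clusters` and `assigned_clusters` are hypothetical stand-ins for the synthesized cluster IDs and the clusters a model assigns, and the values are made up for illustration.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical data: the cluster each synthesized point was generated from,
# and the cluster a model assigned to it (illustrative values only).
true_clusters = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2, 2])
assigned_clusters = np.array([0, 0, 1, 1, 1, 1, 2, 2, 2, 0])

# Rows are true clusters, columns are assigned clusters.
cm = confusion_matrix(true_clusters, assigned_clusters, labels=[0, 1, 2])

# Normalize each row so entries are fractions of that true cluster.
# A few large entries with small values elsewhere corresponds to
# "a few main concentrations with little spread".
row_fractions = cm / cm.sum(axis=1, keepdims=True)

print(cm)
print(np.round(row_fractions, 2))
```

Row normalization is a design choice for readability: it makes clusters of different sizes comparable when the matrix is plotted as a heat map.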