More on Projects
for Comp 8390
*******
Note that this information is also posted on
the announcement page
of the course web
page.
http://cs.uwindsor.ca/~cezeife/courses/60-539/
Remember that project presentations will be
made on
Apr. 1 and/or Apr.
8.
Each project (group or individual) is
presented in roughly 5 to 10
minutes giving
information on:
Project topic,
work involved and importance
New things learnt
for the project
Relevance to data
warehousing or mining
Parts of projects
completed and parts left to complete
Note that for
group projects, each group member's contribution has to
be stated
clearly. It is better if each member of the group
actually
participates during the presentation.
Note that
all members of a
project group might not necessarily get the
same mark if they
do not contribute the same amount of work
and do not show
the same level of understanding in the work
involved.
Note that
research projects should clearly present the topic
with an example
discussing a few techniques in the literature
already for
solving this topic. For example, a
research in
Data Warehousing
Cleaning, should clearly summarize what the
topic is about
and present with an example about 2 or 3 methods
they already researched
for handling data warehouse cleaning.
A comprehensive
list of techniques must be included in the final
report that must
have not less than 10 current and most
relevant
references with at least 5 papers as the core of the report.
Question time
takes between 3 and 5 minutes.
By your project
presentation date, I expect that you have
completed at
least 50% of the work involved in the project.
The complete
project report and demonstration (if a system is built
or an algorithm
implemented) has to be done by project demonstration date of April 15th.
Final Project
demonstration (different from project presentation) time for each project
should be booked
for Monday, April
15th. Project demo time slots
are: 2:30 -
3:00pm, 3:30 - 4:00pm, 4:00 - 4:30, 4:30 - 5:00pm, 5:00-5:30pm.
Reports should be
submitted at least a day or two before demo.
Project Report
-----
Report should not be more than 20 pages on
12 point, 1.5 or double line
spaced font and
should include the following points:
(a) purpose of
application,
(b) system
designers and their contributions,
(c) database
schema including constraints, business rules,
(d) summary of
interface,
(e) features of
the system built or studied
(f) installation
guide,
(g) listing of
all files involved in the application and their functions,
and relationships
to each other.
(h) user manual
and any other pertinent information.
1. More
on Seminars
********
Remember that research seminars begin on
Monday, March 11. Refer back to the
seminar schedule list I gave to you for information on who
is presenting
when and which papers to read in preparation.
Students are
expected to grade seminar based on clarity, organization,
quality of talk
and technical content. Clarity judges
how you
understand what
the presenter is giving from the way they present it
(not based on
your lack of concentration on what they are talking
about).
Organization judges how cohesive (the flow or togetherness) of
the material
taught (e.g., do they clearly tell you the topic, explain the purpose before
discussing the solutions and results or do they
jump in and talk
of results when you do not even know what the
topic is about.
Efforts put into slide presentations also come in
here). Quality of talk judges the correctness of the
presentation
from the paper
presented (that is, how well does the presenter
understand
materials in this paper or are they providing wrong
information?). Technical content judges level of difficulty
of
the paper
presented and efforts put in by the student.
Some
students avoid
actually trying to understand and explain the
technical
solution (algorithms) presented in the paper, but
rather spend too
much time discussing purpose and performance
analysis.
Recall that
seminar reports should be a maximum of 5 pages
in 12-point double
line format for your research paper you are
supposed to
summarize. That is, a maximum of 5 pages
for one
paper. Hand in
your research report on or after the day
your research
paper is presented. Last reports should be submitted
on the last day
of seminar presentations.
On seminars, the course information documents 1 and 2 with their links also found through the course announcement page, provide some details such as:
Seminar report should clearly state title of
paper, authors, proceedings
or journal it came from, year of
publication, name of student and
seminar group. Then, include clearly,
problem addressed by paper,
contributions of this paper, solution
provided with clear algorithms and
running example, limitations and advantages
of solution and your opinion of work. Seminar reports are due on the
day of presentation of the paper or latest the last day of seminar
presentations.
For seminar presentation, the goal is to focus
on making clear:
Then, include clearly, problem addressed by
paper,
contributions of this paper, solution
provided with clear algorithms and
running example,
limitations and advantages of solution and your opinion of work. Check for example, how I tried to
present the Apriori and the FP-tree growth algorithms
for mining frequent patterns. Follow that approach as much as possible. Think of the Input data, Output data, then
the algorithm steps for the solution presented by paper. Summarize these first and try to use an
example to make yourself clear.
Check a
sample presentation power-point slides I left for you in the Comp 8390 (60-539)
web page, database tools. The link is:
http://cezeife.myweb.cs.uwindsor.ca/courses/60-539/databases/index.htm.
Please,
find a sample seminar presentation grading sheet I will hand out to class
during each seminar or keep in Brightspace folder,
for their own assessment of your presentation.
Focus on grading a seminar is on clarity (2 marks), organization (2 marks),
Quality of presentation (3 marks), Technical content (3 marks) for 10 marks
total.
You should also
start to book the time slot for your project
demo.
2. More
on Test
****
Test covers materials
taught in the lecture part of course and distributed into 3 sections for i) database foundation, ii) data warehouse, XML and NOSQL
databases and iii) data mining techniques. The sample test handed out in class
is a good model with a fairly good coverage of test materials (note that XML
and NOSQL Databases are new addition and not included in the sample test).
For section A
of test, Know how to design source data bases that are normalized, querying
through Sql, relational algebra and calculus; disk
anatomy (e.g., what is a sector, cylinder,
platter, track and give the size of a sector, what is the size of
a track, disk etc.). File organizations
and Indexes etc.
For section B, know how to design data warehouses from source databases and
multi-dimensional modelling aspects of the data warehouse, data warehouse
querying. Also know the differences between components of other non-traditional
databases discussed in class (XML and NOSQL databases) and the traditional
databases.
For section C,
know how to mine association rules using the basic Apriori
and FP-growth techniques, olap querying and
operations such as drill-down and roll-up etc.