CSCI 4152/6509 - Course Project
[ Home |
Calendar |
Project |
P0 |
Misc |
A0 |
A1 |
A2 |
A3 |
A4 |
Login ]
Page Content
Choosing Project Topic
The course project for graduate students (CSCI 6509), who are in a
research stream (e.g., MCS, PhD) should follow the
basic structure of a typical research project, such as the research
work on a thesis, only on a smaller scale.
The undergraduate students (CSCI 4152) can also choose a research
project, or they can do a more implementation-focused project without
strong emphasis on related-work research.
The graduate students who are not in a research stream (e.g., MACS or MEC
students) may select a research topic, but also an application or
business-oriented project with core methodology based on NLP.
You can form project teams of up to four students, or work
individually.
The final report should be in the form of a technical report.
The presentations will be up to 8 minutes long, followed by 4 minutes
for questions and switching speakers.
The research graduate students (thesis streams) are required to give
individual presentations, while other students can choose to give
individual or team presentations.
Regarding presentation dates, once the presentation time slots are
shown on the course calendar, you will need to let the instructor know
your preference by email.
The presentations will be scheduled based on first-come-first-served
basis.
Deliverables
- P0 – project topic proposal, by email, due on Friday, Oct 1, 2021,
- P1 – a project statement, using GitLab, due on Friday, Oct 29, 2021,
- P – oral presentation, book time slot early, submit slides
or slides overview (PDF or PPT) using GitLab 24h ahead, and
- R – project report, submit using GitLab,
due on Tuesday, Dec 7, 2021.
Note about emails: All emails related to the course must have
the course number included in the subject,
such as CSCI4152, CSCI6509, or, the best option, CSCI4152/6509.
In more details, your subject lines for two required emails must be:
- CSCI4152/6509: Presentation time
- CSCI4152/6509: P0 submission
P0 – Project Topic Proposal
Worth: 1% of the final mark.
You will need to choose a topic for your project.
By the due date, you need to send to the instructor an email in plain
text with the following information:
- tentative title,
- the list of team members, and
- one paragraph description of the topic.
Please do this as soon as you have chosen a topic. If two or more
students or groups have the same or
very similar topics, the team that sends P0 later may be required to change
the topic. If the topic is not sufficiently relevant to the course,
you may be asked to change it.
The following is a sample format of the email for P0:
To: Vlado Keselj <vlado@cs.dal.ca>
Subject: CSCI4152/6509: P0 submission
Hi,
This is our P0 submission:
Title: Classification Using Advanced NLP Technique
Team members:
Firstname Lastname, CSID, Banner#, email
First2 Last2, CSID2, Banner#2, email2
Description:
This is a paragraph description of the research problem, including
possibly some ideas about where to get data, and what methods to use.
Regards,
Firstname
P1 – Project Statement
Evaluation weight: Worth 5% of the final mark.
Deliverable: The P1 submission is one PDF document that MUST
be named p1.pdf
(mandatory lowercase letters)
and submitted using the project GitLab repository created by the
instructor for the course by the P1 deadline.
By the P1 deadline, all project repositories should have been created
in the FCS GitLab server and you should know your project id, which
has the format P-dd, where dd is the number of your project; e.g.,
P-01, P-02, etc.
The P1 submission must be submitted in the main directory of your
GitLab repository. The exact location of the repository will be
explained in an email sent to the class email list.
If you want, you can also submit additional documents (such as Word or LaTeX version, or a
plain text version).
The P1 document should be about 2 pages long, although it is
perfectly acceptable to have a longer document.
Content:
The P1 document must include:
- Project title,
- Names of the member(s) of the group,
- Problem statement,
- List of possible approaches with citations to relevant work,
- Project plan for the rest of the term, and
- List of references.
The statement should identify a feasible project.
Style: Style is quite important for P1 submission. In
addition to inciting student teams to keep working on the project
during the term, a purpose of P1 submission is to have a small
exercise in preparing a small research paper in scientific style
with appropriate use of fonts, sections numbers, citations, and
references.
You can find more notes on the style of P1 and the final report
in the part P1 and Project Report Style Notes.
P1 will be marked based on its completeness, style and structure,
clarity of presentation, and research on and analysis of related
work. P1 submission is a small test for submission of the final
report. It is recommended that you use LaTeX. Present references at
the end with full information in a way as typically done in research
papers.
Use appropriate citations in the main text to the list of references.
P – Project Presentation
Worth: 10% of the final mark, including class participation.
Too book a presentation timeslot, you should check the course
calendar web page for free timeslots, and send an email request to
vlado.ca with subject line "CSCI4152/6509: Presentation time"
to book a time slot.
You are required to submit the slides of the presentation at least
24 hours before your presentation
using GitLab.
If you have individual slides and prefer to keep them separate, you
can use the GitLab project assigned to you personally, or you want to
share your slides with your team members you can use GitLab team
project repository.
In both cases, your slides should be saved in a directory named
`presentation'.
The slides can be original slides, or a slides handout (e.g., 6 slides per page),
and they must be in PDF or PowerPoint format.
You can submit additional files as well, such as presentation.tex.
Duration:
The presentations should last up to 8 minutes, with 4 additional
minutes reserved for questions and changing speakers, for the total of
12 minutes.
Content:
There is a significant flexibility in choosing the topic of your presentation, but it
should be related to the project. It could be the work you have done up
to that time, or what you plan to do. It is a good idea to include
research or other related work that you did so far. You could also
present a related method from the textbook or another paper.
Evaluation scheme for presentations:
- content: how interesting and valuable is the presentation,
appropriateness of the topic, appropriateness for the audience and
the time allocated,
- presentation: clarity,
it should be vivid, interesting;
organization and structure of the presentation should be well-planned;
time length should be appropriate,
- slides: organization of the presentation; slides content: appropriate
amount of text, use of figures,
- question answering: listening and answering the questions being asked,
appropriate answers, answering the actual question to the point,
but not going into a too lengthy additional discussion.
R – Project Report
Worth 20% of the final mark.
The written project report is submitted in an electronic
form using GitLab.
The report must be a PDF file in the main project directory, and it
must be named report.pdf.
You can also submit additional code, data, and other files that you find relevant.
The reports are kept in archive with the instructor for several years.
Peer evaluation is not currently a mandatory part of the
evaluation scheme, but the students will be asked to submit their peer
evaluation after project report submission. A peer evaluation
consists of feedback about contribution of all team members to the
project and it is useful to discourage and detect very imbalanced
contributions and give stronger incentive to all members of a team to
contribute. If peer evaluations indicate imbalance in contributions
in a project team, they may affect individual marks for the report, in
a positive or negative way.
A typical recommended structure of a project report for a
reaseach-oriented project is:
- Title, Author(s), Course name,
- Abstract – make sure it is an abstract of the whole paper and not
just a part of the introduction. The abstract should be brief,
definitely not longer than a half of a page.
- Introduction – introduce the problem; get a reader's attention;
explain motivation and significance of the problem,
define project objectives. It is said
that the title, abstract, introduction, and the whole paper
should in a way express the same story in 10, 100, 1000, and
10,000 words, respectively. (Do not take these numbers literally!)
- Related Work – cover related work. Has this topic been
studied yet? Do not just give an annotated list of citations. Give a critical
analysis of the previous work.
- Problem Definition and Methodology – there is no good
research without a research problem. Define it precisely. Describe
your methodology, algorithms, system overview, or similar.
- Experiment Design – experiments are not mandatory, but some
form of evaluation of your approach should exist.
Evaluation, Discussion of evaluation results
- Conclusion
- References
- Appendices
This structure is just a guideline and parts may not be relevant to
your project. There are no fixed requirements about the length of the paper, since it
may depend on the type of the project and number of people in a group.
It is expected that a project report contains at least 8 pages, and it
may be sufficient for an excellent project if some implementational or
experimental work has been done.
P1 and Project Report Style Notes
P1 (Project Statement) can be considered to be a small exercise in
writing the project report, and it should follow the same style
guidelines, except the structure and the length.
The project report should be written in a good style from a technical
Computer Science perspective. This is a bit loose specification. A
couple of additional more concrete guidelines are provided below:
- use a readable serif font in the main body of the text, font
size 10-12pt; e.g., Times Roman is a standard serif font, or TeX and
LaTeX use a readable serif font by default;
- justified text in the main body (not ragged right, for example),
- Abstract and References parts should not be numbered, but the
sections and subsections between them must be numbered and do not
use colon (:) at the end of the section and subsection titles.
- Figures and Tables should be numbered and have captions at the
bottom. (Tables may have captions on the top alternatively.)
- The document should be printer-friendly: use of color is okay,
but no information should be lost after printing the document in
black-and-white. There must be no large area of black or very dark
color, since this is waste of ink and generally does not have a good
appearance on a plain paper.
- The text should be readable and written in a narrative style in
the sense that it should have a normal flow when reading. For
example, it should not be mainly itemized. However, it should also
not be purely narrative since it is a technical document, and it is
expected that it contains some definitions, formulas, algorithms,
code snippets, figures and similar (not necessarily all of those).
Example Approach to Course Project
- Choose an NLP-related problem that is important and interesting
in your opinion. You should have some ideas about how it could
be solved, and about what interesting results you could obtain
by the end of term. The discussed problem should be
feasible in this sense, but it should not be trivial.
- The next step is to search through existing published work and
find out about existing solutions on the same problem, or to the
closest similar problem. You can start with the textbook.
- Design your method, implement it, and run experiments; possibly
try method variations.
- Analyze results. Revisit your methodology if needed.
- Finish the report. Keep writing during the term.
While the above guidelines describe a typical research project in NLP,
you can also consider some alternative forms:
Alternative Project Types
- theoretical project: You can focus on establishing a
formal framework and proving theoretical results, usually
regarding algorithm complexity of some solutions. Still, it may
be a good idea to have some experimental results even in this
type of project. This kind of project is still very research-oriented.
- implementation: You can put more emphasis on
the implementational part of the project. This usually means
developing a system prototype with multiple functionalities.
In this case, you can devote more space in your report to the
design, testing, and user documentation.
This kind of project could fit well undergraduate students, and they could
choose to implement some algorithm that is well-understood, and not necessarily
very relevant to the latest research.
- software evaluation: Choose one or more existing software
tools, download them, learn to use them, and use them to solve a
problem. Report on your evaluation of the tool, instructions
about its usage, advantages and limitations of the tool, and your
particular approach.
This kind of project is likely more appropriate for undergraduate students.
- survey: The survey format is a critical review of the current
research in a narrow sub-area of NLP. If you choose this option,
make sure that you do not cover a too wide, or already well-understood area,
with published surveys on the topic.
I would discourage you to go with this option, unless it is a
part of your wider research program. It is difficult to write a
good survey paper in a one-term time.
Resources
- NLP Research Links on the course web page
- http://acl.ldc.upenn.edu/ —
ACL Anthology
- Google scholar and other scientific Internet resources
Topics of Some Previous Course Projects
The topics of some previous course projects are included in the
lecture notes.
2002-2021 © Vlado Keselj, last update: 16-Dec-2021