Mariam Salloum

Textbooks

Course Description

We are living in the age of big data, where data is measured in terabytes and zetabytes, streamed in real-time, and derived at unprecedented speeds in diverse forms. Big data promises to impact the world as we know it, from increased productivity at our workplace to how we live our daily social lives. However, it also presents tremendous challenges as entities from individuals, companies, organizations, political groups, to governments strive to manage and gain insights from vast torrents of complex data.

This course will introduce you to the fundamental platforms and tools currently available to collect, integrate, store and analyze Big Data. The focus of the course is to conduct a quarter long team project using the tools and platforms used to manage big data.

Assignments & Project

This class is a project based course. There are several components:

  • A set of introductory projects where the fundamentals of big data platforms and tools are learned. Students can work in teams of two.

  • Quarter-long project whose theme is “Big data analysis”. Each project must have three components
    (a) collecting data
    (b) preprocess or analyze data in a distributed method using either MapReduce or Spark, then saving the result in a key-value store (Cassandra)
    (c) building a web interface to explore the preprocessed data.

  • Project requirements include:
    (a) Project proposal
    (b) Status reports
    (c) Team self-assessment and reflection questionnaire
    (d) Report to be completed at the end of the quarter
    (e) Presentation

Grade Breakdown

Grades will be weighted as follows:

Item Percentage
Assignments 15%
Project 80%
Class Participation 5%

Class participation and attendance is expected. If you have to miss a class, then please let me know ahead of time. There will be 3 programming assignments in the first half of the quarter. The introductory assignments are aimed to introduce you to the fundatmental platforms and tools that will be used in your projects. Assignments can be turned in no more than 48 hours after the deadline, and will incur 10% for each late day. The course project requirements will be provided during the quarter. The requirements include a course proposal, weekly status reports, a final report and assemestment surveys, and finally a class presentation.

Academic Integrity

Academic integrity is fundamentally about ethical behavior. Appropriate collaboration and research of previous work is an important part of the learning process. However, not all collaboration or use of existing work is ethical. The overarching principles which should guide you when determining whether or not it is appropriate to use a source or collaborate with a classmate involve answering these questions: Does this fit within the spirit of the assignment/activity?

In any ethical decision there is always judgment involved. Some assignments and activities involve collaborating with a team, in others you are asked to work individually. You are expected to have some common sense and to use it.

Does this help me or someone else in the class to improve our skills and/or understanding of class material?

As a guiding principle, talking about concepts is usually good, talking about specific answers or approaches to problems is usually not.

Does this misrepresent my own (or someone else's) capabilities and understanding of materials for the purpose of grading?

Attribution of sources is a key idea here; if you use work which is not your own, that work should be cited. For this class, citation is not required to be in a specific format, but any citation should clearly identify the author and source of any work which is not your own. Refer to the university policy on plagiarism and cheating.

Have any specific instructions been given for this assignment?

Not all assignments are the same. On some you will be given explicit instructions about what level of collaboration is appropriate, and you are expected to abide by those restrictions even if you disagree with them.

If you are at all uncertain about an action, whether it be working with another student, researching existing code, or something else, you are always welcome to ask the instructor for clarification.

The severity of sanctions imposed for an academic integrity violation will depend on the severity of the transgression and ascertained intent of the student. Penalties may range from failing the assignment to failing the course. Again, actions will adhere to the Academic Honesty policies of BCOE and UCR