Jon Tupitza
School of Data Science
University of Virginia
By definition, "data science" must make meaning out of ever-growing pools of data. But the researcher quickly discovers that the hand examination of any data, while useful for granular analysis, is never adequate for large samples. To produce data science at scale, researchers must make effective use of workflows, pipelines, and processes to ingest, parse, and transform data with tools and automation.
This course will center on exposing students to contemporary pipelines for data analysis through a series of steadily escalating use cases. The course will begin with simple local database construction and evolve to cloud-based infrastructure such as Azure, AWS or Google Cloud. This progression will include learning a variety of systems for data collection, orchestration, transformation, consumption, and others as appropriate.
Lectures will be on Mondays most of the time. Occasionally, they may be recorded and delivered to watch on demand. This may happen as circumstances dictate (e.g., weather events). Class discussion, more specifics of the content, and Q&A will be held either in-person, via Zoom or our Discord discussion channel. Wednesdays and Fridays will typically be used for Labs and Hands-on application of what we read/lectured on from Monday in-person. Some content may be delivered either via Zoom or prerecorded video to reinforce the weeks' content and to demonstrate various configurations and programming techniques.
In-person office hours can be scheduled anytime after class on Mondays and Wednesdays, or at your request. Also, questions can be submitted via email either to the Professor (Jon Tupitza) or the TA (Mauricio Torres)). If you want Office Hours to talk more or connect on Careers, etc. ...anything then please email me so we can set up times to meet.
This course will emphasize hands-on experience in the creation, management, and consumption of various computational services that support the practice of data science. For the purposes of getting started I will assume that you fit roughly into at least one of three categories as you approach the subject:
Students will learn how to implement data science systems according to best practices, with an emphasis upon creating reusable and portable environments.
| Component | Weight | Notes / Due |
|---|---|---|
| Lectures, readings and other material | Weekly before class discussion | |
| Labs | 30% | Weekly |
| Engaged Discussions | 5% | |
| Quizzes | 15% | 3 or 4 quizzes |
| Data Projects | 50% | 2 projects |
Many interactions can occur via Email as thoughtfully formulated questions typically result in thoughtfully formulated, detailed answers. The Professor and TA will do their best to reply to all questions within a few hours. Individual office hours can be scheduled upon request; however, generally, they'll be held following class on Monday and Wednesday somewhere in the Data Science building.
Students in this course will be expected to use the following pieces of software on a weekly basis:
Create a free GitHub account
Install git locally
Install GitHub Desktop
Fork the Course GitHub Repository
Create a free Discord account / join server
The course will be conducted according to the UVA honor system. Programming assignments and exams are to be completed by the individual (no group collaborations). You will sign an honor pledge for all assignments, quizzes, and exams; more importantly, I expect you to adhere to the intent of the pledge. Cooperative efforts at understanding the material and technologies of the course are encouraged.
All suspected violations will be forwarded to the Honor Committee, and you may, at my discretion, receive an immediate zero on that assignment regardless of any action taken by the Honor Committee.
If you believe you may have committed an Honor Offense, you may wish to file a Conscientious Retraction by calling the Honor Offices at (434) 924-7602. For your retraction to be considered valid, it must, among other things, be filed with the Honor Committee before you are aware that the act in question has come under suspicion by anyone. More information can be found at http://honor.virginia.edu. Your Honor representatives can be found at: http://honor.virginia.edu/representatives.
It is my goal to create a learning experience that is as accessible as possible. If you anticipate any issues related to the format, materials, or requirements of this course, please meet with me outside of class so we can explore potential options. Students with disabilities may also wish to work with the Student Disability Access Center to discuss a range of options to removing barriers in this course, including official accommodations. Please visit their website for information on this process and to apply for services online: sdac.studenthealth.virginia.edu. If you have already been approved for accommodations through SDAC, please send me your accommodation letter and meet with me so we can develop an implementation plan together.
Lectures and other learning material will be made available throughout the semester, and all assignments and exams will be granted ample time for complmetion. Should you require accommodations through SDAC for extra time, please contact the instructor.
The University of Virginia is dedicated to providing a safe and equitable learning environment for all students. To that end, it is vital that you know two values that I and the University hold as critically important:
If you or someone you know has been affected by power-based personal violence, more information can be found on the UVA Sexual Violence website that describes reporting options and resources available - www.virginia.edu/sexualviolence.
As your professor and as a person, know that I care about you and your well-being and stand ready to provide support and resources as I can. As a faculty member, I am a responsible employee, which means that I am required by University policy and federal law to report what you tell me to the University's Title IX Coordinator. The Title IX Coordinator's job is to ensure that the reporting student receives the resources and support that they need, while also reviewing the information presented to determine whether further action is necessary to ensure survivor safety and the safety of the University community. If you wish to report something that you have seen, you can do so at the Just Report It portal (http://justreportit.virginia.edu/). The worst possible situation would be for you or your friend to remain silent when there are so many here willing and able to help.
It is the University's long-standing policy and practice to reasonably accommodate students so that they do not experience an adverse academic consequence when sincerely held religious beliefs or observances conflict with academic requirements.
Students who wish to request academic accommodation for a religious observance should submit their request in writing directly to me as far in advance as possible. Students who have questions or concerns about academic accommodations for religious observance or religious beliefs may contact the University’s Office for Equal Opportunity and Civil Rights (EOCR) at UVAEOCR@virginia.edu or 434-924-3200.
I will preserve weekly class discussions in Discord for the duration of the semester. Because these discussions include fellow students, you and they may be personally identifiable in these discussion logs. These logs may only be used for the purpose of individual or group study with other students enrolled in this class during this semester. You may not distribute them in whole or in part through any other platform or to any persons outside of this class, nor may you make your own copies of class discussions unless written permission has been obtained from the Instructor and all participants in the class have been informed. For additional details, please see Provost Policy 008.
If you have a question about any aspect of this course - a particular topic, method, concept, etc. - please contact the TAs or me via Discord or email. It is often the case that you're not the only one having trouble understanding it!