r exams SS2021 and retake exam 2021 PDF

Title r exams SS2021 and retake exam 2021
Author zhaoxuan guan
Course Data analysis and visualization in R
Institution Technische Universität München
Pages 20


Description

Computational Molecular Medicine
Department of Informatics
Technical University of Munich

Compliance to the code of conduct: I hereby assure that I solve and submit this exam myself under my own name by only using the allowed tools listed below.

Signature or full name if no pen input available

Personal sticker S5078

Data Analysis and Visualization in R

Exam: IN2339 / Final Exam
Examiner: Prof. Dr. Julien Gagneur
Date: Friday 5th March, 2021
Time: 11:00 – 12:30

– Page 1 / 20 –

IN-daviz-5-20210305-E5078-01

Working instructions
• This exam consists of 20 pages with a total of 14 problems. Please make sure now that you received a complete copy of the exam.
• The total amount of achievable credits in this exam is 27 credits.
• Detaching pages from the exam is prohibited.
• Allowed resources:
  – Slides, exercises and notes from the lectures
  – Other content from the internet
  – You are not allowed to communicate with anyone except with the examiners during the exam and during the oral questioning (one hour following the written exam). Hence, you can consult a forum for an existing post, but you are not allowed to post any question or result on any communication media (e.g. forum, WhatsApp group, social media, etc.) up to one hour following the exam.
  – You should answer the questions using the knowledge (data analysis and statistical methods) and R packages listed below. In this respect, consulting other content from the internet is probably a bad idea, as it may hint towards methods and code that were not taught.
  – The R libraries you can use are: caret, data.table, dplyr, dslabs, fossil, GGally, ggbeeswarm, ggplot2, ggrepel, magrittr, mclust, mlbench, pheatmap, plotROC, randomForest and tidyr. Load them in your R session by running the following code:
    library(caret); library(data.table); library(dplyr); library(dslabs); library(fossil); library(GGally); library(ggbeeswarm); library(ggplot2); library(ggrepel); library(magrittr); library(mclust); library(mlbench); library(pheatmap); library(plotROC); library(randomForest); library(tidyr)
    Be sure to have them already installed using install.packages('data.table'), and so on for each of those libraries.
• Filling the exam
  – Download the pdf to your computer and edit it there. Save your file frequently, e.g. after each question. Make sure that your pdf editor supports native text input fields. Check the list of pdf editors in this document: https://tumexam.de/static/handreichung_submissions_students.pdf
  – Do not work with the pdf loaded in a web browser as it does not save your edits.
  – Answer by typing, no handwriting or sketching. Write into the solution box inside the pdf document.
  – R outputs (e.g. tables or plots) are not required unless they are the answer to the question. Simply copy the executed code from R to the solution box in the exam. In case the question asks you to provide a certain numeric value, write that requested value in a plain English sentence underneath the code. For instance: "The P-value is 0.05.". If the question states "justify", provide a short justification in plain English. In this case, only providing the code is not enough.
  – We do not accept any additional files.
  – Some questions are worth one point, others two points. No half-points will be given.
• Interactions with examiners and oral questioning
  – The examiners are reachable during the exam and for the oral questioning via a zoom meeting.
  – The zoom meeting will be open from 10.45 to 13.45.
  – The written exam will start at 11.00 sharp, at which point the exam will be downloadable from TUMexam.
  – Your exam should be submitted for upload back to TUMexam by 12:35 sharp.
  – Do not switch on your microphone or your camera, and do not share your screen during the written exam.
  – You should primarily use the zoom conference chat with direct messages to the examiner if you have any question during the written exam.
  – If your zoom connection breaks during the written exam, first try to reconnect. If it keeps failing, you can post questions at [email protected]
  – The oral questioning starts immediately after the written exam, 12.35-13.35.


  – The purpose of the oral questioning is to ensure your identity and that you did the exam by yourself. You should be able to explain why you gave a particular answer to a question (i.e. what your reasoning was). It does not matter whether your answer to the question is right or wrong. We only want to make sure that it comes from you. In the oral questioning you will no longer be allowed to consult any document.
  – You are not allowed to communicate with anyone except the examiners during the entire hour reserved for oral questioning, even if you have already been orally questioned.
  – You must be reachable at all times by videoconference during the oral questioning hour. If your zoom connection breaks, immediately inform us at [email protected] and propose an alternative videoconference channel (preferably WhatsApp). We will not store your phone number after the oral questioning.
  – For the oral questioning, switch on the camera and microphone. Give us your matriculation number, first name, and last name as they appear in TUMonline by copy-pasting this information in the chat window. Show your student ID and face. We will then ask you a few questions about your submission, to verify that you wrote it yourself.

Left room from ____ to ____ / Early submission at ____

Problem 1 (2 credits)

Question Nr. OUFBGYZ4K7SLD5Z619M4WBVYO3P6I

[Figure: scatter plot with x and y axes each running from 0 to 12, showing the data points A, B, C, D, E and the centroids X1, X2.]

You are provided with this plot which represents a random initialization of a k-Means algorithm with k=2. X1, X2 are the randomly positioned centroids and A to E are the points of the 2-dimensional dataset. Report the x and y coordinates of the new positions of the centroids after the first iteration. Use the Euclidean distance. No standardization of the variables shall be applied. Justify your answer.
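
As an illustration of what one k-Means iteration does, here is a minimal base-R sketch. The coordinates, the point names P1 to P5 and the centroid names C1, C2 are hypothetical and are not read off the exam plot. The sketch assumes every centroid ends up with at least one point assigned to it.

```r
# Hypothetical coordinates for illustration only; NOT the values from the exam plot.
pts <- matrix(c(1, 1,
                2, 1,
                1, 2,
                8, 9,
                9, 8),
              ncol = 2, byrow = TRUE,
              dimnames = list(paste0("P", 1:5), c("x", "y")))
centroids <- matrix(c(0, 0,
                      10, 10),
                    ncol = 2, byrow = TRUE,
                    dimnames = list(c("C1", "C2"), c("x", "y")))

# Assignment step: squared Euclidean distance of every point to every centroid
# (rows = points, columns = centroids), then pick the nearest centroid.
dist_sq <- sapply(seq_len(nrow(centroids)), function(k) {
  rowSums(sweep(pts, 2, centroids[k, ])^2)
})
assignment <- apply(dist_sq, 1, which.min)

# Update step: each centroid moves to the mean of the points assigned to it.
new_centroids <- t(sapply(seq_len(nrow(centroids)), function(k) {
  colMeans(pts[assignment == k, , drop = FALSE])
}))
rownames(new_centroids) <- rownames(centroids)
print(new_centroids)
```

With these made-up points, P1 to P3 are closest to C1 and P4, P5 to C2, so C1 moves to (4/3, 4/3) and C2 to (8.5, 8.5). Squared distances are enough for the assignment step because taking the square root does not change which centroid is nearest.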

IN-daviz-5-20210305-E5078-04

– Page 4 / 20 –

Page empty

Problem 2 (2 credits)

Question Nr. 8NTB4VZK1U62MSXHY3LC

Which operation has been applied to table ‘dt_A’ and table ‘dt_B’ to return the result table? Justify your answer. Write one line of R code using the library data.table that would produce the result table stored in a variable ‘dt_result’ assuming a data table ‘dt_A’ and a data table ‘dt_B’ in the working environment.

Table dt_A:

id  CreditCard        CCV  type
20  4247141693495656  656  i
17  7953147217751656  437  i
10  9589966499538341  426  i
1   9396859962839891  486  i
13  4133148738193359  578  l
6   4339355365311879  498  r
11  4994262737933115  832  l
7   2635673538455821  251  i
5   7366664949734465  282  o
24  9236461982785987  418  r

Table dt_B:

firstName  customer_id
Cheyane    13
Marcos     6
Aaron      11
Rachel     7
Meeso      5
Alan       24
Lacrae     2
Jessica    23
Thomas     18
Amelia     14
Shanelle   12

lastName (some names span two words): Castaneda Recinos Peraza Santistevan Jones Hwang Vigil Mccoy Serna Montano Maldonado Baca Walters

Table dt_result:

id  CreditCard        CCV  type  firstName
5   7366664949734465  282  o     Meeso
6   4339355365311879  498  r     Marcos
7   2635673538455821  251  l     Rachel
11  4994262737933115  832  l     Aaron
13  4133148738193359  578  l     Cheyane
24  9236461982785987  418  r     Alan

lastName (some names span two words): Hwang Recinos Peraza Jones Santistevan Castaneda Vigil
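
For orientation, the sketch below shows how the main join types behave when two data tables are merged on differently named key columns. The tables `left` and `right` and their columns are hypothetical toy data, not the exam tables, and the sketch is not meant as the model solution.

```r
library(data.table)

# Hypothetical toy tables for illustration only; NOT the exam data.
left  <- data.table(id = c(1, 2, 3), value = c("a", "b", "c"))
right <- data.table(key_id = c(2, 3, 4), label = c("x", "y", "z"))

# Inner join: keeps only keys present in both tables (ids 2 and 3).
inner <- merge(left, right, by.x = "id", by.y = "key_id")

# Left join: keeps every row of `left`; unmatched rows get NA in `label`.
left_join <- merge(left, right, by.x = "id", by.y = "key_id", all.x = TRUE)

# Full outer join: keeps keys from either table (ids 1 to 4).
full <- merge(left, right, by.x = "id", by.y = "key_id", all = TRUE)

print(inner)
print(left_join)
print(full)
```

In each case the merged table is sorted by the key column, and the key takes its name from the first table (here `id`). Comparing which keys survive in a result table against the keys of the two inputs is one way to tell the join types apart.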

Problem 3

a) Question Nr. SZKR81O20LCDIJ5V15

The Ozone dataset from the mlbench package contains ozone and weather records for different days in a year. The unique id is (“Month”, “DayOfMonth”, “DayOfWeek”).

data(Ozone)
names(Ozone)...

