Title | Coding Data - ASAL Information Technology 9626 Notes on Chapter 1 & 6 along with additional |
---|---|
Course | Ict a level |
Institution | The Chancellor, Masters, and Scholars of the University of Cambridge |
Pages | 17 |
File Size | 1.2 MB |
File Type | |
Total Downloads | 329 |
Total Views | 425 |
Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT: undefined function: 32 Warning: TT:...
IT (9626) Theory Notes
CodingData Whatiscodingofdata? Anysystemwillneedtohavedatacollected,enteredandstored.Onemethodofstoringdatais toassigncodestoit.Thisusuallymeansshorteningtheoriginaldatainanagreedmanner. Example1
Originaldata:Monday;Tuesday;Wednesday;Thursday;Friday Codeddata:Mon;Tues;Wed;Thurs;Fri Example2
Originaldata:XtraLarge;Large;Medium;Small Codeddata:XL;L;M;S
Reasonstocodedata: Itiscommonformuchofthedatacollectedandenteredintoasystemtohavesomedegreeof repetitionandredundancyi.e.extrainformationthatdoesnotaddanything.Andthispatternor repetitioniswhyitisefficienttocodethedatainsomeway.
Speedingupdataentry
Let'staketheexampleofcollectingdataaboutaperson'sgender.Peoplecanbeeither 'Male'or'Female'. Whilstthesetwooptionsareeasilyunderstoodbyall,imaginehavingtoentertheword 'Male'and'Female'intoasystemmanyhundredsoftimes.Itisawasteoftimeandeffort becausenoextrainformationiscontainedinthefullwordscomparedtoasingleletter.
Increaseaccuracyofdataentry
Theotherissueisthatnomatterhowaccurateapersonisatdataentry,atsomestage theyarelikelytomakeamistakeandmightspell'Male'as'Mail'or'Female'as'Femal'. Thistypeofmistakewillmakeanyresultsfromyourdatabasequeriesunreliable. Insteadofentering'Male'or'Female'youcouldcodethedataandinsteadenteritas'M' or'F'. Simplyhavingtoenteroneletterinsteadofapossiblesixwillspeedupdataentry.Itwill alsocutdownontheriskofmistakesbeingmadewithspelling.
IT (9626) Theory Notes
Useofvalidation
Whendatahasbeencodeditmakesiteasiertousevalidationtocheckifthedata enteredissensible.Withtheexampleabove,thepersonenteringthedatacouldstill makeamistakeandenter'S'insteadof'M'or'F'. Butifyousetupvalidationsothatthefieldwillonlyaccepttheletters'M'or'F'and absolutelynothingelsethenthatshouldfurthercutdownonpossiblemistakes.
Lessstoragespacerequired
Everyletterthatyoustoreinyourdatabasesystemwilltakeatleastonebyteofstorage. Ifyoustore'Female'as'F'thenyouwillsavefivebytesofstoragespace.Ifthesystem belongstoalargeorganization,theremightbemanythousandsormillionsofrecords stored‐simplybycodingonefield,ahugeamountofharddiskstoragecanbesaved.
Fastersearchingfordata
Thesmallerthesizeofyourdatabase,thefasteritwillbetosearchandproduceresults. Thusbycodingdataandkeepingthesizeofthesystemtoaminimumthemoretimeyou cansaveinthelongrunwhenrunningqueries.
Problemscausedbycodingdata: Whilstcodingdatacanbringmanybenefitsitcanalsoleadtosomeproblems.
Coarseningofdata
Thismeansthatduringthecodingprocesssomeofthesubtledetailsinthedataarelost. Thecolourscouldbeclassedas: Lightpink,paleblue,blackandmidblue However,whenthesecoloursarecodedtheymaybecome: PK(pink),B(blue),BK(black),BE(blue) Inthiscase,noallowancehasbeenmadeforshadesofcolours.Thefinedetailhasbeen lost.Thisiswhatismeantby'coarseningofdata'.
Codingcanobscurethemeaningofthedata
Areaderseeingthe'gender'dataasM/FisprettylikelytoknowthatitmeansMale/ Female.
IT (9626) Theory Notes
Butsomecodesaremoreobscure,forexamplethecountrycodeforSwitzerlandisCHE. Manypeoplemightnotrecognizewhatthiscoderepresents.
CodingofValueJudgments
Whenyouarecollectingdataaboutpeople'sopinionsitmightbedifficulttocodetheir answerswithaccuracy.Thecodetheygivewilldependontheirindividualopinion.Coding ofvaluejudgmentswillinevitablyleadtocoarseningofthedatasincetherewillbeawide rangeofopinionsthatcouldbeheldandonlyalimitednumberofcodesavailable.
Furtherexamplesofdatacoding: Inoureverydayliveswecomeacrossmanyexamplesofhowcodingisusedtorepresentdata. Herearejustafewmoreideas:
Countrynames
Thenameofacountrycanberepresentedbytwoletters.Forexample: GreatBritain‐GB France‐FR Canada–CA
Airlineflightcodes
Whenyouflyyoumayhavenoticedthatyourflightisgivenacode. Thiscodeconsistsoftwoletterstoidentifytheairlinethatyouareflyingwith.Theletters areusuallyfollowedbynumberstorepresentaparticularroute. Examples: Soforexample,aBritishAirwaysflightfromHeathrowtoOslomightbecodedasBA766. AflightoperatedbytheairlinecompanyEmirateswhichdepartfromDubaiandarrivesat HeathrowmightbecodedasEK029.
Advantagesofcoding: ‐Dataentrycanbefaster ‐Dataentrycanbemoreaccurate ‐Validationcanfurtherimproveaccuracy ‐Lessstoragespacerequired
IT (9626) Theory Notes
‐Fastersearchingfordata ‐Codeddatacanbemoresecureifpeopledon’tknowwhatitmeans
Disadvantagesofcoding: ‐Coarseningofdata ‐Meaningofdatacanbeobscured ‐Valuejudgmentsaredifficulttocode ‐Ifpeopledon’tknowthecodeitcanslowdowndataentry ‐Ifcodesarecomplicatedtheymightbeenteredincorrectly ‐Mightrunoutofcodecombinations
IT (9626) Theory Notes
DataEncryption WhatisEncryption? Encryptionmeanstoscrambledatainsuchawaythatonlysomeonewiththesecretcodeorkey canreadit.
Whyisitimportant? Today,encryptionisfarmoresophisticated,butitservesthesamepurpose‐topassasecret messagefromoneplacetoanotherwithoutanyoneelsebeingabletoreadit. Encryptionisextremelyimportantfore‐commerceasitallowsconfidentialinformationsuchas yourcreditcarddetailstobesentsafelytotheonlineshopyouarevisiting. Webbrowsersareabletoencryptyourpurchasedetailsusinganencryptionmethodcalled'SSL' (SecureSocketLayer).Youknowthisisswitchedonwhenasmallpadlockappearsinthebottom rightofthebrowser.SSLgetsswitchedonwhenyouvisita'secureserver'thathasanaddress thatstartswithHTTPS://(notethe'S').
Howdoesitwork? Encryptionworksbyscramblingtheoriginalmessagewithaverylargedigitalnumber(key).This isdoneusingadvancedmathematics.Commercial‐levelencryptionuses128bitkeythatisvery, veryhardtocrack.Thecomputerreceivingthemessageknowsthedigitalkeyandsoisableto workouttheoriginalmessage.
Problemswithencryption Therearethreeproblems; a)Itisslowerthannormalbrowsing.Ittakesawhileforthebrowsertodothemathsrequiredto scramblethemessageandanotherdelayontheserverthathastounscramblethedata. b)Onlineshopshavetohaveadigitalcertificatethatcontainspartofthekey.Thisisnotfree andhastobesuppliedbya'certificateauthority'. c)Itcanbeacomplicatedbusinessrunningasecureserver,soveryoften,ordinaryonlineshops willhireaspecialist'PaymentGateway'suchasWorldPayorPaypaltohandlepaymentsfor them.
IT (9626) Theory Notes
SymmetricvsAsymmetricencryption SymmetricEncryption Symmetricencryption’sjobistotakereadabledata,scrambleittomakeitunreadable (protectingitfrompryingeyeswhileit’sbeingstoredonadiskortransmittedoveranetwork), thenunscrambleitagainwhenit’sneeded.It’sgenerallyfast,andtherearelotsofgood encryptionmethodstochoosefrom.Themostimportantthingtorememberaboutsymmetric encryptionisthatbothsides—theencrypter,andthedecrypter—needaccesstothesamekey. AsymmetricEncryption Asymmetricencryptionalsotakesreadabledata,scramblesit,andunscramblesitagainatthe otherend,butadifferentkeyisusedforeachend.Encryptersuseapublickeytoscramblethe data,anddecryptersusethematchingprivate(secret)keyontheotherendtounscrambleit again. Thepublickeymeansthatitcanandshouldbepublished.(Thisiswhyasymmetricencryptionis alsooftenreferredtoaspublic‐keyencryption),buttheprivatekeymustbekeptprivate, protectedmuchlikethekeyforsymmetricencryption.
IT (9626) Theory Notes
Data,InformationandKnowledge Data Dataiscomprisedofthe basic,unrefined,and generallyunfiltered information
Information Information…ismuchmore refineddata…thathasevolved tothepointofbeingusefulfor someformofanalysis
Knowledge Knowledgeresidesinthe user…happensonlywhen humanexperienceandinsight isappliedtodataand information
Dataisunprocessedfactsandfigureswithoutanyaddedinterpretationoranalysis."Thepriceof crudeoilis$80perbarrel."
Informationisdatathathasbeeninterpretedsothatithasmeaningfortheuser."Thepriceofcrude oilhasrisenfrom$70to$80perbarrel"givesmeaningtothedataandsoissaidtobeinformation tosomeonewhotracksoilprices.
Knowledgeisacombinationofinformation,experienceandinsightthatmaybenefittheindividual ortheorganization."Whencrudeoilpricesgoupby$10perbarrel,it'slikelythatpetrolpriceswill riseby2pperliter"isknowledge.
IT (9626) Theory Notes
DataValidationandVerification Validation: Validationisonewayoftryingtoreducethenumberoferrorsinthedatabeingenteredinto yoursystem.Validationisperformedbythecomputeratthepointwhenyouenterdata.Itisthe processofcheckingthedataagainstthesetofvalidationrules. Validationaimstomakesurethatdataissensible,reasonable,completeandwithinacceptable boundaries. Datavalidationcanbeperformedbyusinganumberofvalidationchecks. RangeCheck Arangecheckiscommonlyusedwhenyouareworkingwithdatawhichconsistsofnumbers, currencyordates/times. Arangecheckallowsyoutosetsuitableboundaries:
Boundary
Description
Validation
Upperlimit
Themaximumpriceofanyiteminashopis £100
=0
Arange
toachieveaBgradeyoumustscorebetween 75%‐84%
>=75AND...