Enroll Here: Data Science 101 Cognitive Class Exam Quiz Answers
Introduction to Data Science 101
Data Science is a multidisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data. It combines aspects of mathematics, statistics, computer science, and domain expertise to interpret data for decision-making purposes. The goal of data science is to uncover patterns, trends, and correlations that can guide strategic decisions, optimize processes, or solve complex problems.
Key components of data science include:
- Data Collection: Gathering relevant data from various sources, which could be structured (e.g., databases) or unstructured (e.g., text documents, images).
- Data Cleaning and Preprocessing: Ensuring data quality by handling missing values, removing duplicates, and transforming data into a suitable format for analysis.
- Exploratory Data Analysis (EDA): Analyzing and visualizing data to summarize its main characteristics, often using statistical graphics and summary statistics.
- Modeling and Algorithms: Applying mathematical models, machine learning algorithms, and statistical methods to analyze data and make predictions or decisions.
- Interpretation and Communication: Interpreting the results of data analysis and communicating findings to stakeholders using data visualization, reports, or presentations.
- Ethics and Privacy: Considering ethical implications related to data collection, usage, and privacy, ensuring responsible and unbiased data science practices.
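The cleaning and exploratory steps listed above can be sketched in a few lines of plain Python. This is only an illustrative sketch: the records, the `customer_id`/`monthly_spend` columns, and the `clean` and `summarize` helpers are hypothetical and do not come from the course material.

```python
from statistics import mean, median

# Hypothetical raw records: (customer_id, monthly_spend); None marks a missing value.
raw_records = [
    (1, 120.0),
    (2, None),    # missing spend, dropped during cleaning
    (3, 80.0),
    (1, 120.0),   # exact duplicate of the first record
    (4, 100.0),
]

def clean(records):
    """Drop exact duplicates and rows with missing values, keeping first-seen order."""
    seen = set()
    cleaned = []
    for row in records:
        if row in seen or any(value is None for value in row):
            continue
        seen.add(row)
        cleaned.append(row)
    return cleaned

def summarize(records):
    """Return simple summary statistics for the spend column (a tiny EDA step)."""
    spend = [amount for _, amount in records]
    return {"count": len(spend), "mean": mean(spend), "median": median(spend)}

cleaned_records = clean(raw_records)
summary = summarize(cleaned_records)
print(summary)
```

In practice a library such as pandas would usually handle these steps; the standard library is used here only to keep the sketch self-contained.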
Data science finds applications across various industries such as healthcare, finance, marketing, and e-commerce, where large volumes of data can be leveraged to gain competitive advantages, improve operations, and innovate.
In essence, data science empowers organizations to harness the power of data to drive informed decision-making and gain deeper insights into their operations and customers.
Data Science 101 Cognitive Class Certification Answers
Module 1 – Defining Data Science Quiz Answers
Question 1: According to the McKinsey Global Institute report cited in the reading, the United States was projected to face a shortage of people with deep analytical skills by 2018. What is the size of this shortage?
- 140 000 – 190 000 people
- 120 000
- 20 000 – 50 000 people
- 800 000 – 900 000 people
- 3 – 6 million people
Question 2: What has changed from the past to make data science an in-demand occupation?
- There is now a lack of data
- Laws have changed
- Vast amounts of data are being created
- The advent of the free market
Question 3: What is the minimum education requirement to become a data scientist?
- You must have a Degree in Computer Science
- You must have a Master’s degree in Statistics
- You must have a Ph.D. in Machine learning
- The above are all helpful, but they are not necessary to become a data scientist; the educational backgrounds of data scientists vary
Module 2 – What do data science people do Quiz Answers
Question 1: What is structured data?
- Data that can be stored in a database or some tabular form
- Images and video
- Segments of text
- Audio data
Question 2: What does the following formula represent: Base fare + Time x (Time in cab)
- The possible formula used in regression analysis to determine the cost of a cab ride
- The formula used to build a recommender system for rating a cab service
- A possible formula used in regression analysis used to determine the price of a house
- What is the impact of lot size on housing price?
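As a rough illustration of how a formula of this shape can come out of a regression, the sketch below fits a straight line, fare = intercept + slope x minutes, by ordinary least squares. The trip data and the `fit_line` helper are hypothetical; the intercept plays the role of the base fare and the slope the per-minute charge.

```python
def fit_line(x, y):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Covariance-like and variance-like sums (both unnormalized).
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope

# Hypothetical trips: minutes spent in the cab and the fare charged.
minutes = [5, 10, 15, 20]
fares = [5.5, 8.0, 10.5, 13.0]

base_fare, per_minute = fit_line(minutes, fares)

# Predicted fare for a hypothetical 12-minute ride.
predicted = base_fare + per_minute * 12
print(base_fare, per_minute, predicted)
```

Because the made-up fares lie exactly on a line, the fit recovers the base fare and per-minute rate used to generate them; real trip data would scatter around the fitted line.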
Question 3: In the reading, what is an example of a question that can be put to a regression analysis?
- Do homes with brick exterior sell in rural areas?
- What is the impact of lot size on housing price?
- What are typical land taxes in a house sale?
- How much does a finished basement cost?
- How much should a house near a park cost?
Module 3 – Data Science in Business Quiz Answers
Question 1: Complete the following sentence that best explains why business needs to capture data: At the end of the day, for businesses, they know one thing, that if they are unable to measure something:
- they are unable to graph it
- they are unable to improve it
- they are unable to show compliance with tax laws
- they are unable to facilitate meetings between sales and marketing
Question 2: A business should never:
- delete data
- use machine learning
- document data well
- use PowerPoint to deliver a message
Question 3: In the reading above, what is the role of the data scientist?
- Email the stakeholders about the analysis
- Manage a team of analysts to create a model
- Develop the strategy to fix the problems in the findings
- Use the insights to build the narrative to communicate the findings
- Use the data to tell the story the CEO wants to tell
Module 4 – Use Cases for Data Science Quiz Answers
Question 1: What popular product is primarily based on data science:
- Smartphone
- Google search
- SpaceX’s rockets
- Tesla’s electric cars
Question 2: From the readings, the results section is where you present:
- The empirical findings
- R Squared
- The conclusion
- The contributors
- The methods used
Question 3: Complete the sentence: Predictions are useful ...
- they are always correct
- but you need lots of data
- but they must come from a complicated model
- they are always wrong
Module 5 – Data Science People Quiz Answers
Question 1: In the reading, how does the author define ‘data science’?
- Data science is a way of understanding things, of understanding the world
- Data science is a physical science like physics or chemistry
- Data science is some data and more science
- Data science is what data scientists do
- Data science is the art of uncovering the hidden secrets in data
Question 2: In the reading, what is admirable about Dr. Patil’s definition of a ‘data scientist’?
- His definition limits data science to activities involving machine learning
- His definition is only for people who program in Python
- His definition excludes statistics
- His definition is about weaving strong narratives into analytics
- His definition is inclusive of individuals from various academic backgrounds and training
Question 3: A good data scientist should:
- calculate confidence intervals
- be sceptical
- use complicated models
- only use big data
Data Science 101 Final Exam Answers
Question 1: In the reading, the output of a data mining exercise largely depends on:
- The engineer
- The programming language used
- The quality of the data
- The scope of the project
- The data scientist
Question 2: What has changed from the past to make data science an in-demand occupation?
- There is now a lack of data
- Laws have changed
- Vast amounts of data are being created
- The advent of the free market
Question 3: You develop an algorithm to predict rainy days. Your algorithm predicts a rainy day, but the prediction is false. What is this an example of?
- the R squared
- true negatives
- false positive
- generated values
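The terms in these options can be made concrete with a small sketch that sorts each prediction into one of the four outcome categories. The forecast data and the `count_outcomes` helper are hypothetical; `True` stands for "rainy day".

```python
def count_outcomes(predicted, actual):
    """Count true/false positives and negatives for boolean predictions."""
    counts = {"true_positive": 0, "false_positive": 0,
              "true_negative": 0, "false_negative": 0}
    for p, a in zip(predicted, actual):
        if p and a:
            counts["true_positive"] += 1    # predicted rain, it rained
        elif p and not a:
            counts["false_positive"] += 1   # predicted rain, it stayed dry
        elif not p and a:
            counts["false_negative"] += 1   # predicted dry, it rained
        else:
            counts["true_negative"] += 1    # predicted dry, it stayed dry
    return counts

# Hypothetical five-day forecast versus what actually happened.
predicted_rain = [True, True, False, False, True]
actual_rain = [True, False, False, True, False]

outcomes = count_outcomes(predicted_rain, actual_rain)
print(outcomes)
```

A day on which rain is predicted but does not occur lands in the `false_positive` count.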
Question 4: What is an example of a regression problem?
- Finding an object in an image
- Reducing the size of a dataset
- Predicting the price of a house using the square footage
- Finding clusters in the data
Question 5: What should be a prime concern for storing data?
- Data safety and privacy
- Hiring the right database manager
- The size of the files
- The physical location of the servers
- Hadoop clusters
Question 6: What is a good starting point for data mining?
- Data Visualization
- Writing a data dictionary
- Non-parametric methods
- Creating a relational database
- Machine learning
Question 7: Complete the following sentence that best explains why business needs to capture data: At the end of the day, for businesses, they know one thing, that if they are unable to measure something:
- they are unable to graph it
- they are unable to improve it
- they are unable to show compliance with tax laws
- they are unable to facilitate meetings between sales and marketing
Question 8: When establishing data mining goals, the accuracy expected from the results also influences the:
- The timelines for the project
- The scope of the project
- The costs
- The presentation
- Data scientist
Question 9: When processing data, what factor can lead to errors in data?
- Synchronizing the database
- Changing service providers
- Renaming variables
- Human error
- Overfitting
Question 10: A good data scientist should:
- calculate confidence intervals
- be sceptical
- use complicated models
- only use big data