Can I get some help?
Data Analytics
Name: ___________________________________________
Choose the right answer
1. In order to manage profitability, some companies focus on the cross-selling of offerings.
What is cross-selling?
A. Selling a customer one product and then charging them for another, more expensive
product.
B. Understanding what the customer wants to buy but convincing the customer to
purchase more expensive items.
C. Cross-selling is a technique that gets the marketing department into sales by giving
them the ability to sell products.
D. The process of selling a complementary or related product to the customer, along with
the original product.
2. While performing analysis on data that you have collected, you find the need to
combine Hadoop analytics and Tableau extensions in one easy to use site. You find
which of the following meets your needs?
A. OpenRefine
B. The R Project
C. Rapid Miner
D. Tableau Public
3. As a data analyst, you know that buying information on new customers from a third-
party source is:
A. not a viable option as these customers may have never heard of your company.
B. illegal and unethical.
C. an excellent way to expand an existing customer base.
D. not a viable option as the information that is available for purchase is often made up
and not trustworthy.
4. Emails have the ability to include more complex and often technical information,
making them an important channel for business to business support. Efficient data
extraction from emails and user forums is difficult. Which of the following is a
suggested manner to extract data from emails and user forums?
A. It can be obtained by subtracting numbers such as payroll and overhead expenses
from product income and sales.
B. It can be obtained by implementing structured key word and rule-based system to
identify and reroute text content based on certain criteria.
C. It can be obtained by accessing your local database, journals and ledgers.
D. It can be obtained by analyzing the amount of money spent on an ad campaign and
comparing it to that of your competitor.
5. Multichannel search engine optimization increases point-of-contact between
businesses and customers. What is meant by "multichannel search engine
optimization"?
A. Multichannel search engine optimization is the use of multiple channels including
email, Twitter, Facebook, LinkedIn, and offline channels as well, to advertise your
Website.
B. Multichannel search engine optimization is the ability that allows customers to
purchase your product from other Websites.
C. Multichannel search engine optimization is the process of bidding on and paying for
keywords to increase your exposure on search engine sites.
D. Multichannel search engine optimization is the use of keywords and phrases to
increase the relevancy of a company's Website to increase ranking.
6. You are in need of a program that can combine data from more than one source and
display that information as something new. Which of the following will you select to
accomplish the task?
A. OpenRefine
B. Graph-R
C. Rapid Miner
D. KNIME
7. Before embarking on building your own in-house data center, it is important that a
company evaluates:
A. the overall budget and needs to make sure it's an affordable option for your business.
B. the overall abilities of the remote hosting server to make sure it is large enough to
handle the business.
C. the overall feelings of the employees as far as their sensitivity to the data involved.
D. the overall sensitivity of the data involved and who has access to that data.
8. Which of the following statements is true of B2C CRM software?
A. Consistency of long-term sales through marketing is a primary objective.
B. Predicting customer behavior based on past buying patterns and business profitability.
C. Long-term management of a potential customer is a critical goal.
D. Automation of sales processes is stressed.
9. You are working as a senior data analyst in a large company. You must update the
customer database on a regular basis. You regularly need to write many queries for the
production database, which is very time-consuming. Which of the following will simplify
this task?
A. XML, JavaScript
B. Project R, Fortran
C. Postgres, MySQL
D. Hadoop, MongoDB
10. Which Magic Quadrant is occupied by organizations who demonstrate completeness
of vision, but have little ability to execute?
A. Leaders
B. Challengers
C. Niche Players
D. Visionaries
11. You are working as a data analyst for the BigBucket discount store. You are required
to gather the data about the stock of the products stored in the store's warehouse.
What kind of data will you need to collect such information?
A. "Who" data
B. "When" data
C. "What" data
D. "Where" data
12. There are three major components of a CRM system to be implemented: a well-
defined overall business strategy, a business intelligence system for monitoring
customer journeys and interactions, and which of the following?
A. A robust technology infrastructure
B. A combination Marketing/Sales Department
C. Buy in and participation from all departments and employees
D. A schedule for customer contact initiatives
13. OpenRefine allows a user to take disorganized data and transform it from one format
to another. Why is this important?
A. Databases require the data to be in one specific format.
B. Datasets cannot be extracted unless they are of the same format.
C. Not all browsers support all formats of text.
D. It allows a user to gather data regardless of formats.
14. Which of the following best describes cloud computing?
A. A distributed model where computers and other devices use shared resources, data
and information to benefit the customer.
B. A structured set of data held in a computer, especially one that is secured at a nearby
location.
C. An application that requires the use of versions of SQL and other structured query
languages from which to parse information.
D. A database management system that integrates a relational database with user-
friendly graphical user interfaces.
15. This data is largely unstructured, and, as the name suggests, may be context-specific
in nature. Examples include impulse purchases, and purchases driven by
environmental or market conditions, complaint details, and customer query details.
A. Facilities Management Data
B. Contextual Data
C. Behavioral Data
D. Descriptive Data
16. In the last several decades, after replacing the "time-sharing" services with "cloud-
based data storage applications," the main issue software providers faced, was to
prevent the data from being accessed unauthorized users. Cloud providers offer a
variety of new security solutions to deal with this problem. Which of the following are
included in the newest security techniques?
A. PGP and PKI
B. PKI and Improved VM
C. SSL and PGP
D. Improved VM and PGP
17. Which of the following activities is not included in the GDPR?
A. Processing of data for the marketing of a new product at the global level
B. Processing of domestic/household information for personal use
C. Processing of information during a data breach in an organization
D. Processing of data involved in journalism, literature and art
18. Allowing customers to buy products through Facebook, and to publicize their products
by sharing their purchases with their friends, is an example of:
A. Mobile device commerce
B. Multiple channel analysis
C. Multichannel optimization
D. SEO optimization
19. In regards to data analysis, what is required for a company to stay ahead of its
competition?
A. A large database and database administrator.
B. An awesome company statement.
C. A catchy slogan.
D. A clear understanding of its customers.
20. What type of software helps to detect errors, connectivity, security threats, and other
areas that need attention in regards to in-house data?
A. CRM
B. Security breach software
C. Event monitoring software
D. Relational DB
21. Your boss asks you if there are any reasons why a company would not move their
data to the cloud and you respond:
A. While the security on cloud servers it better than those of an in-house data facility,
they require too many policies for access to be documented by the acquiring
company.
B. The physical proximity of a local server will offer less delay and the split-second
advantage of the local server may be a competitive advantage to some companies.
C. It is much costlier to keep data in the cloud as several people need to be paid in order
to watch over the servers for inclement weather that may affect their power source.
D. Cloud servers and migration is so standardized that it is actually very difficult to move
data to the cloud and for this reason alone most companies do not do it.
22. Segmentation, the process of identifying and targeting customers, is done by which
of the following departments?
A. The sales department
B. The accounting department
C. The management department
D. The marketing department
23. You have been asked to join a newly forming company and they are trying to
determine if your data should be stored in-house or on the cloud. You reply:
A. Cloud-based architectures increase the required amount of start-up capital for
software deployment, and has increased ongoing maintenance costs.
B. Both architectures cost about the same to start up but cloud-based architectures
increase server costs.
C. Cloud-based architectures reduce the required amount of start-up capital for software
deployment, with corresponding reductions in ongoing maintenance costs.
D. Cloud-based architectures increase the required amount of start-up capital for
Software deployment, but has reductions in ongoing maintenance costs.
24. A chain of grocery stores is buying processed beef from many different farms to sell
to its customers. What kind of transaction framework exists between the grocery
stores and the farms?
A. Business-to-Customer
B. Business-to-Business
C. Customer-to-Business
D. Customer-to-Customer
25. Which management system involves designing, planning, executing and monitoring
the products, information and finances as they flow from the initial supplier of raw
materials all the way to final consumer?
A. Just-in-time system
B. Facilities management system
C. Supply chain management
D. Inventory management system
26. What does the underlined part of the sentence below mean?
"Hadoop provides for distributed processing with clusters of computers."
A. They have many computers and each one stores a specific type of data, so they
would have the best server for a company.
B. They have a framework of a collection of computers that store and process data.
C. They store multiple copies of a company's data on multiple servers.
D. They divide sensitive data into bundles of computers that process in parallel.
27. As the company data analyst, you are asking the management team to consider a
centralized database. Your argument for doing so is:
A. there will be lower data maintenance costs since the database will reside on only one
computer.
B. changes and updates to data will be reflected on the very next business day – thus
improving turn around.
C. it may not be convenient for all users, but it will be more convenient for you.
D. higher data accuracy and consistency as well as lower data redundancy at a single
location means holding one main record of the data. This increases the ability to
maintain data reliability.
28. During which phase of data analysis is data gathered from various sources?
A. Delivery Phase
B. Customer Service Phase
C. Discovery phase
D. Execution phase
29. Which of the following strategies will not contribute to the improvement of the SEO
optimization?
A. Analyzing multiple channels
B. Sending promotional e-mails
C. Creating relevant content
D. Adding call to action
30. Which information security standard is defined for organizations that handle branded
credit cards from major credit card companies including Visa, Mastercard, American
Express, Discover and JCB?
A. Sarbanes-Oxley
B. PCI DSS
C. FISMA
D. HIPAA
31. As a data analyst, your responsibility to the company is to identify areas for
improvement, mitigate risks, and better engagement with buyers. For this, you need
to have a proper understanding of the organization's customers. Which aspect of a
CRM system will help you accomplish this objective?
A. Improvement in the sales and marketing processes
B. Improvement in customer service
C. Segmentation of customers for better targeting
D. Gathering customer insights
32. What is meant by "Call To Action"?
A. Call to action means posting thoughts and comments on relevant social topics.
B. When a user sees an ad there should be a single button to click.
C. When a user visits the Website there should be a clear next step for them to do.
D. Call to action is a programming term by which a Website is "called" up when a button
is clicked
33. As a data analyst, you gather information on the operations and transactions related
to revenues, payrolls, expenses, costs and assets from which department?
A. The accounting department
B. The sales department
C. The IT department
D. The human resources department
34. The process of analyzing data involves which of the following?
A. Understanding, with a high degree of certainty, where the data came from
B. Discovering where money was spent in order to lower employee pay and benefits
C. Using databases to store fields for further use
D. Discovering trends and finding patterns that may illuminate new growth strategies for
Businesses
35. As a data analyst for a company, the sales department asks you to analyze some
data. You ask them how they would like you to model that. They ask you to define
modeling. You reply:
A. It is the development of a visual representation of the data you are presenting.
B. It is the order of data you are parsing.
C. It is another way of expressing the linking of data in a database.
D. It is a term in data reporting as to how the information will be labeled.
36. Which of the following data analyzing tools has the unique feature of creating
dashboards and story points?
A. Tableau Public
B. KNIME
C. OpenRefine
D. Google Fusion Tables
37. What is the problem domain?
A. To understand the objects that make up the finished application
B. To understand the root cause of a problem and provide solutions.
C. To define the implementation of the problem domain's solution
D. To define the attributes and operations of the problem domain's resources
38. When a senior analyst sees you working with Project R and asks you why you chose
to use it, you defend it by saying:
A. R provides for distributed processing with clusters of computers.
B. R has been enhanced to include object-relational features and non-relational features
such as XML.
C. R has a user-friendly graphical user interfaces that enable people without SQL skills to
access data.
D. R is a programming language useful in statistical computing and graphics.
39. One of the primary objectives of a data analyst is to steer the company towards
higher profits by providing business insights that are generated from:
A. rigorous data analysis.
B. the vision statement for the company.
C. a customer relationship management database.
D. accessing competitor's information.
40. The first thing you ask for as a newly hired data analyst is a Twitter page. When
upper management asks you why you want a Twitter page, you reply:
A. having a Twitter page allows us to follow what our competitors are trending.
B. having a Twitter page will allow us to gather information about our users and track
what Websites they visit.
C. having a Twitter page will allow us to access our competitor's page and make negative
comments on their page.
D. having a Twitter page allows us to publicize information about new products, styles
and discounts.
Explain.
1. What are the three characteristics of Big Data, and what are the main considerations
In processing Big Data?
2. The differences between BI and Data Science.
3. What are the key skill sets and behavioral characteristics of a data scientist?
4. What are the benefits of doing a pilot program before a full-scale rollout of a new analytical methodology.
5. What kinds of tools would be used in the following phases, and for which kinds of use
scenario?
a) Phase 2: Data preparation.
b) Phase 4: Model building