Can I get some help?

Data Analytics

Name: ___________________________________________

Choose the right answer

1. In order to manage profitability, some companies focus on the cross-selling of offerings.

What is cross-selling?

A. Selling a customer one product and then charging them for another, more expensive

product.

B. Understanding what the customer wants to buy but convincing the customer to

purchase more expensive items.

C. Cross-selling is a technique that gets the marketing department into sales by giving

them the ability to sell products.

D. The process of selling a complementary or related product to the customer, along with

the original product.

2. While performing analysis on data that you have collected, you find the need to

combine Hadoop analytics and Tableau extensions in one easy to use site. You find

which of the following meets your needs?

A. OpenRefine

B. The R Project

C. Rapid Miner

D. Tableau Public

3. As a data analyst, you know that buying information on new customers from a third-

party source is:

A. not a viable option as these customers may have never heard of your company.

B. illegal and unethical.

C. an excellent way to expand an existing customer base.

D. not a viable option as the information that is available for purchase is often made up

and not trustworthy.

4. Emails have the ability to include more complex and often technical information,

making them an important channel for business to business support. Efficient data

extraction from emails and user forums is difficult. Which of the following is a

suggested manner to extract data from emails and user forums?

A. It can be obtained by subtracting numbers such as payroll and overhead expenses

from product income and sales.

B. It can be obtained by implementing structured key word and rule-based system to

identify and reroute text content based on certain criteria.

C. It can be obtained by accessing your local database, journals and ledgers.

D. It can be obtained by analyzing the amount of money spent on an ad campaign and

comparing it to that of your competitor.

5. Multichannel search engine optimization increases point-of-contact between

businesses and customers. What is meant by "multichannel search engine

optimization"?

A. Multichannel search engine optimization is the use of multiple channels including

email, Twitter, Facebook, LinkedIn, and offline channels as well, to advertise your

Website.

B. Multichannel search engine optimization is the ability that allows customers to

purchase your product from other Websites.

C. Multichannel search engine optimization is the process of bidding on and paying for

keywords to increase your exposure on search engine sites.

D. Multichannel search engine optimization is the use of keywords and phrases to

increase the relevancy of a company's Website to increase ranking.

6. You are in need of a program that can combine data from more than one source and

display that information as something new. Which of the following will you select to

accomplish the task?

A. OpenRefine

B. Graph-R

C. Rapid Miner

D. KNIME

7. Before embarking on building your own in-house data center, it is important that a

company evaluates:

A. the overall budget and needs to make sure it's an affordable option for your business.

B. the overall abilities of the remote hosting server to make sure it is large enough to

handle the business.

C. the overall feelings of the employees as far as their sensitivity to the data involved.

D. the overall sensitivity of the data involved and who has access to that data.

8. Which of the following statements is true of B2C CRM software?

A. Consistency of long-term sales through marketing is a primary objective.

B. Predicting customer behavior based on past buying patterns and business profitability.

C. Long-term management of a potential customer is a critical goal.

D. Automation of sales processes is stressed.

9. You are working as a senior data analyst in a large company. You must update the

customer database on a regular basis. You regularly need to write many queries for the

production database, which is very time-consuming. Which of the following will simplify

this task?

A. XML, JavaScript

B. Project R, Fortran

C. Postgres, MySQL

D. Hadoop, MongoDB

10. Which Magic Quadrant is occupied by organizations who demonstrate completeness

of vision, but have little ability to execute?

A. Leaders

B. Challengers

C. Niche Players

D. Visionaries

11. You are working as a data analyst for the BigBucket discount store. You are required

to gather the data about the stock of the products stored in the store's warehouse.

What kind of data will you need to collect such information?

A. "Who" data

B. "When" data

C. "What" data

D. "Where" data

12. There are three major components of a CRM system to be implemented: a well-

defined overall business strategy, a business intelligence system for monitoring

customer journeys and interactions, and which of the following?

A. A robust technology infrastructure

B. A combination Marketing/Sales Department

C. Buy in and participation from all departments and employees

D. A schedule for customer contact initiatives

13. OpenRefine allows a user to take disorganized data and transform it from one format

to another. Why is this important?

A. Databases require the data to be in one specific format.

B. Datasets cannot be extracted unless they are of the same format.

C. Not all browsers support all formats of text.

D. It allows a user to gather data regardless of formats.

14. Which of the following best describes cloud computing?

A. A distributed model where computers and other devices use shared resources, data

and information to benefit the customer.

B. A structured set of data held in a computer, especially one that is secured at a nearby

location.

C. An application that requires the use of versions of SQL and other structured query

languages from which to parse information.

D. A database management system that integrates a relational database with user-

friendly graphical user interfaces.

15. This data is largely unstructured, and, as the name suggests, may be context-specific

in nature. Examples include impulse purchases, and purchases driven by

environmental or market conditions, complaint details, and customer query details.

A. Facilities Management Data

B. Contextual Data

C. Behavioral Data

D. Descriptive Data

16. In the last several decades, after replacing the "time-sharing" services with "cloud-

based data storage applications," the main issue software providers faced, was to

prevent the data from being accessed unauthorized users. Cloud providers offer a

variety of new security solutions to deal with this problem. Which of the following are

included in the newest security techniques?

A. PGP and PKI

B. PKI and Improved VM

C. SSL and PGP

D. Improved VM and PGP

17. Which of the following activities is not included in the GDPR?

A. Processing of data for the marketing of a new product at the global level

B. Processing of domestic/household information for personal use

C. Processing of information during a data breach in an organization

D. Processing of data involved in journalism, literature and art

18. Allowing customers to buy products through Facebook, and to publicize their products

by sharing their purchases with their friends, is an example of:

A. Mobile device commerce

B. Multiple channel analysis

C. Multichannel optimization

D. SEO optimization

19. In regards to data analysis, what is required for a company to stay ahead of its

competition?

A. A large database and database administrator.

B. An awesome company statement.

C. A catchy slogan.

D. A clear understanding of its customers.

20. What type of software helps to detect errors, connectivity, security threats, and other

areas that need attention in regards to in-house data?

A. CRM

B. Security breach software

C. Event monitoring software

D. Relational DB

21. Your boss asks you if there are any reasons why a company would not move their

data to the cloud and you respond:

A. While the security on cloud servers it better than those of an in-house data facility,

they require too many policies for access to be documented by the acquiring

company.

B. The physical proximity of a local server will offer less delay and the split-second

advantage of the local server may be a competitive advantage to some companies.

C. It is much costlier to keep data in the cloud as several people need to be paid in order

to watch over the servers for inclement weather that may affect their power source.

D. Cloud servers and migration is so standardized that it is actually very difficult to move

data to the cloud and for this reason alone most companies do not do it.

22. Segmentation, the process of identifying and targeting customers, is done by which

of the following departments?

A. The sales department

B. The accounting department

C. The management department

D. The marketing department

23. You have been asked to join a newly forming company and they are trying to

determine if your data should be stored in-house or on the cloud. You reply:

A. Cloud-based architectures increase the required amount of start-up capital for

software deployment, and has increased ongoing maintenance costs.

B. Both architectures cost about the same to start up but cloud-based architectures

increase server costs.

C. Cloud-based architectures reduce the required amount of start-up capital for software

deployment, with corresponding reductions in ongoing maintenance costs.

D. Cloud-based architectures increase the required amount of start-up capital for

Software deployment, but has reductions in ongoing maintenance costs.

24. A chain of grocery stores is buying processed beef from many different farms to sell

to its customers. What kind of transaction framework exists between the grocery

stores and the farms?

A. Business-to-Customer

B. Business-to-Business

C. Customer-to-Business

D. Customer-to-Customer

25. Which management system involves designing, planning, executing and monitoring

the products, information and finances as they flow from the initial supplier of raw

materials all the way to final consumer?

A. Just-in-time system

B. Facilities management system

C. Supply chain management

D. Inventory management system

26. What does the underlined part of the sentence below mean?
"Hadoop provides for distributed processing with clusters of computers."

A. They have many computers and each one stores a specific type of data, so they

would have the best server for a company.

B. They have a framework of a collection of computers that store and process data.

C. They store multiple copies of a company's data on multiple servers.

D. They divide sensitive data into bundles of computers that process in parallel.

27. As the company data analyst, you are asking the management team to consider a

centralized database. Your argument for doing so is:

A. there will be lower data maintenance costs since the database will reside on only one

computer.

B. changes and updates to data will be reflected on the very next business day – thus

improving turn around.

C. it may not be convenient for all users, but it will be more convenient for you.

D. higher data accuracy and consistency as well as lower data redundancy at a single

location means holding one main record of the data. This increases the ability to

maintain data reliability.

28. During which phase of data analysis is data gathered from various sources?

A. Delivery Phase

B. Customer Service Phase

C. Discovery phase

D. Execution phase

29. Which of the following strategies will not contribute to the improvement of the SEO

optimization?

A. Analyzing multiple channels

B. Sending promotional e-mails

C. Creating relevant content

D. Adding call to action

30. Which information security standard is defined for organizations that handle branded

credit cards from major credit card companies including Visa, Mastercard, American

Express, Discover and JCB?

A. Sarbanes-Oxley

B. PCI DSS

C. FISMA

D. HIPAA

31. As a data analyst, your responsibility to the company is to identify areas for

improvement, mitigate risks, and better engagement with buyers. For this, you need

to have a proper understanding of the organization's customers. Which aspect of a

CRM system will help you accomplish this objective?

A. Improvement in the sales and marketing processes

B. Improvement in customer service

C. Segmentation of customers for better targeting

D. Gathering customer insights

32. What is meant by "Call To Action"?

A. Call to action means posting thoughts and comments on relevant social topics.

B. When a user sees an ad there should be a single button to click.

C. When a user visits the Website there should be a clear next step for them to do.

D. Call to action is a programming term by which a Website is "called" up when a button

is clicked

33. As a data analyst, you gather information on the operations and transactions related

to revenues, payrolls, expenses, costs and assets from which department?

A. The accounting department

B. The sales department

C. The IT department

D. The human resources department

34. The process of analyzing data involves which of the following?

A. Understanding, with a high degree of certainty, where the data came from

B. Discovering where money was spent in order to lower employee pay and benefits

C. Using databases to store fields for further use

D. Discovering trends and finding patterns that may illuminate new growth strategies for

Businesses

35. As a data analyst for a company, the sales department asks you to analyze some

data. You ask them how they would like you to model that. They ask you to define

modeling. You reply:

A. It is the development of a visual representation of the data you are presenting.

B. It is the order of data you are parsing.

C. It is another way of expressing the linking of data in a database.

D. It is a term in data reporting as to how the information will be labeled.

36. Which of the following data analyzing tools has the unique feature of creating

dashboards and story points?

A. Tableau Public

B. KNIME

C. OpenRefine

D. Google Fusion Tables

37. What is the problem domain?

A. To understand the objects that make up the finished application

B. To understand the root cause of a problem and provide solutions.

C. To define the implementation of the problem domain's solution

D. To define the attributes and operations of the problem domain's resources

38. When a senior analyst sees you working with Project R and asks you why you chose

to use it, you defend it by saying:

A. R provides for distributed processing with clusters of computers.

B. R has been enhanced to include object-relational features and non-relational features

such as XML.

C. R has a user-friendly graphical user interfaces that enable people without SQL skills to

access data.

D. R is a programming language useful in statistical computing and graphics.

39. One of the primary objectives of a data analyst is to steer the company towards

higher profits by providing business insights that are generated from:

A. rigorous data analysis.

B. the vision statement for the company.

C. a customer relationship management database.

D. accessing competitor's information.

40. The first thing you ask for as a newly hired data analyst is a Twitter page. When

upper management asks you why you want a Twitter page, you reply:

A. having a Twitter page allows us to follow what our competitors are trending.

B. having a Twitter page will allow us to gather information about our users and track

what Websites they visit.

C. having a Twitter page will allow us to access our competitor's page and make negative

comments on their page.

D. having a Twitter page allows us to publicize information about new products, styles

and discounts.

Explain.

1. What are the three characteristics of Big Data, and what are the main considerations

In processing Big Data?

2. The differences between BI and Data Science.

3. What are the key skill sets and behavioral characteristics of a data scientist?

4. What are the benefits of doing a pilot program before a full-scale rollout of a new analytical methodology.

5. What kinds of tools would be used in the following phases, and for which kinds of use

scenario?

a) Phase 2: Data preparation.

b) Phase 4: Model building