What is a data value that is numerically distant from most of the other data points in a set of data?

Terms in this set (27)

big data

a collection of large, complex data sets, including structured and unstructured data which cannot be analyzed using traditional database methods

distributed computing

processes and manages algorithms across many machines in a computing environment

virtualization

the creation of a virtual (rather than actual) version of computing resources, such as an operating system, a server, a storage device, or network resources

data mining

the process of analyzing data to extract information not offered by the raw data alone

data profiling

the process of collecting statistics and information about data in an existing source
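
As a rough illustration of data profiling, the sketch below collects a few basic statistics about a single column. The function name `profile_column`, the sample `ages` list, and the use of `None` to mark a missing value are all assumptions made for this example, not part of the original definition.

```python
from statistics import mean

def profile_column(values):
    """Collect simple statistics about one column; None marks a missing value."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),                      # total rows, including missing
        "missing": len(values) - len(present),     # rows with no value
        "distinct": len(set(present)),             # number of unique values
        "min": min(present) if present else None,
        "max": max(present) if present else None,
        "mean": mean(present) if present else None,
    }

# Hypothetical sample column with one missing entry.
ages = [34, 41, None, 29, 41, 57]
profile = profile_column(ages)
print(profile)
```

Real profiling tools typically report many more measures (value patterns, data types, key candidates); this only shows the general idea.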

data replication

the process of sharing information to ensure consistency between multiple data sources

recommendation engine

a data mining algorithm that analyzes a customer's purchases and actions on a website and then uses the data to recommend complementary products

estimation analysis

determines values for an unknown continuous variable's behavior or estimated future value

affinity grouping analysis

reveals the relationship between variables along with the nature and frequency of the relationships

market basket analysis

evaluates such items as websites and checkout scanner information to detect customers' buying behavior and predict future behavior by identifying affinities among customers' choices of products and services
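
One simple way to picture market basket analysis is to count how often pairs of products appear in the same basket. The sketch below does that and reports each pair's support (the share of baskets containing both items). The name `pair_support` and the sample baskets are hypothetical, and real systems usually add further measures such as confidence and lift.

```python
from collections import Counter
from itertools import combinations

def pair_support(baskets):
    """Return the fraction of baskets in which each pair of items appears together."""
    counts = Counter()
    for basket in baskets:
        # Sort and de-duplicate so each pair is counted once per basket,
        # always in the same (alphabetical) order.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    total = len(baskets)
    return {pair: count / total for pair, count in counts.items()}

# Hypothetical checkout data.
baskets = [
    ["bread", "butter", "milk"],
    ["bread", "butter"],
    ["milk", "cereal"],
    ["bread", "butter", "cereal"],
]
support = pair_support(baskets)
top_pair = max(support, key=support.get)
print(top_pair, support[top_pair])
```

Here bread and butter show up together in three of the four baskets, which is the kind of affinity a retailer might act on.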

cluster analysis

a technique used to divide an information set into mutually exclusive groups such that the members of each group are as close as possible to one another and the different groups are as far apart as possible
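
The definition does not name a specific algorithm. As one common example, the sketch below applies k-means to one-dimensional data: each point joins the group with the nearest center, then each center moves to the average of its group. The function names, starting centers, and sample points are assumptions chosen for illustration.

```python
def assign(points, centers):
    """Place each point in the group of its nearest center."""
    groups = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
        groups[nearest].append(p)
    return groups

def kmeans_1d(points, centers, iterations=10):
    """Run a fixed number of k-means rounds on 1-D data."""
    centers = list(centers)
    for _ in range(iterations):
        groups = assign(points, centers)
        # Move each center to the mean of its group; keep it if the group is empty.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, assign(points, centers)

# Hypothetical data with two obvious groups.
points = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0]
centers, groups = kmeans_1d(points, centers=[0.0, 5.0])
print(centers, groups)
```

Every point lands in exactly one group, which matches the "mutually exclusive groups" wording of the definition.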

classification analysis

the process of organizing data into categories or groups for its most effective and efficient use

data mining tools

use a variety of techniques to find patterns and relationships in large volumes of information that predict future behavior and guide decision making

prediction

a statement about what will happen or might happen in the future

cube

the common term for the representation of multidimensional information

algorithms

mathematical formulas placed in software that perform an analysis on a data set

analytics

the science of fact-based decision making

anomaly detection

the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set

outlier

a data value that is numerically distant from most of the other data points in a set of data
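
The definition leaves "numerically distant" open. One widely used rule of thumb, shown here as a sketch rather than the only possible test, flags values that lie more than 1.5 times the interquartile range (IQR) below the first quartile or above the third. The function name `iqr_outliers`, the multiplier default, and the sample data are assumptions for this example.

```python
from statistics import quantiles

def iqr_outliers(data, k=1.5):
    """Return values lying more than k * IQR outside the first or third quartile."""
    q1, _, q3 = quantiles(data, n=4)   # quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [x for x in data if x < lower or x > upper]

# Hypothetical readings; 95 sits far from the rest.
readings = [10, 12, 11, 13, 12, 95]
outliers = iqr_outliers(readings)
print(outliers)
```

With so few points the quartiles are themselves pulled by the extreme value, so on small samples the result should be read as a hint rather than a verdict.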

fast data

the application of big data analytics to smaller data sets in near-real or real time in order to solve a problem or create business value

data scientist

extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information

infographics

present the results of data analysis, displaying the patterns, relationships, and trends in a graphical format

data artist

a business analytics specialist who uses visual tools to help people understand complex data

analysis paralysis

occurs when the user goes into an emotional state of overanalyzing a situation so that a decision or action is never taken, in effect paralyzing the outcome

data visualization

describes technologies that allow users to see or visualize data to transform information into a business perspective

data visualization tools

move beyond Excel graphs and charts into sophisticated analysis techniques such as controls, instruments, maps, time-series graphs, and more

business intelligence dashboards

track corporate metrics such as critical success factors and key performance indicators and include advanced capabilities such as interactive controls, allowing users to manipulate data for analysis

What is fast data?

Fast data is the application of big data analytics to smaller data sets in near-real or real time in order to solve a problem or create business value.

What is the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set?

Anomaly detection is the process of identifying unexpected items or events in data sets, which differ from the norm.

Which term names the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set?

Anomaly Detection: The process of identifying rare or unexpected items or events in a dataset that do not conform to other items in the dataset and do not match a projected pattern or expected behavior.

Which of the following is the correct definition of correlation analysis?

Correlation analysis is a statistical method used to discover whether there is a relationship between two variables or data sets, and how strong that relationship may be.
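
The strength of a linear relationship is often summarized with the Pearson correlation coefficient, which ranges from -1 (perfect inverse relationship) through 0 (no linear relationship) to +1 (perfect direct relationship). The sketch below computes it by hand; the function name `pearson_r` and the sample figures are assumptions for illustration, and other correlation measures exist.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Sum of products of deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# Hypothetical data: sales rise in step with ad spend, returns fall.
ad_spend = [1, 2, 3, 4, 5]
sales = [2, 4, 6, 8, 10]
returns = [10, 8, 6, 4, 2]
r_sales = pearson_r(ad_spend, sales)
r_returns = pearson_r(ad_spend, returns)
print(r_sales, r_returns)
```

A value near +1 or -1 indicates a strong relationship; it does not by itself show that one variable causes the other.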
