# CS Assignment Help | Machine Learning Assignment and Exam Help | ACDL2022 Terminology

## Terminology

To conduct machine learning, we must first have data. Suppose we have collected a set of watermelon records, for example, (color = dark; root = curly; sound = muffled), (color = green; root = curly; sound = dull), (color = light; root = straight; sound = crisp), ..., where each pair of parentheses encloses one record and “=” means “takes value”.

Collectively, the records form a data set, where each record contains the description of an event or object, e.g., a watermelon. A record, also called an instance or a sample, describes some attributes of the event or object, e.g., the color, root, and sound of a watermelon. These descriptions are often called attributes or features, and their values, such as green and dark, are called attribute values. The space spanned by attributes is called an attribute space, sample space, or input space. For example, if we consider color, root, and sound as three axes, then they span a three-dimensional space describing watermelons, and we can position every watermelon in this space. Since every point in the space corresponds to a position vector, an instance is also called a feature vector.

More generally, let $D=\left\{\boldsymbol{x}_1, \boldsymbol{x}_2, \ldots, \boldsymbol{x}_m\right\}$ be a data set containing $m$ instances, where each instance is described by $d$ attributes. For example, we use three attributes to describe watermelons. Each instance $\boldsymbol{x}_i=\left(x_{i1} ; x_{i2} ; \ldots ; x_{id}\right) \in \mathcal{X}$ is a vector in the $d$-dimensional sample space $\mathcal{X}$, where $d$ is called the dimensionality of the instance $\boldsymbol{x}_i$, and $x_{ij}$ is the value of the $j$th attribute of the instance $\boldsymbol{x}_i$. For example, among the records listed at the beginning of this section, the second attribute of the third watermelon takes the value straight, that is, $x_{32}=$ straight.
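The notation can be made concrete with a minimal Python sketch. It stores only the three records written out at the beginning of this section (the elided records are not filled in). The names `ATTRIBUTES`, `D`, and `attribute_value` are illustrative choices, and the attribute order (color, root, sound) is assumed from the order in which the text lists them.

```python
# Attribute names in the order the text lists them (the ordering is an assumption).
ATTRIBUTES = ("color", "root", "sound")

# The three watermelon records written out in the text; each tuple is one
# instance x_i, i.e. one feature vector in the attribute space.
D = [
    ("dark", "curly", "muffled"),
    ("green", "curly", "dull"),
    ("light", "straight", "crisp"),
]

m = len(D)           # number of instances in this small sample
d = len(ATTRIBUTES)  # dimensionality of every instance


def attribute_value(data, i, j):
    """Return x_ij, the j-th attribute of the i-th instance (1-based, as in the text)."""
    return data[i - 1][j - 1]


x_32 = attribute_value(D, 3, 2)
print(m, d, x_32)
```

The helper uses 1-based indices so that `attribute_value(D, 3, 2)` lines up with $x_{32}$ in the text. A numerical encoding of the attribute values would be needed before treating the instances as points in a numeric space.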

## Hypothesis Space

Induction and deduction are two fundamental tools of scientific reasoning. Induction is the process from specialization to generalization, that is, summarizing specific observations to generalized rules. In contrast, deduction is the process from generalization to specialization, that is, deriving specific cases from basic principles. For example, in axiomatic systems of mathematics, the process of deriving a theorem from a set of axioms is deduction. By contrast, learning from examples is an inductive process, also known as inductive learning.

In a broad sense, inductive learning is almost equivalent to learning from examples. In a narrow sense, inductive learning aims to learn concepts from training data, and hence is also called concept learning or concept formation. The research and applications on concept learning are quite limited because it is usually too hard to learn generalized models with clear semantic meanings, whereas in real-world applications, the learned models are often black boxes that are difficult to interpret. Nevertheless, having a brief idea of concept learning is useful for understanding some basic concepts of machine learning.

The most fundamental form of concept learning is Boolean concept learning, which encodes target concepts as the Boolean values 1 or 0, indicating true or false. Taking the training data in Table 1.1 as an example, suppose we want to learn the target concept ripe, and assume that the ripeness of a watermelon depends entirely on its color, root, and sound. In other words, whether a watermelon is ripe is determined once we know the values of those three variables. Then the concept to be learned could be “a ripe watermelon is one with color $= X$, root $= Y$, and sound $= Z$”, or equivalently the Boolean expression “ripe $\leftrightarrow$ (color $=$ ?) $\wedge$ (root $=$ ?) $\wedge$ (sound $=$ ?)”, where the “?” marks are the values to be learned from the training data.
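A conjunctive Boolean concept of this form can be sketched in Python as follows. The function `matches` and the example values in `hypothesis` are hypothetical and chosen only for illustration: they are not taken from Table 1.1 and are not claimed to be the concept that learning would produce.

```python
def matches(hypothesis, instance):
    """Evaluate (color = ?) AND (root = ?) AND (sound = ?) on one instance.

    `hypothesis` maps each attribute name to the value filled in for its "?".
    Returns True (1) if every attribute of the instance equals that value,
    otherwise False (0).
    """
    return all(instance[attr] == value for attr, value in hypothesis.items())


# A hypothetical filling of the three "?" slots, for illustration only.
hypothesis = {"color": "green", "root": "curly", "sound": "muffled"}

melon_a = {"color": "green", "root": "curly", "sound": "muffled"}
melon_b = {"color": "green", "root": "curly", "sound": "dull"}

print(matches(hypothesis, melon_a))  # all three conjuncts hold
print(matches(hypothesis, melon_b))  # the sound conjunct fails
```

Under this representation, learning amounts to choosing the values stored in `hypothesis` so that the predictions agree with the labels in the training data.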

