Follow and like us on our Facebook page where we post on the new release subject and answering tips and tricks to help save your time so that you can never feel stuck again.
Shortcut

Ctrl + F is the shortcut in your browser or operating system that allows you to find words or questions quickly.

Ctrl + Tab to move to the next tab to the right and Ctrl + Shift + Tab to move to the next tab to the left.

On a phone or tablet, tap the menu icon in the upper-right corner of the window; Select "Find in Page" to search a question.

Share Us

Sharing is Caring

It's the biggest motivation to help us to make the site better by sharing this to your friends or classmates.

# Introduction to Machine Learning

Provides an overview of the key concepts and algorithms of machine learning, empowering students to develop intelligent systems and make data-driven decisions.

## data preprocessing

A model that detects fraudulent credit card use, for example, would be trained using a dataset that included labeled data points of known fraudulent and valid charges

• True
• False

Because the RSS is consistent across datasets, a straight line has a minimal variance

• True
• False

Since you are familiar with the intercept and coefficients of the polynomial multiple regression function, you can quickly visualize the 3D prediction function

• True
• False

Applying a trained model to data is what fit() and transform() are designed to do predict()

• True
• False

To generate datasets with different distributions, use the sklearndatasetssamples generator module of the Scikitlearn package

• True
• False

In matplob, you can easily connect the three elements better by dragging the plot with your keyboard

• True
• False

The optimal k is the one with the lowest average silhouette

• True
• False

Boolean indexing is a mechanism for selecting the number of rows to print

• True
• False

A polynomial regression line may not always be the optimal method for effectively capturing the relationships between the characteristics and labels

• True
• False

The intercept and coefficient are currently of greatest interest to us after the model has been predicted

• True
• False

The next step is to identify the best K that yields the highest average Silhouette Coefficient once you have seen how to calculate the mean Silhouette Coefficient for a dataset with K clusters

• True
• False

The structure of a DataFrame in Pandas demonstrates that a Series is a single column in a DataFrame along with the index

• True
• False

By estimating property prices based on several features, you will discover a variation on simple linear regression known as multiple linear regression

• True
• False

It's worth noting that a NumPy slice produces a reference rather than a copy of the original array

• True
• False

A broken value, representing the likelihood of an observation belonging to a given class, can also be the result of a classification difficulty

• True
• False

The main idea behind SVM is to draw a line between two or more classes in the best possible manner

• True
• False

Various models are utilized in the ensemble learning technique in order to work together on a single dataset, and the results are then combined

• True
• False

If the DataFrame is too long, you can use the head() function to accomplish this

• True
• False

4To get the best value for k, choose the value of k that offers the least accuracy Maximum

• True
• False

It is unclear at what value the linear regression line intercepts the x-axis Y-axis

• True
• False

Boolean indexing is a method that lets you select how many rows to print

• True
• False

The plot() function draws points without connecting lines Depending on the inputs, may or may not be plot lines Scatter() function

• True
• False

Anaconda is an open source package management system and environment management system for installing multiple versions of software packages and their dependencies and switching easily between them

• True
• False

To train a model in Scikit-learn, you usually utilize the predict() function Fit() function

• True
• False

When using a KNN, increasing k tends to make your prediction more resistant to data noise

• True
• False

You will have a very clear idea of the sizes you need to create once the algorithm has been located

• True
• False

It's your responsibility to organize each of the points into a distinct group so that you can look for a pattern

• True
• False

The end index is the source of most misunderstandings when it comes to slicing

• True
• False

Your goal with unsupervised learning is to predict a pattern in the dataset

• True
• False

When the axis parameter is set to 0, it means you wish to sort by index

• True
• False

A relationship between two variables is said to have a positive correlation when both variables move in lockstep

• True
• False

Using a linear object, you can build a new feature matrix from any polynomial combinations of features with a degree less than or equal to the desired degree

• True
• False

A NumPy slice generates a reference rather than a copy of the original array

• True
• False

It's worth noting that the sort_values() function only accepts Series objects with indexes equal to the DataFrame's index (axis=0) or the DataFrame's columns (axis=1)

• True
• False

When one variable rises while the other rises or when one variable fall while the other falls, there is a positive correlation

• True
• False

Keep in mind that the boundary smooths out as k decreases

• True
• False

Matplotlib makes it simple to create sophisticated charts and figures, and its combination with Jupyter Notebook makes it an excellent machine learning tool

• True
• False

A hard margin means that an SVM is very rigid in classification and tries to work extremely well in the training set, causing overfitting

• True
• False

A higher C will aim for the widest margin possible, but it will result in some points being classified incorrectly

• True
• False

The number of rows required for training cannot be less than the value of k More than

• True
• False

Setting the x-axis to start at 45 and expanding the range of the y-axis is a better visualization approach 0

• True
• False

A good application of Python programming is determining if a particular credit card transaction is fraudulent

• True
• False

The process of making a NumPy object is similar to that of making a NumPy array

• True
• False

Each cell in a Jupyter Notecourse can be run independently

• True
• False

Loading the dataset into a Pandas dataframe and then using the predict() function to check for null values in the dataframe is an effective technique to detect empty rows Isnull()

• True
• False

It's worth noting that the scatterplot will choose the colors for each pie chart slice Matplotlib

• True
• False

Matplotlib will choose the colors for each of the slices in the pie chart by default

• True
• False

To run or execute a code in Jupyter Notebook, you need to save the code with an extension name of py and remove the first line that contains the expression =E2=80=9C%matploblib inline"

• True
• False

You can easily visualize the 3D prediction of function since you are aware of the intercept and coefficients of the polynomial multiple regression function

• True
• False

To illustrate how one variable influences the value of another, factor charts are widely used

• True
• False

The =E2=80=98as np' portion of the code tells Python to bring the NumPy library into your current environment

• True
• False

Anaconda is a web-based editor for working with Python projects

• True
• False

If you wish to extract specific rows and columns from a DataFrame, you'll need to use the index property

• True
• False

The visual inspection makes it simple to determine the value of K for a small dataset, but it becomes more difficult for a large sample

• True
• False

It's critical to distinguish between the bars by setting their alpha to 05 because they may overlap (which makes them transparent)

• True
• False

A more sensible strategy would be to use the confusion matrix

• True
• False

When matplotlib loads a dataset with empty fields, it will represent those fields with NaN Pandas

• True
• False

The describe() function can be used to obtain numbers such as count, mean, standard deviation, minimum and maximum

• True
• False

Slicing is also applicable to both Series and DataFrames in Pandas

• True
• False

A high Gamma value indicates that every point has a far reach

• True
• False

There are numerous classes and functions for handling polynomial regression in the StatLib

• True
• False

Set the inplace argument to True if you want the original DataFrame to be sorted

• True
• False

With the dataset, you can use the K-Means technique to cluster these individuals according to the precise measurements of various parts

• True
• False

Unsupervised learning algorithms aim to uncover associations in data that do not have labels

• True
• False

When the frequency is set to year, the last day of the month will be the day of each date

• True
• False

For SVM, the right line is the one that has the widest margins, with each margin touching at least a point in each class

• True
• False

To prevent knots, similar to the preceding point, the number of classes must not be a multiple of the value of N k

• True
• False

In machine learning, regression identifies which set of categories a new observation belongs to based on training data containing the observed categories

• True
• False

The outcome of a classification problem can also be a broken value, indicating the likelihood of an observation belonging to a particular class

• True
• False

The goal of SVM is to separate the points into two or more classes so that you can use it to predict the classes of future points

• True
• False

The difference between Boosting and Bagging is that each learner is trained using all the data Still, the data points that previous learners misclassified are given greater weight so that following learners will pay closer attention to them during training

• True
• False

The best hyperplane is that plane with the shortest distance from both classes, which is SVM's main aim

• True
• False

The describe indexer can also be used to extract rows based on the row number

• True
• False

The index property is required if you want to extract specific rows and columns from a DataFrame

• True
• False

In mathematics, accuracy is calculated by dividing the total number of guesses by the total number of correct forecasts or prediction

• True
• False

Factor plots are frequently used to show how one variable affects the value of another Scatter plot

• True
• False

Fortify() initializes a ggplot object that can be used to declare the input data frame for a graphic and specify the plot aesthetics intended to be common throughout all subsequent layers unless specifically overridden Ggplot()

• True
• False

An overfitted model's main drawback is that it will struggle to handle brand-new, untested data

• True
• False

By default, Matplotlib will decide on the colors for each slice of the pie

• True
• False

In bar charting, the colors will be recycled because there are more slices than the colors you requested Pie charting

• True
• False

Using python commands, you can rotate and interact with the chart

• True
• False

NumPy index may also handle subtraction, multiplication, and division in addition to addition

• True
• False

It means that you have a 50% chance of landing a head when you state that the odds of landing a head are 1

• True
• False

Swarmplots are identical to line graphs that show how much one variable is affected by the presence of another, and this relationship between two variables is statistically termed as their correlation Scatter plots

• True
• False

For instance, when you flip a coin, the probability of getting a head is 1

• True
• False

In general, most DataFrame operations do not change the original DataFrame

• True
• False

Running a code outside Jupyter Notebook would have a significantly better way of presenting the 3D plot

• True
• False

In slicing by number, start:end means extracts row start through row-end but includes the end row, slicing by value includes the end row

• True
• False

Labels are also often referred to as targets, whereas features are also referred to as explanatory variables

• True
• False

You can do this with the head() function if the DataFrame is too long

• True
• False

You can use the iloc indexing method to select a subset of the DataFrame based on specific cell values

• True
• False

To display a 3D plot in python, you need to type in the command \$ python filenamepy

• True
• False

Matplotlib makes constructing intricate charts and figures simple, and it works well as a machine learning tool when combined with Jupyter Notebook

• True
• False

A NumPy object is created in the same way that a NumPy array is created

• True
• False

You can add another bar graph to an existing chart in the same way you can add another line chart

• True
• False

The degree of the polynomial features is specified by a number the PolynomialMultiple class accepts

• True
• False

The best k is chosen for each k, which reveals the k with the lowest MSE MSE

• True
• False

When displaying percentages or proportions of data, a pie chart is beneficial

• True
• False

Another important element to keep in mind is that the outcome of the slicing is determined by how you slice it

• True
• False

A positive correlation exists when one variable increases as the other increases or when one variable decreases while the other decreases

• True
• False

All points can be separated linearly, nor can they be separated using the kernel tricks

• True
• False

The make regression() function produces linearly distributed data wherein you can define the number of features and the standard deviation of the Gaussian noise used in an output

• True
• False

It is not possible to draw a straight line to separate two sets of points However, you can make this set of points linearly separable with some manipulation

• True
• False

A dependent variable class in the Scikit-learn library can help you precisely create an instance of this class and use the heights and weights lists to build a linear regression model with the fit() function LinearRegression class

• True
• False

Because the data distribution in datasets might not be uniform, your test set might be difficult to anticipate, making it impossible to determine whether your model is inefficient

• True
• False

Although printing the predictions alongside the test set's original diagnoses is useful, it does not give a clear sense of how well the model predicts if a tumor is malignant

• True
• False

In the KNN model, the score is higher at the conclusion of the k-runs

• True
• False

You can include a label in your pie charts, but before you can accomplish it, you must first deal with the pie() function's return values Legend

• True
• False

Obtaining sample datasets for experimentation is frequently one of the issues in machine learning

• True
• False

Use DataFrameplot to access the shape of a Pandas DataFrame; the shape property provides a tuple that indicates the dimensionality of the DataFrame DataFrameshape

• True
• False

You can utilize the iloc indexing method to choose a subset of the DataFrame depending on particular values in the cells

• True
• False

In Pandas, slicing may be applied to both Series and DataFrames

• True
• False

The use pandas portion of the code tells Python to bring the panda's data analysis library into your current environment

• True
• False

The tail() function in Python displays the last five rows of the dataframe by default

• True
• False

Overfitting happens when the model does not adequately fit the facts

• True
• False

Instead of writing the index of each row and column at the center of the number, a better method to visualize slicing is to write it between the numbers

• True
• False

Unlabeled data lacks a label Hence the fact that we can identify patterns in it is of interest to us because there is no label in unlabeled data

• True
• False

The simplest way to begin with Scikit-learn for machine learning is to start with linear regression

• True
• False

Multiplying the relevant entries in each vector and adding the results yields the sum product of two vectors

• True
• False

The higher the value of Gamma, the more it will try to fit the training dataset exactly, resulting in oversizing

• True
• False

The main problem with KNN is how to determine the optimal value of k to employ

• True
• False

Because it will generate dramatically different RSS for varied datasets, the curved line exhibits a great amount of fluctuation

• True
• False

Any dimension can be used to calculate the distance between two places using the Euclidean distance() function

• True
• False

Starting with linear regression is the simplest method to get started with machine learning with Scikit-learn

• True
• False

Unsupervised learning's purpose is to predict a pattern in a dataset

• True
• False

The tail() method, like the head() function, lets you select how many rows to print

• True
• False

To avoid occurrences when there is a tie between the classes in a two-class problem, k should have an even value Odd

• True
• False

We can plot a linear regression using a specific code snippet after the values of =CE=B20 and x=CE=B2 are obtained

• True
• False

Because the bars could overlap, it's crucial to be able to tell them apart by setting their alpha to 05 (making them translucent)

• True
• False

SVM uses the kernel trick to deal with datasets that are not linearly separable

• True
• False

When cells run, a running number displays the sequence in which they are executed

• True
• False

Using a polynomial regression line is not always necessary to capture the correlations between the features and labels

• True
• False

In data cleansing, no encoding is required if the collection already contains all numerical values

• True
• False

Clustering helps in forecasting the future by estimating the relationship between variables

• True
• False

Overfitting indicates that your model makes a valiant effort to precisely fit all the data

• True
• False

Rather than having to retrain the model each time you have new data to test, a stored model allows you to load the trained model and make predictions without having to retrain the model

• True
• False

Sub plots can also be plotted separately and then combined into a single figure Multiple scatter plot

• True
• False

The lmplot is another popular plot in Seaborn

• True
• False

You may also load several interesting datasets in Scikitlearn, in addition to the Iris dataset

• True
• False

Pie charts can have labels added to them, but first, you must deal with the values returned by the pie() function

• True
• False

A better way to visualize is to start the x-axis at 45 and increase the y-range axis's

• True
• False

When learning Python, it is generally helpful to begin with a simple dataset that you may construct yourself to ensure that the algorithm is performing correctly according to your understanding Machine learning algorithm

• True
• False

The basic notion behind polynomial regression is finding the polynomial function's prediction that best matches the data

• True
• False

A straight line attempting to connect all the locations has a strong variance because it doesn't cut through all the points

• True
• False

A running number shows how cells were performed when they were run

• True
• False

In a Jupyter Notecourse, each cell can be run individually

• True
• False

You can create a new feature matrix made up of all polynomial combinations of the features with a degree less than or equal to the specified degree by using a linear object

• True
• False

You can use the magic function %matplotlib inline to enable scatter plotting, where the plots/graphs will be displayed just below the cell

• True
• False

If the random state parameter of the train_test_curve() function is not supplied, you will receive a different training and testing set each time you call this function

• True
• False

A state-based matplotlib interface called matplotlibfigure offers an implicit plotting method that is similar to MATLAB Matplotlibpyplot

• True
• False

The sigmoid function that tries to fit the points on the chart can be plotted using =CE=B20 and x=CE=B2

• True
• False

The concentric rings drawn around the test point are based on the values of k, which range from 7 to 1, with intervals of -2

• True
• False

The Scikitlearn library's sklearndatasetssamples generator module can be used to create datasets with various distributions

• True
• False

Depending on the inputs, the plot() function may or may not construct connecting lines when drawing points

• True
• False

Another option for dealing with Isnull in your dataset is to delete the rows that contain them NaNs

• True
• False

When performing multiplications on an index array and a matrix, there is another significant difference

• True
• False

You can also get the gradient of the linear regression line using the model object's coef_ property

• True
• False

NumPy arrays make it simple to conduct array math

• True
• False

One of the problems in machine learning is frequently obtaining sample datasets for experimentation

• True
• False

The sum-product of two vectors is computed by multiplying the relevant entries in each vector and adding the results

• True
• False

When the bbox inches parameter is set to zero, the surplus white space surrounding your figure is removed Tight

• True
• False

There is another fundamental distinction between multiplications on an index array and a matrix

• True
• False

To combine two arrays, you can use the npconcat() function instead of the + operator

• True
• False