Which one is not the feature engineering techniques used in ML data science world?
Imputation
Binning
One hot encoding
Statistical
Which command manually triggers a single run of a scheduled task (either a standalone task or the root task in a DAG) independent of the schedule defined for the task?
RUN TASK
CALL TASK
EXECUTE TASK
RUN ROOT TASK
Consider a data frame df with columns ['A', 'B', 'C', 'D'] and rows ['r1', 'r2', 'r3']. What does the ex- pression df[lambda x : x.index.str.endswith('3')] do?
Returns the row name r3
Results in Error
Returns the third column
Filters the row labelled r3
You previously trained a model using a training dataset. You want to detect any data drift in the new data collected since the model was trained. What should you do?
Create a new dataset using the new data and a timestamp column and create a data drift monitor that uses the training dataset as a baseline and the new dataset as a target.
Create a new version of the dataset using only the new data and retrain the model.
Add the new data to the existing dataset and enable Application Insights for the service where the model is deployed.
Retrained your training dataset after correcting data outliers & no need to introduce new data.
Which of the learning methodology applies conditional probability of all the variables with respec- tive the dependent variable?
Reinforcement learning
Unsupervised learning
Artificial learning
Supervised learning
Select the correct mappings:
I,II
I,II,III
III,IV
II,III,IV
W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa- rameter II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter III. Learning rate for training a neural network --> Model Hyperparameter IV. Batch Size --> Model Parameter
Mark the correct steps for saving the contents of a DataFrame to a Snowflake table as part of Moving Data from Spark to Snowflake?
Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter. Step 2.Specify SNOWFLAKE_SOURCE_NAME using the NAME() method. Step 3.Use the dbtable option to specify the table to which data is written. Step 4.Specify the connector options using either the option() or options() method. Step 5.Use the save() method to specify the save mode for the content.
Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter. Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method. Step 3.Specify the connector options using either the option() or options() method. Step 4.Use the dbtable option to specify the table to which data is written. Step 5.Use the save() method to specify the save mode for the content.
Step 1.Use the write() method of the DataFrame to construct a DataFrameWriter. Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method. Step 3.Specify the connector options using either the option() or options() method. Step 4.Use the dbtable option to specify the table to which data is written. Step 5.Use the mode() method to specify the save mode for the content. (Correct)
Step 1.Use the writer() method of the DataFrame to construct a DataFrameWriter. Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method. Step 3.Use the dbtable option to specify the table to which data is written. Step 4.Specify the connector options using either the option() or options() method. Step 5.Use the save() method to specify the save mode for the content.
Secure Data Sharing do not let you share which of the following selected objects in a database in your account with other Snowflake accounts?
Sequences
Tables
External tables
Secure UDFs
All aggregate functions except _____ ignore null values in their input collection
Count(attribute)
Count(*)
Avg
Sum
Which ones are the key actions in the data collection phase of Machine learning included?
Label
Ingest and Aggregate
Probability
Measure