Practical-Machine-Learning | Notes and code from Practical Machine Learning course slides | Machine Learning library
kandi X-RAY | Practical-Machine-Learning Summary
kandi X-RAY | Practical-Machine-Learning Summary
Notes and code from Practical Machine Learning course slides
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of Practical-Machine-Learning
Practical-Machine-Learning Key Features
Practical-Machine-Learning Examples and Code Snippets
Community Discussions
Trending Discussions on Practical-Machine-Learning
QUESTION
I am planning to run glm, lasso and randomForest across different sets of predictors to see which model combination is the best. I am going to be doing v-fold cross validation. To compare the ML algorithms consistently, the same fold has to be fed into each of the ML algorithms. Correct me if I am wrong here.
How can we achieve that in h2o package in R? Should I set
- fold_assignment = Modulo within each algo function such as h2o.glm(), h2o.randomForest() etc.
- Hence, would the training set be split the same way across the ML algos?
If I use fold_assignment = Modulo and what if I have to stratify my outcome? The stratification option is with fold_assignment parameter as well? I am not sure I can specify Modulo and and Stratified both at the same time.
Alternatively, if I set the same seed in each of the model, would they have the same folds as input?
I have the above questions after reading Chapter 4 from [Practical Machine Learning with H2O by Darren Cook] (https://www.oreilly.com/library/view/practical-machine-learning/9781491964590/ch04.html)
Further, for generalizability in site level data in a scenario as in the quotation below:
For example, if you have observations (e.g., user transactions) from K cities and you want to build models on users from only K-1 cities and validate them on the remaining city (if you want to study the generalization to new cities, for example), you will need to specify the parameter “fold_column” to be the city column. Otherwise, you will have rows (users) from all K cities randomly blended into the K folds, and all K cross-validation models will see all K cities, making the validation less useful (or totally wrong, depending on the distribution of the data). (source)
In that case, since we are cross folding by a column, it would be consistent across all the different models, right?
...ANSWER
Answered 2020-Aug-17 at 07:03Make sure you split the dataset the same for all ML algos (same seed). Having the same seed for each model won't necessarily have the same cross validation assignments. To ensure they are apples-to-apples comparisons, create a fold column (.kfold_column()
or .stratified_kfold_column()
) and specify it during training so they all use the same fold assignment.
QUESTION
I am trying to learn a bit about different types of regression and I am hacking my way through the code sample below.
...ANSWER
Answered 2018-Jul-09 at 15:49Just a few minutes ago I got an upvote for Function to calculate R2 (R-squared) in R. Now I guess it is from you, thanks.
Rsquare
function expects two vectors, but you've passed in a model object fit
(which is a list) and a vector test$mpg
. I guess you want predict(fit, newdata = test)
for its first argument here.
QUESTION
I have a large HTML table on a page of my old website. Once the table grew to a certain size (a few months ago) a random "i" showed up in the header of the table (see below).
The "i" is nowhere to be found in the original source HTML (see below). It only appears when the HTML transmitted from IIS and rendered in a browser. Both Chrome and Edge display the character so it doesn't appear to be an issue with a specific browser.
...ANSWER
Answered 2018-Feb-20 at 07:09I figured it out. :D
It's just a "typo" in your HTML, look at line 422 you have the i
placed there between the table markup: i
.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install Practical-Machine-Learning
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page