Data Science. TileDB. Open Source. Quant Research. R. C++. Debian. Linux. Adjunct Clinical Professor, University of Illinois. Lots of coffee. And some running.
Another STAT 447 is in the books, and a big thank you and congratulations to all students. Group projects were impressive (see below). Been thinking about "external" project topics for Data Science Programming Methods, may open a GH repo. Get in touch if interested. #rstats
PSA: If g++-11 produces errors like "reference to ‘data’ is ambiguous", ensure you do not (accidentally ?) flatted the `std` namespace and one of your containing `data`.
C++17 brings us `std::data()` which can clash.
Illustration and fixes in repo.
github.com/eddelbuettel/mine…
_If_ you trial numbers are indeed unique, make it a factor and retrieve the factor levels.
Or use @Rdatatable which does that for you via grouping and access to the group index.
#rstats
How It Started How It's Going
Less Is More: #rstats testing edition.
Yesterday's post at dirk.eddelbuettel.com/blog/ has more on "less is more", and a vignette engine example.
It's git. There are no rules. Only trial and error til we get there. And battle scars that remind us how we got there.
Kidding. I love git, and yes it surely beats rsyncing between machines and messing up.
Yep because you can rebase, or squash, or simply cherry pick.
It all varies. Some of my projects have more and smaller commits (and you may find 'temp' or 'snapshot' in the commit message -- per your need here). Others (i.e. work) squash.
"Mark Zuckerberg has only neutral feelings toward Peppa Pig, who he understands is a fictional character, and he blames the coronavirus pandemic on other factors."
xkcd.com/2551/
Yes. Even better to just do
suppressMessages({
library(splines)
library(ggplot2)
library(dplyt)
...
})
as you then also catch non-conformant packages NOT using startup messages. This really silences *evreything* which is a nice feature.