Data Science. TileDB. Open Source. Quant Research. R. C++. Debian. Linux. Adjunct Clinical Professor, University of Illinois. Lots of coffee. And some running.
Per the @Rdatatable benchmarks at h2oai.github.io/db-benchmark… (which includes a 'join' problem likely close to your merge issue) you are probably unlikely to beat `data.table` just by going to #Python or #Rcpp. Maybe profile a little and then discuss with team @Rdatatable?
A 32-bit integer has a limit on its range, but if you read my reply to @jaredlander you will note I already mentioned `numeric`. Which works fine, obviously. And we also have `int64` in `nanotime` if you want higher-resolution increments.
I would suspect you cannot as no reader can guess when seeing an `int` that you meant it as a delta relative to epoch. I would just read as `int` (or `numeric`) and convert as shown in the screenshot.
Yes -- that is my understanding (and I owe the organizers a follow-up confirming this). We also plan to add more documentation and tutorials to the @tiledb website.
Come to our @tiledb#rstats tutorial where @aaronwolen and I may show how to put a 194 million rows "flights" csv file (85 gb uncompr.) into a single TileDB (sparse) array (indexed by flight date, carrier, origin and destination) you can read / write to / from S3 / GCS / Azure.
🔊 Nine of our tutorials are sold out!!
Don't waste time and register to ensure your place.
Look what we offer you in English.
📌 user2021.r-project.org/parti…
Concurrent multi-user editing of markdown docs with revisions, graphics, ... under copyleft license and containers to self-host. Looks like a lot to like in @HedgeDocOrg at hedgedoc.org/
H/t to @troy_phd for a pointer to this hackmd alternative
There are a few tutorials on how to set X11 up. I use Ubuntu natively: see the screen shot for using an alias I have for Docker. This uses R 4.1.0 via @marutterstat 's PPA using my @Debian package of #RStats and the r-base container that is also our rocker/r-base
#RockerProject
Sure. Am over in RF and could cycle to OP to pick up bagels. Seems like karmic justice after a (rare) morning run (as I am still struggling with some lingering injury).
inline 0.3.18 on CRAN: Routine update
Inline C, C++ and Fortran programs with ease in your R code
dirk.eddelbuettel.com/blog/2…
Updated test logic accomodating R-devel changes thanks to @JohannesRanke#rstats
That is not needed for this 4.0.* to 4.1.* transition.
Some release require it, but then it is clearly signalled. R 3.6.* to R 4.0.* was one such case, as was not R 3.3.* to R R 3.4.*. All others since R 3.0.0, as well as many before, simply reused your packages!
#rstats
The data team at @TheEconomist outdid itself: cover story of its Covid 19 excess death model, lead article, longer briefing, tech background at economist.com/graphic-detail…, additional page with per-country fits and a @Github repo with code and data. Bravo!
Truncation of (daily) dates to years in #RStats came up on SO, and along with heavy hitters @bolkerb and (twitterless?) Thomas Petzoldt three nice answers were put up. I like my (revised) base R one---but there is a 'wart' with `trunc.Date`. stackoverflow.com/a/67562643…
I used this one this week:
developer.ibm.com/exchanges/…
Comes in a humungous 190 million row (if I remember correctly) full set and a 2 million row subset.
One week to go to the #RStats 4.1.0 release!
The release cycle is now at 'rc' aka release candidate versions of R 4.1.0 which you can, and should, test via @debian experiemental binaries from my ppaR400 at @github or via @docker as shown below.