Data Science. TileDB. Open Source. Quant Research. R. C++. Debian. Linux. Adjunct Clinical Professor, University of Illinois. Lots of coffee. And some running.
The second edition of ISLR is due out by end of summer! If you are an **instructor** in need of a pre-print to prepare for AY 2021-2022, then send an e-mail to hello@statlearning.com
Yes! Had an eight-minute round from submission to publication today.
But incentives are misaligned: happens mostly to packages with i) no / few reverse dependencies and ii) little / no compiled code (very easy to trip up). Yet it iii) encourages status OK which is good.
Thanks to the @gretl_stats team to invite me to the Gretl 2021 Conference to talk a little about the now 21 years (!!) of maintaining Gretl inside @Debian. The slides, with some asides on @Docker / #RockerProject and (recent) @obshq comments are now on my site.
Mission accomplished! The @Debugging_Book and all its 15 chapters are now complete - from interactive debuggers to slicing and automated repair. Thank you for your continued support and enjoy the read at debuggingbook.org !
Today, I am announcing "The Debugging Book" – an interactive book on tools and techniques for automated software debugging. With code, notebooks + videos for debuggers, slicing, statistical debugging, program repair, and more. New chapters every Tuesday! debuggingbook.org/
True but my focus is on equities as I still run a '4:15' portfolio (p&l and risk) report over my accounts, using the same non-R code I wrote 20 years ago, and which should get rewritten in #RStats "real soon now". As I have been saying for 20 years now too...
My causal inference textbook The Effect has done an editing round and is now available in Bookdown form! It's also at its own easy-to-remember URL at theeffectbook.net
Check it out! The Bookdown version will remain up even after the published version is out.
Today I finally get to share the full draft of my new book The Effect. I think it turned out really well. The Effect offers a highly accessible and intuitive approach to research design and causal inference with observational data. Find it here: nickchk.com/causalitybook.ht…
Per the @Rdatatable benchmarks at h2oai.github.io/db-benchmark… (which includes a 'join' problem likely close to your merge issue) you are probably unlikely to beat `data.table` just by going to #Python or #Rcpp. Maybe profile a little and then discuss with team @Rdatatable?
A 32-bit integer has a limit on its range, but if you read my reply to @jaredlander you will note I already mentioned `numeric`. Which works fine, obviously. And we also have `int64` in `nanotime` if you want higher-resolution increments.
I would suspect you cannot as no reader can guess when seeing an `int` that you meant it as a delta relative to epoch. I would just read as `int` (or `numeric`) and convert as shown in the screenshot.
Yes -- that is my understanding (and I owe the organizers a follow-up confirming this). We also plan to add more documentation and tutorials to the @tiledb website.
Come to our @tiledb#rstats tutorial where @aaronwolen and I may show how to put a 194 million rows "flights" csv file (85 gb uncompr.) into a single TileDB (sparse) array (indexed by flight date, carrier, origin and destination) you can read / write to / from S3 / GCS / Azure.
🔊 Nine of our tutorials are sold out!!
Don't waste time and register to ensure your place.
Look what we offer you in English.
📌 user2021.r-project.org/parti…
Concurrent multi-user editing of markdown docs with revisions, graphics, ... under copyleft license and containers to self-host. Looks like a lot to like in @HedgeDocOrg at hedgedoc.org/
H/t to @troy_phd for a pointer to this hackmd alternative