Skip to content
Toby Dylan Hocking edited this page Sep 5, 2025 · 1 revision

Background

The data.table package is an invaluable tool for data analysis and manipulation.

Related work /impact

data.table is one of the most widely used R packages.

Details of your coding project

Find one or more students to fix/close some of the hundreds of outstanding issues.

Maybe would be good to start with the issues labeled as beginner-task.

It would be easier for me if a student approached and said something like :

  • “I want to do some serious C at low level”. I could then propose something there.
  • “I want to write documentation/vignette/whitepaper”. Then I could suggest writing about data.table code that hasn’t been written about before.
  • “I want to close 100 issues, one per day”. Then maybe I could pick the ones where that might be possible.
  • “I want to work on performance testing / benchmarking”
  • “I want to work on performance testing/ using atime for benchmarking”. I will teach on how to use atime to check if reported data.table issues have been fixed.

Mentors/tests

Test is to find an issue labeled as beginner-task,

  • try to reproduce on your own computer, and report your results on the issue (were you able to reproduce? or not?)
  • create a PR which would close that issue.

When you have finished at least one test, please add a link to it on this page, then contact the following mentors:

  • Toby Dylan Hocking @tdhock
  • Anirban Chetia @Anirban166
  • Doris Amoakohene @DorisAmoakohene
  • Joshua Wu @joshhwuu

Potential contributor test results (to edit)

  • Contributor Name, link to issue, link to PR
  1. YOUR INFO HERE
Clone this wiki locally