Skip to content

Implementing the Paper: Data Portraits for Recording Foundation Model Training Data

Notifications You must be signed in to change notification settings

rjzhb/DataPortraits

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DataPortraits CMake

Implementing the Paper: Data Portraits for Recording Foundation Model Training Data Paper: https://arxiv.org/abs/2303.03919

Introduction

image

image

Requires spdlog, Google-test, MurmurHash3

sudo apt-get install libspdlog-dev.

sudo apt-get install libgtest-dev.
export CMAKE_PREFIX_PATH=/path/to/gtest:$CMAKE_PREFIX_PATH

sudo apt-get install libmurmurhash3-dev

Code Structure

  • benchmark -- application code to use the generated shared library
  • cmake -- cmake configuration files
  • docs -- any documents
  • include -- all the header files
  • src -- corresponding source files, will generate a shared library
  • test -- test code based on google test

About

Implementing the Paper: Data Portraits for Recording Foundation Model Training Data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published