Skip to content
/ CAFE Public

[SIGMOD 2024] CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models

Notifications You must be signed in to change notification settings

HugoZHL/CAFE

Repository files navigation

CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation Models

This repository contains all related code of our papers "CAFE+: Towards Compact, Adaptive, and Fast Embedding for Large-scale Online Recommendation Models" (under submission), and "CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models" (SIGMOD 2024).

Scripts

Our implementation builds upon DLRM repo: https://github.com/facebookresearch/dlrm

  1. The code supports interface with the Criteo Kaggle Display Advertising Challenge Dataset.

    • The model can be trained using the following script

      • Convert the value of the numerical feature to log(x+1).
      • Ensure that the feature count for each field is independent.
      • Set the parameters cat_path, dense_path, label_path and count_path in the script.
      ./bench/criteo_kaggle.sh
      
  2. The code supports interface with the Criteo Terabyte Dataset.

    • Please do the following to prepare the dataset for use with this code:

      • Convert the value of the numerical feature to log(x+1).
      • Ensure that the feature count for each field is independent.
      • Set the parameters cat_path, dense_path, label_path and count_path in the script.
    • The model can be trained using the following script

      ./bench/criteo_terabyte.sh
      
  3. The code also supports another two datasets Avazu and KDD12.

    • Please do the following to prepare the dataset for use with this code:

      • Ensure that the feature count for each field is independent.
      • Set the parameters cat_path, dense_path, label_path and count_path in the script.
    • The model can be trained using the following script

      ./bench/avazu.sh
      ./bench/kdd12.sh
      
  4. The code provides three models to train the dataset:

    • dlrm:

      ./bench/criteo_terabyte.sh
      
    • wdl:

      ./bench/wdl.sh
      
    • dcn:

      ./bench/dcn.sh
      
  5. The code provides six methods for generating embedding layers:

    • Full embedding with the following script

      ./bench/criteo_terabyte.sh
      
    • Hash embedding with the following script

      ./bench/criteo_terabyte.sh "--hash-flag --compress-rate=0.001"
      
    • CAFE with the following script

      ./bench/criteo_terabyte.sh "--sketch-flag --compress-rate=0.001 --hash-rate=0.3 --sketch-threshold=1 --adjust-threshold=1 --sketch-alpha=1.0000005"
      
    • QR embedding with the following script

      ./bench/criteo_terabyte.sh "--qr-flag --qr-collisions=10"
      
    • Ada embedding with the following script

      ./bench/criteo_terabyte.sh "--ada-flag --compress-rate=0.1"
      
    • MD embedding with the following script

      ./bench/criteo_terabyte.sh "--md-flag --compress-rate=0.1"
      

Guidance for Adjustment of CAFE Parameters

  • Default parameters:

    ./bench/criteo_terabyte.sh "--sketch-flag --compress-rate=0.001 --hash-rate=0.3 --sketch-threshold=1 --adjust-threshold=1 --sketch-alpha=1.0000005"
    
  • To get better experimental results, when cranking up the compression rate, you can crank down the memory footprint of the hash and crank up the threshold, and vice versa. For example, for other compression rates please try the following commands:

    ./bench/criteo_terabyte.sh "--sketch-flag --compress-rate=0.1 --hash-rate=0.7 --sketch-threshold=1 --adjust-threshold=1 --sketch-alpha=1.0000005"
    
    ./bench/criteo_terabyte.sh "--sketch-flag --compress-rate=0.0001 --hash-rate=0.2 --sketch-threshold=1 --adjust-threshold=1 --sketch-alpha=1.0000005"
    

Papers

If you find this work useful, welcome to cite our papers!

About

[SIGMOD 2024] CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •