Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why do file inventory jobs take so long for some users #1274

Open
hectorcorrea opened this issue Jan 31, 2025 · 0 comments
Open

Why do file inventory jobs take so long for some users #1274

hectorcorrea opened this issue Jan 31, 2025 · 0 comments

Comments

@hectorcorrea
Copy link
Member

hectorcorrea commented Jan 31, 2025

While fixing issue #1263 we noticed that the file inventory job takes a very long time when Matt (user_id 186) runs it in QA. The file inventory for project_id 44 (which has 189,000 files) takes 65 minutes when Matt runs it, but as shown below it only takes 5 minutes when Hector user_id 192 runs it.

Below is what the user_request table shows in QA.

#<FileInventoryRequest:0x00007ff447b7da20
  id: 16,
  user_id: 192,
  project_id: 44,
  job_id: "fa74522e-c720-4ed5-b30b-741bdeb3b804",
  completion_time: Fri, 31 Jan 2025 17:15:35.378825000 UTC +00:00,
  state: "completed",
  type: "FileInventoryRequest",
  request_details: {"file_size"=>22090419, "output_file"=>"/mnt/nfs/tigerdata/fa74522e-c720-4ed5-b30b-741bdeb3b804.csv", "project_title"=>"Test 06-12-24"},
  created_at: Fri, 31 Jan 2025 17:10:12.237105000 UTC +00:00,
  updated_at: Fri, 31 Jan 2025 17:15:35.382571000 UTC +00:00>,
 #<FileInventoryRequest:0x00007ff447b7d660
  id: 17,
  user_id: 186,
  project_id: 44,
  job_id: "fdb79db4-e6b6-41a3-842a-b63afa619163",
  completion_time: Fri, 31 Jan 2025 20:05:34.709517000 UTC +00:00,
  state: "completed",
  type: "FileInventoryRequest",
  request_details: {"file_size"=>22090419, "output_file"=>"/mnt/nfs/tigerdata/fdb79db4-e6b6-41a3-842a-b63afa619163.csv", "project_title"=>"Test 06-12-24"},
  created_at: Fri, 31 Jan 2025 18:59:57.304534000 UTC +00:00,
  updated_at: Fri, 31 Jan 2025 20:05:34.719367000 UTC +00:00>]

We think this could be related to the fact that Hector is an admin on MediaFlux QA but Matt isn't.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant