
Can’t Query Type Data Inserted by Bulk Loader #3968

Closed
cactus222 opened this issue Sep 11, 2019 · 3 comments · Fixed by #4202
@cactus222

What version of Dgraph are you using?

Dgraph version : v1.1.0
Dgraph SHA-256 : 7d4294a80f74692695467e2cf17f74648c18087ed7057d798f40e1d3a31d2095
Commit SHA-1 : ef7cdb2
Commit timestamp : 2019-09-04 00:12:51 -0700
Branch : HEAD
Go version : go1.12.7

Have you tried reproducing the issue with the latest release?

Yes

What is the hardware spec (RAM, OS)?

Attempted on Ubuntu 18.04 using the Dgraph Docker image.

Steps to reproduce the issue (command/config used to run Dgraph).

Start Zero (the commands are actually run from a docker-compose file; a sketch of such a file is included after the Alpha commands below)

dgraph zero --my=zero:5080

Feed data and schema into Dgraph bulk

data file

_:brand1 <dgraph.type> "Brand" .
_:brand1 <name> "brand1" .
_:brand2 <dgraph.type> "Brand" .
_:brand2 <name> "brand2" .

_:product1 <dgraph.type> "Product" .
_:product1 <brand> _:brand1 .
_:product1 <name> "name1" .
_:product1 <pid> "abc" .

_:product2 <dgraph.type> "Product" .
_:product2 <brand> _:brand2 .
_:product2 <name> "name2" .
_:product2 <pid> "123" .

_:product3 <dgraph.type> "Product" .
_:product3 <brand> _:brand2 .
_:product3 <name> "name3" .
_:product3 <pid> "ab1" .

schema file

type Product {
  name: string
  brand: uid
  pid: string
}

type Brand {
  name: string
}

name: string @index(term)  .
pid: string @index(hash)  .
brand: uid .

Run Dgraph Bulk on data and schema

dgraph bulk --schema ./data/smallschema.txt -f ./data/small.txt  --format=rdf --reduce_shards=2 --num_go_routines=2 --map_shards=2

Start alphas pointing to the generated directories

dgraph alpha --my=server:7080 --lru_mb=2048 --zero=zero:5080 -p out/0/p/
dgraph alpha --my=server:7081 --lru_mb=2048 --zero=zero:5080 -p out/1/p/ -o=1
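
Since these commands are actually run from a docker-compose file, here is a rough sketch of an equivalent compose file. The service names, image tag, volume mounts, and the --my value for the second Alpha are assumptions (the original compose file is not included in the report); only the flags come from the commands above.

version: "3.2"
services:
  zero:
    image: dgraph/dgraph:v1.1.0
    command: dgraph zero --my=zero:5080
  server:
    image: dgraph/dgraph:v1.1.0
    volumes:
      # bulk loader output for the first reduce shard
      - ./out/0/p:/dgraph/out/0/p
    command: dgraph alpha --my=server:7080 --lru_mb=2048 --zero=zero:5080 -p out/0/p/
  server1:
    image: dgraph/dgraph:v1.1.0
    volumes:
      # bulk loader output for the second reduce shard
      - ./out/1/p:/dgraph/out/1/p
    command: dgraph alpha --my=server1:7081 --lru_mb=2048 --zero=zero:5080 -p out/1/p/ -o=1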

Expected behaviour and actual result.

Query all objects with type Product

{
  q(func: type(Product)) {
    name
    uid
  }
}

Expected: all three product nodes are returned.
Actual: the result is empty.
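
For concreteness, a successful query would return all three products, along the lines of the sketch below (the uids are illustrative), whereas the actual response contains an empty q array.

{
  "data": {
    "q": [
      { "name": "name1", "uid": "0x2" },
      { "name": "name2", "uid": "0x3" },
      { "name": "name3", "uid": "0x4" }
    ]
  }
}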

Discussion link on forums: https://discuss.dgraph.io/t/cant-query-type-data-on-bulk-loader/5038

danielmai added the kind/bug label Sep 11, 2019
pawanrawal self-assigned this Sep 11, 2019
@pawanrawal
Contributor

pawanrawal commented Sep 11, 2019

This happens because of the following:

  1. When loading the given dataset with the bulk loader and reduce_shards set to 3, the data for dgraph.type ends up in the 3rd output shard, i.e. out/2/p.
  2. When the 1st Alpha node is started with out/0/p, it proposes the initial schema for dgraph.type and starts serving the tablet:
    gr.proposeInitialSchema()
  3. When the 3rd Alpha node comes up, it does not serve the dgraph.type predicate even though it has the data, because it finds that another node is already serving it.
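
One way to confirm this is to ask Zero which group is serving the tablet. Assuming Zero's HTTP port is at its default of 6080, its /state endpoint returns the per-group tablet assignments, so a check like the following shows which group has claimed dgraph.type (and that it is not the group holding the bulk-loaded data):

curl -s localhost:6080/state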

I think what we should do instead is to not propose the initial schema on startup, but to propose it when the first mutation for dgraph.type happens on the cluster, since the user would only run mutations after starting all the nodes serving the different shards.

Note - This problem could happen with any of the internal predicates defined via initialSchema = append(initialSchema, &pb.SchemaUpdate{...}) when loading data using the bulk loader.

@martinmr
Contributor

I think the cleanest solution should be to force the bulk loader to allocate the reserved predicates in the first shard. But I don't have a lot of insight into the bulk loader so it might not be that easy.
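
A rough sketch of that idea, in the spirit of the bulk loader's map-phase predicate-to-shard assignment (the function below is hypothetical, not the actual bulk loader code): route anything with the reserved dgraph. prefix to reduce shard 0 and hash everything else as before.

package main

import (
	"fmt"
	"hash/fnv"
	"strings"
)

// shardFor returns the reduce shard a predicate should land in.
// Hypothetical sketch: pin reserved predicates to the first shard,
// hash the rest across all shards as usual.
func shardFor(pred string, numShards int) int {
	// Reserved/internal predicates (dgraph.type, ...) always go to shard 0,
	// so the group that proposes the initial schema also holds the data.
	if strings.HasPrefix(pred, "dgraph.") {
		return 0
	}
	h := fnv.New32a()
	h.Write([]byte(pred))
	return int(h.Sum32() % uint32(numShards))
}

func main() {
	for _, p := range []string{"dgraph.type", "name", "brand", "pid"} {
		fmt.Printf("%-12s -> shard %d\n", p, shardFor(p, 2))
	}
}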

campoy added the area/bulk-loader, area/types, priority/P1, and status/accepted labels Sep 13, 2019
campoy added this to the Dgraph v1.1.1 milestone Sep 13, 2019
@pawanrawal
Contributor

> I think the cleanest solution should be to force the bulk loader to allocate the reserved predicates in the first shard.

That can be a temporary fix, yes. Though it is still possible that the Alpha node which starts with the first shard doesn't get assigned to the first group by Zero. Imagine a user starting a cluster with something like Kubernetes, where there is no guarantee that one Alpha process starts before another; the problem would still be present then.

martinmr assigned himself and unassigned pawanrawal Oct 22, 2019
martinmr added a commit that referenced this issue Oct 24, 2019
This PR contains a couple of related changes.

The bulk loader forces reserved predicates to end up in the first reduce shard. It
also writes a file in the posting directories with the proposed group ID for
each shard.
Dgraph looks at that file during startup and uses it to request the right group
ID from Zero.
The change is tested by modifying the 21million test to use multiple groups
and adding a new query to verify the number of nodes with a dgraph.type predicate.
If the test runs without the fix, dgraph.type sometimes ends up in a different group.

Fixes #3968.
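
A minimal sketch of that mechanism, assuming a plain-text marker file in each posting directory (the file name group_id and the helper names below are illustrative, not necessarily what the PR implements): the bulk loader records the proposed group for each shard, and Alpha reads it back on startup so it can request that specific group ID from Zero.

package main

import (
	"fmt"
	"io/ioutil"
	"os"
	"path/filepath"
	"strconv"
	"strings"
)

// writeGroupIDFile is what the bulk loader would do once per reduce shard:
// record the proposed group inside the shard's posting ("p") directory.
func writeGroupIDFile(pdir string, groupID uint32) error {
	data := []byte(strconv.FormatUint(uint64(groupID), 10) + "\n")
	return ioutil.WriteFile(filepath.Join(pdir, "group_id"), data, 0644)
}

// readGroupIDFile is what Alpha would do at startup: if the marker exists,
// ask Zero for exactly this group ID instead of accepting an arbitrary one.
// A return value of 0 means "no marker, fall back to Zero's normal assignment".
func readGroupIDFile(pdir string) (uint32, error) {
	data, err := ioutil.ReadFile(filepath.Join(pdir, "group_id"))
	if os.IsNotExist(err) {
		return 0, nil
	}
	if err != nil {
		return 0, err
	}
	id, err := strconv.ParseUint(strings.TrimSpace(string(data)), 10, 32)
	if err != nil {
		return 0, err
	}
	return uint32(id), nil
}

func main() {
	dir, err := ioutil.TempDir("", "p")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(dir)

	// Bulk loader side: the first shard is proposed for group 1.
	if err := writeGroupIDFile(dir, 1); err != nil {
		panic(err)
	}
	// Alpha side: read the proposal back before talking to Zero.
	gid, err := readGroupIDFile(dir)
	if err != nil {
		panic(err)
	}
	fmt.Println("proposed group:", gid)
}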