Bug "UNION types \"char\" and text cannot be matched" with v9.0.1.+++ versions, #2410

yevon · 2022-08-08T20:00:56Z

Environment

PostgREST docker image version: v9.0.1.20220802 or v9.0.1.20220714

Description of issue

I was trying versions newer than v9.0.1, starting with v9.0.1.20220714, but after switching the docker image I only get errors in the docker logs. Any idea what it could be? Going back to the stable v9.0.1 works fine. I tried a schema reload without success.

wolfgangwalther · 2022-08-08T20:39:17Z

Which PostgreSQL version are you using?

Can you try to put together a reproducible example? There's probably something in your schema that triggers this.

wolfgangwalther · 2022-08-08T20:45:14Z

You could also have a look at the PostgreSQL logs - the statement that failed should be logged there, could you show us that, too?

yevon · 2022-08-08T21:53:11Z

I'm using PostgreSQL 15 beta 2. I think I found the query that fails; it also fails when I execute it manually:

Found here: postgrest/src/PostgREST/DbStructure.hs, lines 655 to 809 in bac63f3

    
    with recursive
    pks_fks as (
      -- pk + fk referencing col
      select
        contype,
        conname,
        conrelid as resorigtbl,
        unnest(conkey) as resorigcol
      from pg_constraint
      where contype IN ('p', 'f')
      union
      -- fk referenced col
      select
        concat(contype, '_ref') as contype,
        conname,
        confrelid,
        unnest(confkey)
      from pg_constraint
      where contype='f'
    ),
    views as (
      select
        c.oid       as view_id,
        n.nspname   as view_schema,
        c.relname   as view_name,
        r.ev_action as view_definition
      from pg_class c
      join pg_namespace n on n.oid = c.relnamespace
      join pg_rewrite r on r.ev_class = c.oid
      where c.relkind in ('v', 'm') and n.nspname = ANY($1 || $2)
    ),
    transform_json as (
      select
        view_id, view_schema, view_name,
        -- the following formatting is without indentation on purpose
        -- to allow simple diffs, with less whitespace noise
        replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          regexp_replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
          replace(
            view_definition::text,
          -- This conversion to json is heavily optimized for performance.
          -- The general idea is to use as few regexp_replace() calls as possible.
          -- Simple replace() is a lot faster, so we jump through some hoops
          -- to be able to use regexp_replace() only once.
          -- This has been tested against a huge schema with 250+ different views.
          -- The unit tests do NOT reflect all possible inputs. Be careful when changing this!
          -- -----------------------------------------------
          -- pattern           | replacement         | flags
          -- -----------------------------------------------
          -- `<>` in pg_node_tree is the same as `null` in JSON, but due to very poor performance of json_typeof
          -- we need to make this an empty array here to prevent json_array_elements from throwing an error
          -- when the targetList is null.
          -- We'll need to put it first, to make the node protection below work for node lists that start with
          -- null: `(<> ...`, too. This is the case for coldefexprs, when the first column does not have a default value.
             '<>'              , '()'
          -- `,` is not part of the pg_node_tree format, but used in the regex.
          -- This removes all `,` that might be part of column names.
          ), ','               , ''
          -- The same applies for `{` and `}`, although those are used a lot in pg_node_tree.
          -- We remove the escaped ones, which might be part of column names again.
          ), E'\\{'            , ''
          ), E'\\}'            , ''
          -- The fields we need are formatted as json manually to protect them from the regex.
          ), ' :targetList '   , ',"targetList":'
          ), ' :resno '        , ',"resno":'
          ), ' :resorigtbl '   , ',"resorigtbl":'
          ), ' :resorigcol '   , ',"resorigcol":'
          -- Make the regex also match the node type, e.g. `{QUERY ...`, to remove it in one pass.
          ), '{'               , '{ :'
          -- Protect node lists, which start with `({` or `((` from the greedy regex.
          -- The extra `{` is removed again later.
          ), '(('              , '{(('
          ), '({'              , '{({'
          -- This regex removes all unused fields to avoid the need to format all of them correctly.
          -- This leads to a smaller json result as well.
          -- Removal stops at `,` for used fields (see above) and `}` for the end of the current node.
          -- Nesting can't be parsed correctly with a regex, so we stop at `{` as well and
          -- add an empty key for the following node.
          ), ' :[^}{,]+'       , ',"":'              , 'g'
          -- For performance, the regex also added those empty keys when hitting a `,` or `}`.
          -- Those are removed next.
          ), ',"":}'           , '}'
          ), ',"":,'           , ','
          -- This reverses the "node list protection" from above.
          ), '{('              , '('
          -- Every key above has been added with a `,` so far. The first key in an object doesn't need it.
          ), '{,'              , '{'
          -- pg_node_tree has `()` around lists, but JSON uses `[]`
          ), '('               , '['
          ), ')'               , ']'
          -- pg_node_tree has ` ` between list items, but JSON uses `,`
          ), ' '             , ','
        )::json as view_definition
      from views
    ),
    target_entries as(
      select
        view_id, view_schema, view_name,
        json_array_elements(view_definition->0->'targetList') as entry
      from transform_json
    ),
    results as(
      select
        view_id, view_schema, view_name,
        (entry->>'resno')::int as view_column,
        (entry->>'resorigtbl')::oid as resorigtbl,
        (entry->>'resorigcol')::int as resorigcol
      from target_entries
    ),
    recursion as(
      select r.*
      from results r
      where view_schema = ANY ($1)
      union all
      select
        view.view_id,
        view.view_schema,
        view.view_name,
        view.view_column,
        tab.resorigtbl,
        tab.resorigcol
      from recursion view
      join results tab on view.resorigtbl=tab.view_id and view.resorigcol=tab.view_column
    )
    select
      sch.nspname as table_schema,
      tbl.relname as table_name,
      rec.view_schema,
      rec.view_name,
      pks_fks.conname as constraint_name,
      pks_fks.contype as constraint_type,
      array_agg(row(col.attname, vcol.attname) order by col.attnum) as column_dependencies
    from recursion rec
    join pg_class tbl on tbl.oid = rec.resorigtbl
    join pg_attribute col on col.attrelid = tbl.oid and col.attnum = rec.resorigcol
    join pg_attribute vcol on vcol.attrelid = rec.view_id and vcol.attnum = rec.view_column
    join pg_namespace sch on sch.oid = tbl.relnamespace
    join pks_fks using (resorigtbl, resorigcol)
    group by sch.nspname, tbl.relname, rec.view_schema, rec.view_name, pks_fks.conname, pks_fks.contype

I'm trying to figure out why.

yevon · 2022-08-08T21:59:37Z

The issue is here, but I don't quite understand why it fails; it seems to be mixing char and text:
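
To narrow the failure down, the first CTE of the query above can be run on its own. The following is a reduced sketch of `pks_fks` that keeps only the `contype` column, assuming the mismatch comes from the two branches of its UNION. On PostgreSQL 15 it is expected to raise the error from the issue title; on earlier versions it should run.

```sql
-- Reduced from the pks_fks CTE above: only the contype column is kept.

-- First branch: contype is the catalog column of type "char".
select contype
from pg_constraint
where contype in ('p', 'f')
union
-- Second branch: concat() returns text.
select concat(contype, '_ref') as contype
from pg_constraint
where contype = 'f';
```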

yevon · 2022-08-08T22:05:58Z

Possible fix is to add a cast to the first union:
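
A minimal sketch of such a cast, shown on the `pks_fks` CTE only. Casting `contype` to `text` in the first branch gives both sides of the UNION the same type. This is one way to do it and not necessarily the exact change that was merged.

```sql
with pks_fks as (
  -- pk + fk referencing col
  select
    contype::text as contype,  -- explicit cast: "char" -> text
    conname,
    conrelid as resorigtbl,
    unnest(conkey) as resorigcol
  from pg_constraint
  where contype in ('p', 'f')
  union
  -- fk referenced col
  select
    concat(contype, '_ref') as contype,  -- already text
    conname,
    confrelid,
    unnest(confkey)
  from pg_constraint
  where contype = 'f'
)
select * from pks_fks;
```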

yevon · 2022-08-08T22:16:57Z

Seems to be just that: contype is of type "char" and concat returns type text, so the union fails.
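
A quick way to confirm the two types, plus a look at their type categories. The PostgreSQL 15 release notes mention a new `typcategory` value for `"char"`, which is likely why the two types no longer unify there. Treat that explanation as an assumption; the sketch below only inspects the catalog and uses built-in functions.

```sql
-- Types produced by the two UNION branches.
select
  pg_typeof('f'::"char")                 as contype_type,  -- "char"
  pg_typeof(concat('f'::"char", '_ref')) as concat_type;   -- text

-- Type categories, which PostgreSQL consults when resolving UNION column types.
-- Comparing the output on PostgreSQL 14 and 15 shows whether "char" changed category.
select typname, typcategory
from pg_type
where typname in ('char', 'text');
```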

yevon · 2022-08-23T19:48:19Z

Closing this, as it was fixed in v10.0.0. Thanks!

wolfgangwalther added the bug label Aug 8, 2022

yevon mentioned this issue Aug 8, 2022

Fix bug "UNION types \"char\" and text cannot be matched" with v9.0.1.+++ versions #2413

Merged

yevon closed this as completed Aug 23, 2022

wibrt mentioned this issue Dec 3, 2022

external pg15 and posgrest9 issue supabase/supabase#10707

Closed

ghost mentioned this issue Apr 13, 2023

[regression] many-to-many relationships (FK-only, no or different PK) #2748

Closed
