sources.beancount: Correct type for links and tags columns #242

andreasgerstmayr · 2025-02-03T22:43:27Z

Currently, [...] GROUP BY links throws the following exception:

beanquery.compiler.CompilationError: GROUP-BY a non-hashable type is not supported: "Column(name='links')"

because set is not hashable. frozenset is hashable and can be used for links and tags, which don't need to be mutable.
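
The hashability difference behind this error can be reproduced in plain Python. This is a minimal sketch, not beanquery code: the row data and the `is_hashable` and `group_sum` helpers are hypothetical, and GROUP BY is modelled as a dictionary keyed by the link set.

```python
from collections import defaultdict

# Hypothetical rows: (links, amount). Not taken from a real ledger.
rows = [
    ({"invoice-1"}, 1000),
    ({"invoice-1"}, -100),
    ({"invoice-2"}, 50),
]


def is_hashable(value):
    """Return True if the value can be used as a dict key or set member."""
    try:
        hash(value)
    except TypeError:
        return False
    return True


def group_sum(rows):
    """Sum amounts per link set, using frozenset as the grouping key."""
    totals = defaultdict(int)
    for links, amount in rows:
        totals[frozenset(links)] += amount
    return dict(totals)


set_ok = is_hashable({"invoice-1"})                # a mutable set has no hash
frozen_ok = is_hashable(frozenset({"invoice-1"}))  # the immutable variant does
totals = group_sum(rows)
```

Grouping needs a hashable key, which is why a column typed as set is rejected at compile time while a frozenset column can be grouped.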


dnicolodi · 2025-02-05T08:03:04Z

Thanks for working on this. I trusted the old type annotation in Beancount and deduced that these fields are returned as sets, not frozen sets. Newer Beancount has the right annotations. This should definitely be fixed. However, the proposed patch is not enough: other things need to be adjusted to support frozen sets. The first that comes to mind is support for rendering columns with the frozenset data type (without it, the rendering code falls back to the rendering of generic objects, which is quite ugly for frozen sets).
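
The fallback problem can be illustrated with a small sketch. It assumes a renderer lookup keyed by the exact column data type; the `RENDERERS` registry and the `render_*` functions are hypothetical names, not beanquery's actual rendering API.

```python
def render_generic(value):
    # Fallback used when no renderer is registered for the exact type.
    return repr(value)


def render_set(value):
    # Sorted, space-separated members give a stable, readable cell.
    return " ".join(sorted(value))


# Hypothetical registry keyed by exact column data type.
RENDERERS = {set: render_set}


def render(value, renderers=RENDERERS):
    """Pick a renderer by exact type, falling back to the generic one."""
    return renderers.get(type(value), render_generic)(value)


links = frozenset({"invoice-1"})
before = render(links)  # no frozenset entry: generic fallback
after = render(links, {**RENDERERS, frozenset: render_set})
```

With only a set renderer registered, `before` is the raw `frozenset({'invoice-1'})` representation; registering the same formatting for frozenset yields `invoice-1`.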

It is not strictly related to the PR itself, but if the added test case is an example of why you need this functionality, I think there is a better way of doing what you are doing:

2010-02-21 open Assets:AccountsReceivable:Doctor20100221

2010-02-21 * "Doctor appointment"
    Assets:AccountsReceivable:Doctor20100221   1000.00 USD
    Assets:Bank:Checking

2010-02-22 * "Insurance reimbursement"
    Assets:AccountsReceivable:Doctor20100221   -100.00 USD
    Assets:Bank:Checking

2010-03-22 * "Insurance reimbursement"
    Assets:AccountsReceivable:Doctor20100221   -900.00 USD
    Assets:Bank:Checking

2010-03-22 close Assets:AccountsReceivable:Doctor20100221

with a query like:

SELECT
  FIRST(date) AS date,
  FIRST(payee) AS payee,
  FIRST(narration) AS narration,
  SUM(position) AS balance
WHERE account ~ 'Assets:AccountsReceivable:'
GROUP BY account
HAVING NOT empty(SUM(position))
ORDER BY balance DESC
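
What this query computes can be sketched in plain Python using the postings from the ledger above. This is only an approximation under stated assumptions: a single currency, so balances are plain decimals rather than inventories; a prefix test standing in for the `~` regular-expression match; and a hypothetical `outstanding` helper. The bank postings carry the amounts Beancount would infer to balance each transaction.

```python
from collections import defaultdict
from decimal import Decimal

PREFIX = "Assets:AccountsReceivable:"
DOCTOR = PREFIX + "Doctor20100221"
BANK = "Assets:Bank:Checking"

# (account, amount in USD), in ledger order.
postings = [
    (DOCTOR, Decimal("1000.00")), (BANK, Decimal("-1000.00")),
    (DOCTOR, Decimal("-100.00")), (BANK, Decimal("100.00")),
    (DOCTOR, Decimal("-900.00")), (BANK, Decimal("900.00")),
]


def outstanding(postings):
    """Per-account balance of receivable accounts, dropping settled ones."""
    balances = defaultdict(Decimal)
    for account, amount in postings:
        if account.startswith(PREFIX):       # WHERE account ~ '...'
            balances[account] += amount      # GROUP BY account, SUM(position)
    nonzero = {a: b for a, b in balances.items() if b != 0}  # HAVING
    # ORDER BY balance DESC
    return dict(sorted(nonzero.items(), key=lambda item: item[1], reverse=True))


after_first_reimbursement = outstanding(postings[:4])
fully_reimbursed = outstanding(postings)
```

After the first reimbursement the account still shows 900.00 USD outstanding; once the second reimbursement is booked the balance is zero and the account drops out of the result.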

andreasgerstmayr · 2025-02-06T22:32:18Z

> Thanks for working on this. I trusted the old type annotation in Beancount and deduced that these fields are returned as sets, not frozen sets. Newer Beancount has the right annotations. This should definitely be fixed. However, the proposed patch is not enough: other things need to be adjusted to support frozen sets. The first that comes to mind is support for rendering columns with the frozenset data type (without it, the rendering code falls back to the rendering of generic objects, which is quite ugly for frozen sets).

Thanks! I'll update the PR and also add a new test for the rendering.

> It is not strictly related to the PR itself, but if the added test case is an example of why you need this functionality, I think there is a better way of doing what you are doing:

I agree, it feels a bit like a hack (I was surprised that grouping by links works in beancount.query), but I like the simplicity of it, as I don't have to open/close a new account for every reimbursement. The auto_accounts plugin would help with this, but then I wouldn't catch typos in account names (e.g. a misspelled Expenses:Groceries).

dnicolodi · 2025-02-07T13:11:26Z

> I agree, it feels a bit like a hack (I was surprised that grouping by links works in beancount.query), but I like the simplicity of it, as I don't have to open/close a new account for every reimbursement. The auto_accounts plugin would help with this, but then I wouldn't catch typos in account names (e.g. a misspelled Expenses:Groceries).

The problem with using links in this way is that it makes it impossible to use them for anything else: adding another link to a transaction breaks the GROUP BY, and there is no mechanism in place to make sure that additional links are not added. Furthermore, there is no check that the set of links is consistent (i.e. that there are no typos). Finally, it works reasonably well for tracking reimbursements where you have one expense and a few reimbursement transactions. However, it breaks down quite badly if a single transaction reimburses expenses incurred in multiple transactions, or if only part of an expense is reimbursable. None of these problems arise when using a dedicated reimbursement account per reimbursement request. I use this system to track the reimbursement of my travel expenses and it scales well. I think the cost of the open and close entries is worth the advantages.
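
The first failure mode can be shown with a short sketch. The link names and the `balance_by_links` helper are hypothetical; the amounts follow the ledger example earlier in the thread, with the last reimbursement carrying one extra link.

```python
from collections import defaultdict

# (narration, links, amount posted to the receivable side)
transactions = [
    ("Doctor appointment", frozenset({"doctor-2010"}), 1000),
    ("Insurance reimbursement", frozenset({"doctor-2010"}), -100),
    # An unrelated extra link changes the grouping key of this transaction.
    ("Insurance reimbursement", frozenset({"doctor-2010", "taxes-2010"}), -900),
]


def balance_by_links(transactions):
    """Group by the whole link set, as GROUP BY links would."""
    balances = defaultdict(int)
    for _narration, links, amount in transactions:
        balances[links] += amount
    return dict(balances)


balances = balance_by_links(transactions)
```

Instead of one group netting to zero, the result has two groups with balances of 900 and -900, so the request appears both partly unpaid and overpaid.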

andreasgerstmayr · 2025-02-09T15:01:52Z

Good points, I'll consider it in the future.

I added a frozenset renderer (inherited from the set renderer) to the PR. Do I need to update anything else?
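
As a rough illustration of the inheritance approach, and not the code in the PR, a frozenset renderer can reuse the set formatting by overriding only the data type it is registered for. The class names, the `dtype` attribute, and the `format` method below are assumptions made for this sketch.

```python
class SetRenderer:
    """Hypothetical renderer: formats a set as sorted, separated members."""

    dtype = set

    def __init__(self, separator=" "):
        self.separator = separator

    def format(self, value):
        return self.separator.join(sorted(value))


class FrozenSetRenderer(SetRenderer):
    """Same formatting as SetRenderer, registered for frozenset columns."""

    dtype = frozenset


# Registry keyed by data type, built from each class's dtype attribute.
RENDERERS = {cls.dtype: cls() for cls in (SetRenderer, FrozenSetRenderer)}
cell = RENDERERS[frozenset].format(frozenset({"b", "a"}))
```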

andreasgerstmayr · 2025-02-09T19:37:23Z

Actually, you got me convinced. I just refactored my entire ledger (with autobean-refactor), and added a customized version of the auto_accounts plugin to only auto-open Assets:AccountsReceivable accounts. Best of both worlds.
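
The selection logic of such a restricted plugin might look like the sketch below. It uses plain dictionaries and sets rather than the Beancount plugin API, and the `accounts_to_auto_open` helper, the misspelled account name, and its date are hypothetical.

```python
PREFIX = "Assets:AccountsReceivable:"


def accounts_to_auto_open(first_use, opened, prefix=PREFIX):
    """Return sorted (account, date) pairs that should get an automatic open.

    Only unopened accounts under the given prefix qualify; everything else
    is left to the normal validation, which reports unopened accounts.
    """
    return sorted(
        (account, date)
        for account, date in first_use.items()
        if account not in opened and account.startswith(prefix)
    )


# Account -> date of first use.
first_use = {
    "Assets:AccountsReceivable:Doctor20100221": "2010-02-21",
    "Assets:Bank:Checking": "2010-02-21",
    "Expenses:Grocereis": "2010-02-23",  # deliberate typo
}
opened = {"Assets:Bank:Checking", "Expenses:Groceries"}
auto = accounts_to_auto_open(first_use, opened)
```

Only the receivable account is opened automatically; the misspelled expense account stays unopened and would still be reported as an error.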

Anyway, I think the PR is still valid; there may be other use cases for grouping by links or tags.

dnicolodi changed the title from "Support GROUP BY for links and tags" to "sources.beancount: Correct type for links and tags columns" · Feb 4, 2025

add renderer for frozenset

2adaf14
