improve backbone extraction #24

jboynyc · 2020-10-22T13:37:46Z

It would be nice to move away from Serrano et al. to something more robust in a future release.

Possibly relevant literature: 1, 2

jboynyc · 2020-11-10T13:03:55Z

I will focus on implementing Liebig & Rao (2016).

jboynyc · 2021-10-11T14:43:04Z

BradKML · 2021-10-27T09:16:21Z

Can one say that a "backbone" in a network operates as a "keyword" in a document-term graph? Or maybe a "top person" in a document-author or author-term graph?

jboynyc · 2021-10-27T10:32:02Z

No, sorry, by backbone extraction I mean the process of eliminating edges from a graph to find relevant connections. Currently I use a filtering technique that doesn't take the bipartite structure of the initial network into consideration.

BradKML · 2021-10-27T12:03:40Z

@jboynyc Apologies, but connections on which context?

BradKML · 2021-10-28T04:04:11Z

Backbone seemed be dependent on scale, weightedness and directedness

jboynyc · 2021-10-28T12:17:46Z

Sorry, in other words it's about finding significant edges and discarding insignificant ones. (connections = edges)

BradKML · 2021-10-28T12:29:00Z

Define "significant". Would it see edges that form rings to be less significant (optimizing for spanning trees)? Would methods based on weighted edges weight stronger edges better? Would it want disconnected edges?

If this is hard to describe, would this extraction method apply to topic, terms, or author graphs?

jboynyc · 2021-10-28T12:39:19Z

The definition of "significance" differs by technique. Usually there's a comparison to a null model, with different techniques using different null models.

My question on this issue is specifically about techniques that use information about the bipartite network to aid backbone extraction of projections. Liebig & Rao and this paper outline some techniques, but so far I haven't found any usable implementations.

jboynyc · 2022-11-02T08:57:19Z

Here is an implementation of the bipartite configuration model: https://github.com/mat701/BiCM

I hesitate to add a dependency to this package until I get a chance to study it more closely.

How does it perform?
Is it a problem that the BiCM does not consider edge weights?
How well maintained is it? It has some recent commits, but doesn't seem to have been tested beyond Python 3.8.
Dependencies are mostly overlapping with current dependencies, but it would pull in numba as an additional dependency. If I remove the disparity filter, I would no longer have to depend on cython, so this could be zero sum.

jboynyc · 2023-03-31T11:56:37Z

Another relevant citation supporting my impression that using the disparity filter on projected one-mode networks is not a great idea: https://doi.org/10.1038/s42005-022-00856-9

jboynyc added the enhancement label Oct 22, 2020

jboynyc self-assigned this Oct 22, 2020

jboynyc mentioned this issue Oct 27, 2021

implement more measures to aid interpretation #23

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve backbone extraction #24

improve backbone extraction #24

jboynyc commented Oct 22, 2020

jboynyc commented Nov 10, 2020

jboynyc commented Oct 11, 2021

BradKML commented Oct 27, 2021

jboynyc commented Oct 27, 2021

BradKML commented Oct 27, 2021

BradKML commented Oct 28, 2021

jboynyc commented Oct 28, 2021

BradKML commented Oct 28, 2021

jboynyc commented Oct 28, 2021

jboynyc commented Nov 2, 2022

jboynyc commented Mar 31, 2023

improve backbone extraction #24

improve backbone extraction #24

Comments

jboynyc commented Oct 22, 2020

jboynyc commented Nov 10, 2020

jboynyc commented Oct 11, 2021

BradKML commented Oct 27, 2021

jboynyc commented Oct 27, 2021

BradKML commented Oct 27, 2021

BradKML commented Oct 28, 2021

jboynyc commented Oct 28, 2021

BradKML commented Oct 28, 2021

jboynyc commented Oct 28, 2021

jboynyc commented Nov 2, 2022

jboynyc commented Mar 31, 2023