BibTexNanny

BibTexNanny is a tool to check the consistency of BibTex files, fix common mistakes and generate simplified versions of a bibliography.

BibTex Parser

BibTexNanny uses biblib to parse and generate BibTex files.

The following fixes and changes should be made to biblib:

Add BibDesk-compatibility mode for BibTex output
Fix issues with loading bad month information
- Can't replicate issue anymore, not sure what changed.
Add ability to handle duplicate keys
Prevent BibTex Parser from dropping metadata and comment lines
- BibTexNanny internal work-around
When names are parsed, curly braces need to be handled correctly

BibTex file consistency checker

BibTex Fixer

BibTex simplifier

Auxiliary

Dictionary of conference names

Allow full name, name variation, short name
Names should allow for number placeholder
How to link regularly named conferences with years where they were held in conjunction with something?
Additional script to suggest possible name variations

Key formatting

There might already be an open source system for standardising BibTex keys. This is also used by Zotero. Gotra check that out.

Relevant factors for key formatting

Common formats

lastnameYEAR
LastnameYEAR
LastnameYEARkeyword
LastnameYEARdisambig
lastname_keyword_year
TITLEWORD
LastnameYEAR or KEYWORD

How to choose format

Number of hardcoded options
- Easy to implement, little flexibility
RegEx
- Easy to implement, flexible, but limited functionality (can't check other fields)
- Actually, if you use named groups, you could use those names to trigger additional checks for them.
Custom format
- Lots of work to implement, full functionality, probably quite flexible

Field Inference

BibTexNanny Input Parameters

Input methods

Use Python's configparser, which allows INI-like config files

Internal processing

~~Dict~~
- Straightforward, but need to keep the key strings straight
Custom object with lots of boolean fields
- More design effort, but probably more flexible
- Should have different class for each Nanny component
  - As the tasks overlap considerably, there should be a NannyConfig superclass and inherriting classes for the components.
  - Accessing config info should be done via functions, not fields, to allow custom processing of the stored information

Required states for custom variables

Consistency checker

True (check value)
False (don't check value)

Fixer

True/Autofix/Auto (autofix value)
Tryfix/Try (autofix if trivial, otherwise prompt to fix)
Promptfix/Prompt (Prompt to fix)
False (don't check value)

Consistency + Fixer

How information for both scripts can be given in the same config file

Single value for both (Try and Prompt are treated as True)
~~Tuple: False,Tryfix (CONSISTENCY,FIXER)~~
Variables for only one of the two configs, e.g. duplicateKeys-consistency
Different sections for giving instructions for both or just either

Simplifier

Should have separate config files.

Blacklist: List fields that should be removed
Whitelist List only the fields that are wanted
Variables for conversion functions

============================================================

Interface

Good way to set parameters?

Argument calls
- set list of wanted fields (if None, all are wanted)
- Set list of unwanted fields (optional)
Config files
- allows for templates
- More complex to set up
Prompts during processing, asking for user decisions
- Could also be used to auto-generate config files

External information files

LaTeX style files

.bst: BibTex format file (difficult to parse)
.sty: LaTeX style file (can this contain the bst info?)
.cls: LaTeX class file (can this contain the bst info?)

LaTeX temp files

.aux: Lists citations and labels
- Single line to parse: \citation{citationlabel}

BibTexNanny files

Dictionary of conference names
Style config file
Tool config files
- Consistency checker config file
- Fixer config file
- Simplifier config file

BibTex field requirements

We need to be able to check the following aspects for fields:

What type of entry are we looking at?
What are the generally required and optional fields for this entry?
- This bit can be hardcoded as it is always true for all BibTex files
- Look up BibTex documentation to determine these values
For a particular bibliography type, which are the required and optional fields, which fields are ignored?
- Easy solution: Manually create a config file that lists fields as mandatory, optional and ignored
  - Requires config file design
- Better solution: Load style files to automatically extract this kind of information.
  - Are there python tools that can load sty and cls files for us?
Design a config file that allows users to set which info they want to drop and which they need enforced
- List by entry type
  - Allow defining fields for more than one entry type at once
- Define fields as mandatory, optional, unused and maybe as hidden
Three layer approach:
1. In-built BibTex entry definitions
2. Config file for bibliography style requirements
3. Config file for simplification requirements

People working on related tools

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
aux		aux
config		config
doc		doc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
consistency.py		consistency.py
fixer.py		fixer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BibTexNanny

BibTex Parser

BibTex file consistency checker

BibTex Fixer

BibTex simplifier

Auxiliary

Dictionary of conference names

Key formatting

Relevant factors for key formatting

Common formats

How to choose format

Field Inference

BibTexNanny Input Parameters

Input methods

Internal processing

Required states for custom variables

Consistency checker

Fixer

Consistency + Fixer

Simplifier

============================================================

Interface

Good way to set parameters?

External information files

LaTeX style files

LaTeX temp files

BibTexNanny files

BibTex field requirements

People working on related tools

About

Releases

Packages

Languages

License

marcschulder/BibTexNanny

Folders and files

Latest commit

History

Repository files navigation

BibTexNanny

BibTex Parser

BibTex file consistency checker

BibTex Fixer

BibTex simplifier

Auxiliary

Dictionary of conference names

Key formatting

Relevant factors for key formatting

Common formats

How to choose format

Field Inference

BibTexNanny Input Parameters

Input methods

Internal processing

Required states for custom variables

Consistency checker

Fixer

Consistency + Fixer

Simplifier

============================================================

Interface

Good way to set parameters?

External information files

LaTeX style files

LaTeX temp files

BibTexNanny files

BibTex field requirements

People working on related tools

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages