Skip to content
This repository has been archived by the owner on Jul 22, 2024. It is now read-only.

Commit

Permalink
prepare for 0.2.0 release
Browse files Browse the repository at this point in the history
  • Loading branch information
UtaUtaUtau committed Jan 2, 2023
1 parent 211a2a1 commit 5fbad67
Show file tree
Hide file tree
Showing 3 changed files with 12 additions and 8 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -3,4 +3,5 @@ dist/
build/
test/
*.ico
*.bat
straycat.spec
12 changes: 6 additions & 6 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Changelog

## [Unreleased] - 2022-31-12
## [0.2.0] - 2023-01-02

## Added
- Added the g flag.
Expand All @@ -10,24 +10,24 @@
- Changed peak normalization volume to make renders less loud and make early clipping for the A flag happen less.
- Sped up functions using `numba`. Might've helped feature generation and some flags. The functions sped up and their usages are:
- `smoothstep(edge0, edge1, x)`
- Used for flags with "envelopes" (vocal fry flag, voicing flag)
- Used for flags with "envelopes" (e.g. vocal fry flag, voicing flag)
- `clip(x, x_min, x_max)`
- Used generally for clipping things within range. `numpy.clip` exists but for some reason numba made it faster.
- `base_frq(f0, f0_min=None, f0_max=None)`
- Used to get the base frequency from the frequency information. Feature generation might be a little faster because of this.
- Reworked timing system to be more robust.

## [0.1.3] - 2022-27-12
## [0.1.3] - 2022-12-27

## Changed
- Changed P flag default to 86. Makes it so that the A flag doesn't clip as much. Still does though.

## [0.1.2] - 2022-26-12
## [0.1.2] - 2022-12-26

## Fixed
- Skipping rendering if out file is `nul` to not pull up an error when generating `.sc.npz` files through the short frq generation commant UTAU uses. It still generates the `.sc.npz` but I just don't like how unclean it is...

## [0.1.1] - 2022-24-12
## [0.1.1] - 2022-12-24

### Added
- This changelog!
Expand All @@ -36,7 +36,7 @@
### Changed
- Changed flag order for A and P flag. Peak normalization is applied before tremolo now. May cause clipping issues.

## [0.1.0] - 2022-23-12
## [0.1.0] - 2022-12-23

### Added
- First release.
7 changes: 5 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@ optional arguments:

# Example Renders

The renders use straycat dev version. No flags are used except for renders with extreme volume changes, in which the P0 flag is set.
The renders use straycat dev version. No flags are used except for renders with extreme volume changes, in which the P0 flag is set.

**Voicebank**: 櫻花アリス -吾亦紅- / Ouka Alice -Waremokou-

Expand Down Expand Up @@ -94,6 +94,9 @@ https://user-images.githubusercontent.com/29729824/210136998-a0e9fde8-3445-42f4-

**vo(-inf, +inf)** is the offset of the transition from the consonant in milliseconds. Negative values move it earlier. Default is 0.

### g(-inf, +inf)
Gender/Formant shift flag. Shifts the formant of the render, more commonly known as adding gender. 10 units in this flag is equivalent to pitching the sample a semitone without formant preservation and pitching it back with formant preservation.

### B[0, 100]
Breathiness flag. Values lower than 50 lowers breathiness, but it does not have much effect. Values higher than 50 mixes an unvoiced render in, with 100 as being only the unvoiced render. Default is 50.

Expand All @@ -114,7 +117,7 @@ https://user-images.githubusercontent.com/29729824/210136998-a0e9fde8-3445-42f4-
### G
Force feature rerendering. This rerenders the cached file straycat reads which is the `.sc.npz` file. It is a regular Numpy compressed array file.

# Remarks
# Remarks
This resampler is very slow considering it's written in pure Python. It would actually be pretty fast if it wasn't for Python's packages basically having so much stuff that it makes load times way slower. Python's nature of being an interpreted language might also be a big bottleneck, but Python itself has been considerably fast for me. This is just one of those cases... I could technically speed it up by doing an UTAU specific hack, but it might not work with OpenUtau anymore after this.

I don't want to beat myself down that much for this but the slow speed very much ensures complete compatibility to both OpenUtau and classic UTAU. I guess you could say this would be a resampler in the olden days of single-thread resampling. The new Ameya resamplers are fast because of multiprocessing, and the other WORLD-based resamplers are fast because of their compiled nature. This resampler will always underperform because of Python's interpreted nature. Python was made for scripting after all, as much as so many AI models rely heavily on it.

0 comments on commit 5fbad67

Please sign in to comment.