styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #551

Hiroshiba · 2023-07-27T23:21:59Z

内容

[vvm] styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #548

の解決です。

manifest.jsonにマッピング情報を記載し、id_relationsに格納して値を取り回すようにしました。

InferenceCoreとStatusの役割がちょっとごちゃごちゃになっている印象を受けました。
VvmModelの情報をとりあえずStatusに持たせてみています。
ちょっとどういう設計にすればいいのかはパッと思いつけませんでした。

その他

crates/voicevox_core/src/voice_model.rs

Hiroshiba · 2023-07-27T23:24:27Z

crates/voicevox_core/src/status.rs

    pub fn predict_duration_session_run(
        &self,
-        style_id: StyleId,
+        model_id: &VoiceModelId,
        inputs: Vec<&mut dyn AnyArray>,
    ) -> Result<Vec<f32>> {


ここはスタイルIDではなくモデルIDを渡す設計の方が正しそうだったのでそうしました

Hiroshiba · 2023-07-27T23:25:07Z

crates/voicevox_core/src/result_code.rs

-        VOICEVOX_RESULT_INVALID_MODEL_INDEX_ERROR => "無効なmodel_indexです\0",
+        VOICEVOX_RESULT_INVALID_MODEL_ID_ERROR => "無効なmodel_indexです\0",


Model IndexではなくModel Idになったけど、このエラーメッセージが1回も使われてなかったのでスルーされてました

crates/voicevox_core/src/voice_model.rs

qryxip · 2023-07-28T16:17:25Z

今CIが落ちていますが、直しかたは次のようになると思います。

test-workflow / rust-integration-test

crates/voicevox_core_c_api/tests/e2e/snapshots.tomlの、compatible_engine.metasの更新
test workflow / c-header
```
❯ cargo xtask update-c-header
```

crates/voicevox_core/src/voice_model.rs

crates/voicevox_core/src/manifest.rs

crates/voicevox_core/src/voice_model.rs

Co-authored-by: Ryo Yamashita <[email protected]>

…うにする

Hiroshiba · 2023-08-02T06:15:48Z

mainに追従して、エラーメッセージがおかしかったのと、ドキュメント的にこっちのがちょっといいかなと思って修正しました e7896bb

qryxip

一点だけありますが、LGTMです!

qryxip · 2023-08-02T15:59:39Z

crates/voicevox_core/src/manifest.rs

 #[derive(Deserialize, Getters, Clone)]
 pub struct Manifest {
    manifest_version: ManifestVersion,
    metas_filename: String,
    decode_filename: String,
    predict_duration_filename: String,
    predict_intonation_filename: String,
+    #[serde(default)]
+    style_id_to_model_inner_id: BTreeMap<StyleId, ModelInnerId>,


今言うのもなんですが、"model_inner_id"という名前に欠点があるとすれば、声(voice)を指すことが少しだけわかりずらくなっているというのがあるかなと思いました。

"speaker_id"という表現はもうcompatible_engine以外で使っていないので、"true_speaker_id"とかにするというのもアリなんじゃないかと思いました。

確かに言われておっしゃる通りなかなか分かりにくくなっているなと思いました。

大事なのはmodel_innerの部分だと思っていて、例えば同じ声を作れるものが、別のモデルでは別のIDになっていることもなくはない感じです。
speakerは声なのか話者なのか一意に定まらないので、やるとしたらmodel_inner_voice_idあたりなのかな～と思いました。

とりあえずこの値はここでしか使われていないから、一旦このままでも通じるのかなと思いました！
ただ分かりにくいのはおっしゃる通りだと思うので、いつか変更する場合はたぶん賛成できます。

ああ言われてみればモデルごとに違ってもいいんですね。
他にあるとすれば...local_voice_idとか...?

確かにmodel含めなくてもいいかもですね！　inner_voice_idとかもありかもです。

Hiroshiba · 2023-08-03T22:19:48Z

レビュー本当にありがとうございます！！！
マージします！！

styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする

2fc2ad9

Hiroshiba commented Jul 27, 2023

View reviewed changes

This was referenced Jul 28, 2023

mutabilityとasyncnessを仕上げる #552

Closed

mutabilityとasyncnessを仕上げる #553

Merged

Hiroshiba added 3 commits July 29, 2023 07:29

zipエラーが出るように

b7072c5

update metas

88ac130

.into()

c13debb

qryxip reviewed Jul 30, 2023

View reviewed changes

crates/voicevox_core/src/voice_model.rs Show resolved Hide resolved

crates/voicevox_core/src/voice_model.rs Outdated Show resolved Hide resolved

crates/voicevox_core/src/manifest.rs Outdated Show resolved Hide resolved

Hiroshiba added 3 commits July 31, 2023 06:07

manifest.rsに移動

6f67812

VOICEVOX#551 (comment) の解決

73756c0

remove unused

5d8e33f

qryxip reviewed Jul 31, 2023

View reviewed changes

crates/voicevox_core/src/voice_model.rs Outdated Show resolved Hide resolved

model_inner_id_for

85dcd1b

qryxip reviewed Aug 1, 2023

View reviewed changes

crates/voicevox_core/src/voice_model.rs Outdated Show resolved Hide resolved

Hiroshiba and others added 3 commits August 2, 2023 15:08

Update crates/voicevox_core/src/voice_model.rs

000e9f1

Co-authored-by: Ryo Yamashita <[email protected]>

Merge branch 'main' into styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるよ…

c765ba2

…うにする

エラーメッセージとドキュメントがおかしかったので修正

e7896bb

Hiroshiba added 4 commits August 2, 2023 15:17

cargo xtask update-c-header

d731bbc

0.15.0-hiroshibavvmmap.0

1704b73

実行確認できたので0.15.0-preview.1へ

41e8b5b

VOICEVOX_FAT_RESOURCE_VERSIONも変えないとなので、とりあえず戻す

e8383e0

qryxip approved these changes Aug 2, 2023

View reviewed changes

Hiroshiba merged commit e0d32a5 into VOICEVOX:main Aug 3, 2023
31 checks passed

Hiroshiba deleted the styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする branch August 3, 2023 22:19

qryxip mentioned this pull request Aug 9, 2023

RustのdoctestをCI #573

Merged

qryxip mentioned this pull request Aug 23, 2023

VVMのマニフェストの形式の再考 #581

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #551

styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #551

Hiroshiba commented Jul 27, 2023

Hiroshiba Jul 27, 2023

Hiroshiba Jul 27, 2023 •

edited

Loading

qryxip commented Jul 28, 2023

Hiroshiba commented Aug 2, 2023

qryxip left a comment

qryxip Aug 2, 2023 •

edited

Loading

Hiroshiba Aug 3, 2023

qryxip Aug 4, 2023

Hiroshiba Aug 5, 2023

Hiroshiba commented Aug 3, 2023

		VOICEVOX_RESULT_INVALID_MODEL_INDEX_ERROR => "無効なmodel_indexです\0",
		VOICEVOX_RESULT_INVALID_MODEL_ID_ERROR => "無効なmodel_indexです\0",

styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #551

styleIdとsession.runに渡す数値が異なっているVVMでも音声合成できるようにする #551

Conversation

Hiroshiba commented Jul 27, 2023

内容

関連 Issue

その他

Hiroshiba Jul 27, 2023

Choose a reason for hiding this comment

Hiroshiba Jul 27, 2023 • edited Loading

Choose a reason for hiding this comment

qryxip commented Jul 28, 2023

Hiroshiba commented Aug 2, 2023

qryxip left a comment

Choose a reason for hiding this comment

qryxip Aug 2, 2023 • edited Loading

Choose a reason for hiding this comment

Hiroshiba Aug 3, 2023

Choose a reason for hiding this comment

qryxip Aug 4, 2023

Choose a reason for hiding this comment

Hiroshiba Aug 5, 2023

Choose a reason for hiding this comment

Hiroshiba commented Aug 3, 2023

Hiroshiba Jul 27, 2023 •

edited

Loading

qryxip Aug 2, 2023 •

edited

Loading