conditioning_not_working #9

Draft

johannahom wants to merge 7 commits into main from

Collaborator

johannahom commented Mar 4, 2022

Still need to fix single conditions


          Changed format of input file to csv with headers. See new input files as examples.

07cae0f

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/common/utils.py Outdated

                       parts = line.strip().split(split)
-                      if has_speakers:
+                      #need to add option for all conditions
+                      if has_speakers or has_conditions: #this might be wrong @Johannah

Owner

evdv May 2, 2022 (edited)

True or True = True
is this to say, the last item is either the speaker, or the condition?
given they're both booleans you could do:
has_speakers is not has_conditions
or put has_speakers and has_conditions first, so that this case is already handled and therefore excluded from has_speakers or has_conditions
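
To make the options above concrete, here is a minimal sketch. The helper `describe` is hypothetical and not code from this PR; it only shows how `or`, `is not`, and an ordered if/elif chain treat the four flag combinations.

```python
from itertools import product


def describe(has_speakers: bool, has_conditions: bool) -> str:
    """Hypothetical helper: handle the 'both' case first, so the later
    branches only ever see exactly one flag set."""
    if has_speakers and has_conditions:
        return "speaker+condition"
    elif has_speakers:
        return "speaker"
    elif has_conditions:
        return "condition"
    return "neither"


for s, c in product([False, True], repeat=2):
    # `s or c` is True when both flags are set;
    # `s is not c` acts as XOR for booleans, so it is False in that case.
    print(s, c, "| or:", s or c, "| is not:", s is not c, "|", describe(s, c))
```

The ordered chain is longer than the XOR form, but it names each case explicitly, including the one where both columns are present.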

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/data_function.py

                       if len(self.audiopaths_and_text[0]) < expected_columns:
                           raise ValueError(f'Expected {expected_columns} columns in audiopaths file. '
-                                           'The format is <mel_or_wav>|[<pitch>|]<text>[|<speaker_id>]')
+                                           'The format is <mel_or_wav>|[<pitch>|]<text>[|<speaker_id>|<condition_id>]')

Owner

evdv May 2, 2022

oh, I guess we're checking it here?

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/data_function.py

                                symbol_set='english_basic',
                                p_arpabet=1.0,
                                n_speakers=1,
+                               n_conditions=1,

Owner

evdv May 2, 2022

The default is 1, but to have conditions there should be more than 1? If 1 is a way of saying there are no conditions, why not 0?

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/data_function.py Outdated

                   def __getitem__(self, index):
                       # Separate filename and text
-                      if self.n_speakers > 1:
+                      if self.n_speakers > 1 and self.n_conditions < 1:

Owner

evdv May 2, 2022

this block is a little cumbersome, but I really appreciate how legible it is (appreciation comment)

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/data_function.py Outdated

-                      if self.n_speakers > 1:
+                      #specifying the fields
+                      if self.n_speakers > 1 and self.n_conditions < 1:

Owner

evdv May 2, 2022

what? my brain is tired and can't exactly figure out what's going on here
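
One possible reading of the four cases, sketched under stated assumptions: a count above 1 means the column is present, and the optional columns trail the row with the speaker before the condition, as in the format string `<mel_or_wav>|[<pitch>|]<text>[|<speaker_id>|<condition_id>]`. `split_row` is a hypothetical helper, not code from this PR. Note that with the default `n_conditions=1`, the test `self.n_conditions < 1` evaluates to False.

```python
def split_row(row, n_speakers=1, n_conditions=1):
    """Hypothetical parser for one row of the audiopaths file.

    Assumes a count > 1 means the corresponding trailing column exists,
    and that the condition column (if any) comes after the speaker column.
    """
    fields = list(row)
    # Pop from the end: condition is the last column, speaker the one before.
    condition = fields.pop() if n_conditions > 1 else None
    speaker = fields.pop() if n_speakers > 1 else None
    return fields, speaker, condition


# Both optional columns present
print(split_row(["a.wav", "hello", "3", "1"], n_speakers=4, n_conditions=2))
# Only the condition column present
print(split_row(["a.wav", "hello", "1"], n_conditions=2))
# Neither present
print(split_row(["a.wav", "hello"]))
```

Handling each optional column independently avoids enumerating every speaker/condition combination in separate branches.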

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/data_function.py


		audiopaths = [batch[i][7] for i in ids_sorted_decreasing]

		if batch[0][8] is not None:

Owner

evdv May 2, 2022

I imagine this is the bit that would need updating once the other code is merged?

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/model.py

                       (inputs, input_lens, mel_tgt, mel_lens, pitch_dense, energy_dense,
-                       speaker, attn_prior, audiopaths) = inputs
+                       speaker, attn_prior, audiopaths, condition) = inputs

Owner

evdv May 2, 2022

as a side note, I am wondering if we should make inputs/outputs some enum or data type, so that we don't rely on indices to get different things out
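
A minimal sketch of what such a data type could look like, using `typing.NamedTuple`. The class name `FastPitchBatch` is hypothetical; the field names and their order simply mirror the tuple unpacked in the diff above.

```python
from typing import Any, NamedTuple, Optional


class FastPitchBatch(NamedTuple):
    """Hypothetical container for the model inputs (not part of this PR)."""
    inputs: Any
    input_lens: Any
    mel_tgt: Any
    mel_lens: Any
    pitch_dense: Any
    energy_dense: Any
    speaker: Any
    attn_prior: Any
    audiopaths: Any
    condition: Optional[Any] = None  # new field, optional so old callers still work


# Placeholder values stand in for tensors here.
batch = FastPitchBatch(*range(9))

# Fields are read by name instead of by index such as batch[7].
print(batch.audiopaths, batch.condition)

# Existing positional unpacking keeps working, since it is still a tuple.
(inputs, input_lens, mel_tgt, mel_lens, pitch_dense, energy_dense,
 speaker, attn_prior, audiopaths, condition) = batch
```

Because a named tuple is still a tuple, index-based code would keep running while call sites migrate to attribute access.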

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/model.py

                       # Predict pitch
-                      pitch_pred = self.pitch_predictor(enc_out, enc_mask).permute(0, 2, 1)
+                      pitch_pred = self.pitch_predictor(enc_out, enc_mask).permute(0, 2, 1) #maybe we want to condition pitch prediction on the conditioning parameter.

Owner

evdv May 2, 2022

cool idea, we should make a ticket for this

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/transformer.py

                           )
-                  def forward(self, dec_inp, seq_lens=None, conditioning=0):
+                  def forward(self, dec_inp, seq_lens=None, conditioning=0, conditioning_2=0): #here when called we add speaker or other discrete condition

Owner

evdv May 2, 2022

you could make condition a tuple, or rename the conditionings to conditioning_speaker and conditioning_other
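
Both options could be sketched as follows. These are plain-Python stand-ins, not the real `forward`: plain floats replace tensors, and combining the conditionings by addition is an assumption made only for illustration.

```python
def forward_named(dec_inp, seq_lens=None, *,
                  conditioning_speaker=0.0, conditioning_other=0.0):
    """Hypothetical variant 1: keyword-only names make each conditioning
    explicit at the call site."""
    return dec_inp + conditioning_speaker + conditioning_other


def forward_tuple(dec_inp, seq_lens=None, conditioning=()):
    """Hypothetical variant 2: a tuple accepts any number of conditionings."""
    return dec_inp + sum(conditioning)


print(forward_named(1.0, conditioning_speaker=0.5, conditioning_other=0.25))
print(forward_tuple(1.0, conditioning=(0.5, 0.25)))
```

The named variant documents intent better; the tuple variant scales if more conditioning signals are added later.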

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/train.py

                       prepare_tmp(args.pitch_online_dir)
-                  trainset = TTSDataset(audiopaths_and_text=args.training_files, **vars(args))
+                  trainset = TTSDataset(audiopaths_and_text=args.training_files, **vars(args)) #making changes here ./fastpitch/data_function.py

Owner

evdv May 2, 2022

I think this comment can be deleted?

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/inference.py

                   gen_kw = {'pace': args.pace,
                             'speaker': args.speaker,
+                            'condition': args.condition, #@Johannah have to add condition here

Owner

evdv May 2, 2022

I think this comment can be deleted?

evdv reviewed

PyTorch/SpeechSynthesis/FastPitch/fastpitch/model.py

                   def infer(self, inputs, pace=1.0, dur_tgt=None, pitch_tgt=None,
                             energy_tgt=None, pitch_transform=None, max_duration=75,
-                            speaker=0):
+                            speaker=0, condition=0):

Owner

evdv May 2, 2022

so because this is the condition index, the default is 0 (even though "no condition" is n_conditions = 1, and so far there can only be 1 condition)?
Just making sure I understand, once again my brain is melting
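
A small sketch of the count-versus-index relationship, assuming `n_conditions` is the size of a lookup table and `condition` is an index into it (the PR does not confirm this; `check_condition` is a hypothetical helper):

```python
def check_condition(condition: int, n_conditions: int = 1) -> int:
    """Hypothetical validator: a table with n_conditions entries
    is indexed from 0 to n_conditions - 1."""
    if not 0 <= condition < n_conditions:
        raise IndexError(
            f"condition {condition} is out of range for n_conditions={n_conditions}")
    return condition


# With n_conditions=1 the only valid index is 0, matching the default condition=0.
print(check_condition(0, n_conditions=1))
# With n_conditions=3 the valid indices are 0, 1 and 2.
print([check_condition(i, n_conditions=3) for i in range(3)])
```

Under that assumption a count of 1 and a default index of 0 are consistent with each other.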

Owner

evdv commented May 2, 2022

@johannahom my apologies for making all of these separately, but mostly tomorrow I need to train a model from this branch to see how it goes

johannahom added 3 commits

May 10, 2022 16:48


          This is a test

89fbe48


          Working conditioning

40a93c2


          Working conditioning

878115a

evdv force-pushed the conditioning branch from 9a33997 to 878115a

May 12, 2022 18:59

evdv added 3 commits

May 16, 2022 15:23


          Fix print statement

5e54a81


          Add LJ files with random condition

67097fc


          Cleanup commit from dudley

2d0a53f
