Feature/voice analysis #30

Nathanlauga · 2022-12-23T09:36:24Z

No description provided.

… system and heavy file constraints

…riments

Add comments

…dencies

TheoLvs

Hello,

Pas mal de choses à revoir dans cette PR @Lokhia @DnzzL avant de la merge sur main. Le plus important c'est qu'elle ne respecte pas pour le moment les conventions du repo c'est à dire en philosophie librairie avec des modules interopérables. Il faut qu'on puisse facilement pouvoir faire
``from bechdelai.audio import ...```
Sinon ça va être impossible à déployer et à rendre interopérable avec le reste.
Les autres commentaires sont dans les fichiers.

Aussi on va rajouter une règle, il faut qu'on ait un notebook qui explique les fonctionnalités dans le dossier racine tutorials avant d'être validé sur main (parce qu'à terme c'est ce qui partira en prod sur PyPi).

En attendant sur la partie audio pour la démo restons sur la branche feature/voice_analysis le temps de le faire évoluer proprement :)

bechdelai/audio/main.py

bechdelai/audio/dependencies.py

bechdelai/audio/OLD/speech_transcriber.py

bechdelai/audio/audio_processor.py

bechdelai/audio/dialogue_tagger.py

pyproject.toml

TheoLvs · 2023-02-25T11:10:50Z

pyproject.toml

@@ -1,12 +1,12 @@
 [tool.poetry]
 name = "bechdelai"
-version = "0.0.1-alpha.2"
+version = "0.1.0"


Restons sur une version 0.0.2 pour le prochain bump

pyproject.toml

TheoLvs · 2023-02-25T11:11:37Z

pyproject.toml

-openpyxl = "^3.0.10"
-pytube = "^12.1.0"
-mediapipe = "^0.9.0"
-scenedetect = {extras = ["opencv"], version = "^0.6.1"}


Il faut garder ces versions dans le merge

pyproject.toml

bechdelai/audio/OLD/gender_identifier.py

bechdelai/audio/OLD/speech_transcriber.py

DnzzL · 2023-04-07T13:55:41Z

bechdelai/audio/gender_segmenter.py

+                            columns=['gender', 'start', 'end'])
+
+
+    def _convert_whisper_output(self,segments:pd.DataFrame) -> list:


je trouve le nom de la fonction pas hyper clair

DnzzL · 2023-04-07T13:56:39Z

bechdelai/audio/transcriber.py

+            print("Could not request results from Google Speech Recognition service; {0}".format(e))
+
+
+class WhisperAPI(Transcriber):


j'aimerais bien de la docstring pour clarifier l'usage entre WhisperAPI et Whisper

DnzzL · 2023-04-07T13:56:57Z

bechdelai/audio/utils.py

+from moviepy.video.io.ffmpeg_tools import ffmpeg_extract_subclip
+
+
+def cut_and_save(movie_path: str, start: float, end: float, target_name: str) -> None:


DnzzL · 2023-04-07T13:58:23Z

notebooks/audio/OLD/.env.example

+path_to_extract=<path_to_extract_file>
+path_to_audio=<path_to_audio_file>
+path_to_full_movie=<path_to_full_movie_file>
+path_to_trailer=<path_to_trailer_file>


OLD à supprimer ?

DnzzL · 2023-04-07T13:58:35Z

notebooks/audio/OLD/README.md

+#Readme pour l'audio
+## Installation
+* .env
+  * créer un fichier ".env" en local pour y placer le chemin vers la vidéo, comme dans .env.example
+* poetry update / poetry install
+* poetry run python .\gender_identification.py
+* ffmpeg codex (pour Windows, suivre les instructions [ici](https://www.geeksforgeeks.org/how-to-install-ffmpeg-on-windows/) -
+pas besoin de mettre à la racine en admin et de redémarrer ;
+pour ubuntu `$ sudo apt-get install ffmpeg`, voir la [doc du projet](https://github.com/ina-foss/inaSpeechSegmenter))


Lokhia added 6 commits February 1, 2023 16:52

Adapt artpech's imports for ina speech segmenter to python and poetry…

5ec016a

… system and heavy file constraints

Gendered audio segmentation based on inaSpeechSegmenter and some expe…

b0d00fd

…riments

Speaking time according to gender implemented

db5f243

Speech to text implementation

eac60f9

Extract to csv now available

1429e72

Doing full movie pipeline

f9ffc8e

DnzzL force-pushed the feature/voice_analysis branch from 6177b88 to f9ffc8e Compare February 1, 2023 15:53

DnzzL force-pushed the feature/voice_analysis branch from dd79922 to ecf17c6 Compare February 13, 2023 13:12

DnzzL added 2 commits February 13, 2023 14:14

Add Whisper automatic speak recognition

bfb61a1

Extract from notebook

fe3f487

Add comments

DnzzL force-pushed the feature/voice_analysis branch from ecf17c6 to fe3f487 Compare February 13, 2023 13:14

DnzzL and others added 8 commits February 15, 2023 12:02

[FIX] Output length + types

29a6e08

Small renaming and adding docstrings

ff0f003

Poetry update and some audio tests

63d235d

Refactoring audio processing package - transcribers gestion

35043f4

Refactoring audio processing - gender segmenter gestion

3b95203

Refactoring audio processing - dialogue tagger gestion

e6f0b5f

Refactoring audio processing - Audio Processor, main and poetry depen…

9da1078

…dencies

Merge remote-tracking branch 'origin/main' into feature/voice_analysis

7dee943

Lokhia marked this pull request as ready for review February 23, 2023 18:52

Lokhia requested a review from TheoLvs February 23, 2023 18:52

TheoLvs requested changes Feb 25, 2023

View reviewed changes

Lokhia and others added 8 commits February 26, 2023 12:45

Archive previous audio notebook work

4ca2cd4

Update an properly merge pyproject from main

907cdfe

Minor changes in speech to text to make it work with default API key :)

493e531

Updating poetry lock and toml

70d32bb

Transform all audio code to functionable library with tutorial

1cfab8f

Include Us English profile and tutorial

c620f9a

Added whisper API in the pipeline

af4dfd0

Updated demo

73e5269

DnzzL requested changes Apr 7, 2023

View reviewed changes

Remove deprecated code

2bda1d5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/voice analysis #30

Feature/voice analysis #30

Nathanlauga commented Dec 23, 2022

TheoLvs left a comment •

edited

Loading

TheoLvs Feb 25, 2023

TheoLvs Feb 25, 2023

Lokhia Feb 25, 2023

DnzzL Apr 7, 2023

DnzzL Apr 7, 2023

DnzzL Apr 7, 2023

DnzzL Apr 7, 2023

DnzzL Apr 7, 2023

		columns=['gender', 'start', 'end'])


		def _convert_whisper_output(self,segments:pd.DataFrame) -> list:

		print("Could not request results from Google Speech Recognition service; {0}".format(e))


		class WhisperAPI(Transcriber):

		from moviepy.video.io.ffmpeg_tools import ffmpeg_extract_subclip


		def cut_and_save(movie_path: str, start: float, end: float, target_name: str) -> None:

Feature/voice analysis #30

Are you sure you want to change the base?

Feature/voice analysis #30

Conversation

Nathanlauga commented Dec 23, 2022

TheoLvs left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TheoLvs left a comment •

edited

Loading