Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In-computer audio input #31

Open
cepsong opened this issue Jul 2, 2024 · 7 comments
Open

In-computer audio input #31

cepsong opened this issue Jul 2, 2024 · 7 comments

Comments

@cepsong
Copy link

cepsong commented Jul 2, 2024

Is there any way to take audio input from inside the computer (any application), maybe via a virtual audio driver. For now, it can only take in default microphone input. Thanks. @sveinbjornt @MrYakobo @adisidev

@sveinbjornt
Copy link
Owner

You should be able to change your default audio input source in System Settings to a virtual audio driver and everything should work. The code in hear just uses the default input source.

@cepsong
Copy link
Author

cepsong commented Jul 6, 2024

You should be able to change your default audio input source in System Settings to a virtual audio driver and everything should work. The code in hear just uses the default input source.

Thank you @sveinbjornt. I am looking at a use case where I need to use my default external microphone to take in my voice, and the virtual audio driver to transcribe what I hear (from a video or other speakers in a virtual meeting) but not my own speech. So can this be modified to take in non-default (any) virtual audio driver? Thanks so much.

@sveinbjornt
Copy link
Owner

Why not do this post hoc? Does this need to be (near) live? If you have both audio channels written to disk there is no problem with post hoc processing using input files.

@cepsong
Copy link
Author

cepsong commented Jul 7, 2024

Why not do this post hoc? Does this need to be (near) live? If you have both audio channels written to disk there is no problem with post hoc processing using input files.

Thank you @sveinbjornt. Yes it has to be live as we are developing real-time ASR of a meeting while the person using it is speaking at the same time in a different language to another program or channel. So the virtual audio driver needs to send the output as input to "hear", while the default microphone is used by the user as input for another program or channel. Can this be done? Thanks.

@sveinbjornt
Copy link
Owner

Should be simple enough to modify the hear source code to do what you want. Maybe I'll implement a flag to specify audio input device at a future date. Keeping this open.

@sveinbjornt sveinbjornt reopened this Jul 9, 2024
@cepsong
Copy link
Author

cepsong commented Jul 9, 2024

Should be simple enough to modify the hear source code to do what you want. Maybe I'll implement a flag to specify audio input device at a future date. Keeping this open.

That is highly appreciated @sveinbjornt. Just a side track, would it be possible to (modify and) extend this to iOS since the services are all provided by Apple?

@sveinbjornt
Copy link
Owner

sveinbjornt commented Jul 9, 2024

Well, hear uess a macOS API (which is probably also available on iOS) but people don't run command line programs on iOS so that's probably a no-go.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants