Uncloudy voice assistant: Difference between revisions

Latest revision as of 14:18, 21 March 2023

How to access automatic subtitling in an interview, conferencing, gathering, conversation in a counter cloud way?

For example in the case of a counter cloud video conferencing like BBB and what happens when somebody complains about the non-seamlessness.

Some thoughts on machine listening:

The process of machine listening has been supported and developed technically through legal, state and military cases and developments. Clouds and voice databases are also rooted in this.

We can unpack the processes of speech recognition in interviewing, transcribing, listening, training, listening to patterns, predicting and follow hybrid ways of automated subtitling, that include computing and not.

Some algorithms

Speech recognition tools

from pocketsphinx import LiveSpeech
for phrase in LiveSpeech(): 
   # how to pause the mic occasionally so the live speech print appears often
   # check timer thread
   print(phrase)

Gentle

Poetic experiments

Gossip Booth: recognise words and keep only the ones with no meaning, like vocal expressions.
Transcribe only phonemes with coqui STT and CMU Sphinx
Radioactive Monstrosities: voice upload and vocal transformation on a browser with the use of Web Audio API
Remnants of future voices: cloning voice scripts with an interface
WordMord Para-dictionary: replace words of a text/transcription with a collective made dictionary
Pincelate: Nonsense verse (and other creative uses of spelling) with Python and machine learning by Allison Parrish

Older threads:

https://pzwiki.wdka.nl/mediadesign/User:Angeliki/Ttssr-Speech_Recognition_Iterations
classes with Amy
workshop at Leipzig
scripts with WordMord
workshop and references with Jon
Radioactive

references:

https://github.com/lowerquality/gentle/pulse
Meeting/Presentation/discussion with Xiaochang Li in De Krook, Gent. Notes documentation: https://mensuel.framapad.org/p/algolit_documentation_190524 , note Xiaochang Li :https://mensuel.framapad.org/p/algolit_xiaochang_li_190524 https://pubmed.ncbi.nlm.nih.gov/31231075/

@@ Line 14: / Line 14: @@
 ==== Speech recognition tools ====
-* Vosk: https://alphacephei.com/vosk/install
+* [https://alphacephei.com/vosk/install Vosk]
-* otter.ai: https://otter.ai/
+* [https://otter.ai/ otter.ai]
-* Speech_recognition with Pocketsphinx: https://gitlab.com/nglk/speaking_with_the_machine/-/blob/master/ttssr2_transcribe.py
+* [https://gitlab.com/nglk/speaking_with_the_machine/-/blob/master/ttssr2_transcribe.py Speech_recognition with Pocketsphinx]
-* Livespeech with Pocketsphinx: https://pypi.org/project/pocketsphinx/
+* [https://pypi.org/project/pocketsphinx/ Livespeech with Pocketsphinx]
   from pocketsphinx import LiveSpeech
   for phrase in LiveSpeech():
@@ Line 23: / Line 23: @@
      # check timer thread
      print(phrase)
-* Gentle: https://github.com/lowerquality/gentle/pulse
+* [https://github.com/lowerquality/gentle/pulse Gentle]
-==== Experiments ====
+==== Poetic experiments ====
 * [https://gitlab.com/nglk/gossip-gaps/-/blob/master/gossip-gaps.py Gossip Booth]: recognise words and keep only the ones with no meaning, like vocal expressions.
 * Transcribe only phonemes with coqui STT and CMU Sphinx
 * [https://gitlab.com/nglk/radioactive Radioactive Monstrosities]: voice upload and vocal transformation on a browser with the use of Web Audio API
 * [https://github.com/jreus/chorusworkshop Remnants of future voices]: cloning voice scripts with an interface
-* WordMord Para-dictionary:
+* [https://gitlab.com/nglk/para-dictionary-wordmord WordMord Para-dictionary]: replace words of a text/transcription with a collective made dictionary
+* [https://github.com/aparrish/nonsense-verse-pycon-2020 Pincelate]: Nonsense verse (and other creative uses of spelling) with Python and machine learning by Allison Parrish

Uncloudy voice assistant: Difference between revisions

Latest revision as of 14:18, 21 March 2023

Contents

How to access automatic subtitling in an interview, conferencing, gathering, conversation in a counter cloud way?

Some algorithms

Speech recognition tools

Poetic experiments

Navigation menu

Uncloudy voice assistant: Difference between revisions

Latest revision as of 14:18, 21 March 2023

How to access automatic subtitling in an interview, conferencing, gathering, conversation in a counter cloud way?

Some algorithms

Speech recognition tools

Poetic experiments

Navigation menu

Search