From 012d80ac6857fa12d0c647982ebf56aeff96ad86 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:14:58 +0800 Subject: [PATCH 01/17] add Schuller library link --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index 96b971d..d01bd3a 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. 
-Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis![](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From 77745e671c6ab340c207b9ebdb567bb9470b169b Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:17:04 +0800 Subject: [PATCH 02/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index d01bd3a..b91af49 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis![](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - ![intelligent audio analysis](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From 0a8f2a39a08424714e57c12b61f45e1b609ece94 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:19:11 +0800 Subject: [PATCH 03/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index b91af49..4514759 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - ![intelligent audio analysis](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![[2](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From 18136eacadad2edde77d66c48679e5d793730b9a Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:20:03 +0800 Subject: [PATCH 04/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index 4514759..0e3a591 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![[2](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![[2]](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From 38b034aa1e3505f39f1dd1ac8d5dfc02e40321b7 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:21:27 +0800 Subject: [PATCH 05/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index 0e3a591..dbdb673 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![[2]](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![ ](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From d77893ad6c581bc6a2477901a347e714901f99f0 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:21:56 +0800 Subject: [PATCH 06/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index dbdb673..82aac3f 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![ ](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: From 6756bee5dab7a218a5a45a101e9360ef6fa51be3 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 15:22:44 +0800 Subject: [PATCH 07/17] adjust styling schuller text reference --- content/topic/against-the-coming-world-of-listening-machines.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index 82aac3f..fff301b 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,7 +8,7 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. 
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis ![](bib:827d1f44-5a35-4278-a527-4df67e5ba321) - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:
From d4027d748972c4586576f150446d099b3f498219 Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 15:42:59 +0800
Subject: [PATCH 08/17] add www links

---
 .../topic/against-the-coming-world-of-listening-machines.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index fff301b..64fb33b 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,11 +8,11 @@ has_lessons: []
 "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref].
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection - scream detection - lie detection - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: -Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "amazon fresh first grocery story")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health - social distancing - music streaming - music education - composition - gaming - brand development - marketing - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance ... 
+Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "amazon fresh first grocery story")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition - gaming - brand development - marketing - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training[[1](https://github.com/project-spectra "Project Spectra: Vocal-gender training software for trans & gender non-conforming people")] As with all forms of machine learning, questions of efficacy, access, privacy, bias, fairness and transparency arise with every use case. But machine listening also demands to be treated as an epistemic and political system in its own right, that increasingly enables, shapes and constrains basic human possibilities, that is making our auditory worlds knowable in new ways, to new institutions, according to new logics, and is remaking (sonic) life in the process. 
From 875ed8d4cc219627e1b03880f51671242646a57d Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:07:21 +0800
Subject: [PATCH 09/17] add www links

---
 .../topic/against-the-coming-world-of-listening-machines.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index 64fb33b..66fb119 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,11 +8,11 @@ has_lessons: []
 "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing.
-Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: -Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "amazon fresh first grocery story")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition - gaming - brand development - marketing - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training[[1](https://github.com/project-spectra "Project Spectra: Vocal-gender training software for trans & gender non-conforming people")] +Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised 
health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing - acoustic ecology [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")]
 As with all forms of machine learning, questions of efficacy, access, privacy, bias, fairness and transparency arise with every use case. But machine listening also demands to be treated as an epistemic and political system in its own right, that increasingly enables, shapes and constrains basic human possibilities, that is making our auditory worlds knowable in new ways, to new institutions, according to new logics, and is remaking (sonic) life in the process.
From 409d742f8f9a39df87479a09996b71622acb262a Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:11:28 +0800
Subject: [PATCH 10/17] test youtube embed

---
 content/topic/against-the-coming-world-of-listening-machines.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index 66fb119..ac30f9d 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,7 +8,7 @@ has_lessons: []
 "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis].
Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. 
-Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition {{ youtube 86I3-VYIvAM }} - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:
From 8be2054455575205b4ae3bcee22785d01e6f435a Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:16:30 +0800
Subject: [PATCH 11/17] test youtube embed

---
 content/topic/against-the-coming-world-of-listening-machines.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index ac30f9d..e7bdd78 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,7 +8,7 @@ has_lessons: []
 "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref].
And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition {{ youtube 86I3-VYIvAM }} - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
+Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:
From f1e57403725440db9c7c10fe55541f8d8d5e01bc Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:20:29 +0800
Subject: [PATCH 12/17] test youtube embed

---
 content/topic/against-the-coming-world-of-listening-machines.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index e7bdd78..c364586 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,7 +8,7 @@ has_lessons: []
 "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints.
Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity 
detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [![1](https://img.youtube.com/vi/86I3-VYIvAM/0.jpg)](https://www.youtube.com/watch?v=86I3-VYIvAM) - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power 
of Voice for Good")]...

These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:

From 3fdac685e786a4cc9827d06a3716da2cf83654e5 Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:23:17 +0800
Subject: [PATCH 13/17] test youtube embed

---
content/topic/against-the-coming-world-of-listening-machines.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index c364586..c5d3693 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,7 +8,7 @@ has_lessons: []

"Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints.
Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [![1](https://img.youtube.com/vi/86I3-VYIvAM/0.jpg)](https://www.youtube.com/watch?v=86I3-VYIvAM) - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age 
determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [![](youtube:86I3-VYIvAM)] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:

From 08ca89e14b5ed2d7b496f80c7d8546036aad71be Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 16:26:26 +0800
Subject: [PATCH 14/17] add link out to youtube video without embed

---
content/topic/against-the-coming-world-of-listening-machines.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index c5d3693..e7bdd78 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,7 +8,7 @@ has_lessons: []

"Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints.
Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [![](youtube:86I3-VYIvAM)] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment 
[[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... 
These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves:

From fa8d3de5c5c2f89d2598c4adf0a166a1c1f291c2 Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 19:10:51 +0800
Subject: [PATCH 15/17] add www links

---
.../topic/against-the-coming-world-of-listening-machines.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index e7bdd78..cdd8704 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,11 +8,11 @@ has_lessons: []

"Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints.
Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. -Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification - gender identification - ethnicity 
detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - 
gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: -Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing - acoustic ecology [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] +Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - 
pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] As with all forms of machine learning, questions of efficacy, access, privacy, bias, fairness and transparency arise with every use case. But machine listening also demands to be treated as an epistemic and political system in its own right, that increasingly enables, shapes and constrains basic human possibilities, that is making our auditory worlds knowable in new ways, to new institutions, according to new logics, and is remaking (sonic) life in the process. 
From 274670f573a3bd2dfffa1d2e762a523d877fbf13 Mon Sep 17 00:00:00 2001
From: Alisa
Date: Thu, 10 Sep 2020 19:37:32 +0800
Subject: [PATCH 16/17] add www links

---
.../topic/against-the-coming-world-of-listening-machines.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md
index cdd8704..b73044b 100644
--- a/content/topic/against-the-coming-world-of-listening-machines.md
+++ b/content/topic/against-the-coming-world-of-listening-machines.md
@@ -8,11 +8,11 @@ has_lessons: []

"Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing.
-Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis - covid diagnosis - machine fault diagnosis - bird sound identification [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the 
Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location [[1](https://www.trillbit.com/trillbit-home-page.html “Contactless ultrasonic authentication protocol”)] - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification [[1](https://www.youtube.com/watch?v=gJCVla9xYUs “Command Lines: Power, Affect and Identity in Networked Interactions”)] - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection [[1](https://www.audeering.com/what-we-do/automotive/ “Cars take care of their passengers”)] - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis [[1](http://www.canaryspeech.com/ “Using 
voice to identify human conditions sooner.”)] - covid diagnosis [[1](https://app.surveylex.com/surveys/5384d6d0-6499-11ea-bc3a-b32c3ca92036 ‘We are launching an initiative to collect your voices with a goal to be able to triage, screen and monitor COVID-19 virus.”)] - machine fault diagnosis - psychosis diagnosis [[1]](https://www.sciencedaily.com/releases/2019/06/190613104552.htm “The whisper of schizophrenia: Machine learning finds 'sound' words predict psychosis”)] - bird sound identification [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: -Digital voice assistants - voice user interfaces - state and corporate surveillance - profiling - border security - home security - pre-emptive policing - weapons systems - court systems - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] +Digital voice assistants - voice user interfaces - state and corporate surveillance [[1](https://paranoid.com/products “Paranoid Home. Data is forever. 
Get Paranoid.”)] - profiling - border security - home security - pre-emptive policing - weapons systems - court systems [[1](https://www.wired.com/story/star-witness-your-smart-speaker/ “Meet the Star Witness: Your Smart Speaker”)] - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring [[1](https://get.cherryhome.ai/care/ “Cherry Home”)] - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing [[1](https://www.veritonic.com/ “Veritonic The Sonic Truth”)] - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] As with all forms of machine learning, questions of efficacy, access, privacy, bias, fairness and transparency arise with every use case. But machine listening also demands to be treated as an epistemic and political system in its own right, that increasingly enables, shapes and constrains basic human possibilities, that is making our auditory worlds knowable in new ways, to new institutions, according to new logics, and is remaking (sonic) life in the process. 
From 12cbe11605ebcf5dfb6192e22a14a813cbdb4d92 Mon Sep 17 00:00:00 2001 From: Alisa Date: Thu, 10 Sep 2020 19:44:44 +0800 Subject: [PATCH 17/17] fix formatting on links --- .../topic/against-the-coming-world-of-listening-machines.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/content/topic/against-the-coming-world-of-listening-machines.md b/content/topic/against-the-coming-world-of-listening-machines.md index b73044b..d833a72 100644 --- a/content/topic/against-the-coming-world-of-listening-machines.md +++ b/content/topic/against-the-coming-world-of-listening-machines.md @@ -8,11 +8,11 @@ has_lessons: [] "Machine listening" is one common term for a fast-growing interdisciplinary field of science and engineering which uses audio signal processing and machine learning to "make sense" of sound and speech [Cella, Serizel, Ellis]. Machine listening is what enables you to be "understood" by Siri and Alexa, to Shazam a song, and to interact with many audio-assistive technologies if you are blind or vision impaired [Alper]. As early as the 90s, the term was already being used in computer music to describe the analytic dimension of 'interactive music systems', whose behavior changes in response to live musical input [Rowe, Maier]. It was also, of course, a cornerstone of the mass surveillance programs revealed by Edward Snowden in 2013: SPIRITFIRE's "speech-to-text keyword search and paired dialogue transcription"; EViTAP's "automated news monitoring"; VoiceRT's "ingestion", according to one NSA slide, of Iraqi voice data into voiceprints. Domestically, machine listening technologies underpin the vast databases of vocal biometrics now held by many prison providers [ref] and, for instance, the Australian Tax Office [ref]. And they are quickly being integrated into infrastructures of development, security and policing. 
-Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location [[1](https://www.trillbit.com/trillbit-home-page.html “Contactless ultrasonic authentication protocol”)] - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification [[1](https://www.youtube.com/watch?v=gJCVla9xYUs “Command Lines: Power, Affect and Identity in Networked Interactions”)] - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music [[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection [[1](https://www.audeering.com/what-we-do/automotive/ “Cars take care of their passengers”)] - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis [[1](http://www.canaryspeech.com/ “Using voice to identify human 
conditions sooner.”)] - covid diagnosis [[1](https://app.surveylex.com/surveys/5384d6d0-6499-11ea-bc3a-b32c3ca92036 ‘We are launching an initiative to collect your voices with a goal to be able to triage, screen and monitor COVID-19 virus.”)] - machine fault diagnosis - psychosis diagnosis [[1]](https://www.sciencedaily.com/releases/2019/06/190613104552.htm “The whisper of schizophrenia: Machine learning finds 'sound' words predict psychosis”)] - bird sound identification [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... +Automatic speech recognition, transcription and translation [Kathy Reid audio] - targeted key word detection - vocal biometrics [[1](https://www.nice.com/engage/real-time-technology/voice-biometrics/ "NICE leverages voice biometrics for safer and more secure customer authentication")] and audio fingerprinting - speaker verification, differentiation, enumeration and location [[1](https://www.trillbit.com/trillbit-home-page.html "Contactless ultrasonic authentication protocol")] - personality and emotion recognition [[1](https://www.youtube.com/watch?v=86I3-VYIvAM "callAIser in action: Call Center agent gets desperate over angry customer")] - accent identification [[1](https://www.youtube.com/watch?v=gJCVla9xYUs "Command Lines: Power, Affect and Identity in Networked Interactions")] - sound recognition - audio object recognition - audio scene analysis - intelligent audio analysis [![](bib:827d1f44-5a35-4278-a527-4df67e5ba321)] - audio event analysis - audio context awareness - music mood analysis - music identification - music playlist generation - audio synthesis - speech synthesis - musical synthesis - adversarial music 
[[1](https://arxiv.org/abs/1911.00126 "Real World Audio Adversary Against Wake-word Detection System")] - audio brand recognition - aggression detection [[1](https://www.audeering.com/what-we-do/automotive/ "Cars take care of their passengers")] - depression detection - laughter detection - stress detection - distress detection - intoxication detection[[1](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3872081/ "Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors")] - scream detection - lie detection - hoax detection[[1](https://amp.abc.net.au/article/12568084 "University of Southern Queensland gets $300k for hoax emergency call detection technology")] - gunshot detection - autism diagnosis - parkinson's diagnosis [[1](http://www.canaryspeech.com/ "Using voice to identify human conditions sooner.")] - covid diagnosis [[1](https://app.surveylex.com/surveys/5384d6d0-6499-11ea-bc3a-b32c3ca92036 "We are launching an initiative to collect your voices with a goal to be able to triage, screen and monitor COVID-19 virus.")] - machine fault diagnosis - psychosis diagnosis [[1](https://www.sciencedaily.com/releases/2019/06/190613104552.htm "The whisper of schizophrenia: Machine learning finds 'sound' words predict psychosis")] - bird sound identification [[1](https://voicebot.ai/2020/06/26/voice-match-is-for-the-birds-new-google-competition-seeks-avian-audio-ai/ "Voice Match is for the Birds")] - gender identification - ethnicity detection - age determination - voice likeability determination - risk assessment [[1](https://www.clearspeed.com/ "Clearspeed: Using the Power of Voice for Good")]... These applications are all either currently in use by states, corporations and other entities around the world, or under development. The list is obviously not exhaustive. 
Nor does it convey the real diversity of markets, cyberphysical and political contexts into which these applications are quickly embedding themselves: -Digital voice assistants - voice user interfaces - state and corporate surveillance [[1](https://paranoid.com/products “Paranoid Home. Data is forever. Get Paranoid.”)] - profiling - border security - home security - pre-emptive policing - weapons systems - court systems [[1](https://www.wired.com/story/star-witness-your-smart-speaker/ “Meet the Star Witness: Your Smart Speaker”)] - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring [[1](https://get.cherryhome.ai/care/ “Cherry Home”)] - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing [[1](https://www.veritonic.com/ “Veritonic The Sonic Truth”)] - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] +Digital voice assistants - voice user interfaces - state and corporate surveillance [[1](https://paranoid.com/products "Paranoid Home. Data is forever. 
Get Paranoid.")] - profiling - border security - home security - pre-emptive policing - weapons systems - court systems [[1](https://www.wired.com/story/star-witness-your-smart-speaker/ "Meet the Star Witness: Your Smart Speaker")] - hospital systems - call centre optimisation - disability services - grocery store wayfinding [[1](https://edition.cnn.com/2020/08/27/business/amazon-fresh-first-grocery-store/index.html "Alexa, what aisle is the milk in?")] - ambient elderly monitoring [[1](https://get.cherryhome.ai/care/ "Cherry Home")] - baby monitoring - house arrest monitoring - ![human rights monitoring](soundcite:static/audio/intro-to-pulse-and-radio-content-analysis.mp3)[^andre_audio_1] - remote education - school security - remote diagnostics - biomonitoring and personalised health[[1](https://twitter.com/voiceome "The Voiceome Project")] - social distancing - music streaming - music education - composition [[1](https://disclaimer.org.au/contents/holly-herndon-and-mat-dryhurst-in-conversation-with-sean-dockray "Inhuman Intelligence")] - gaming - brand development - marketing [[1](https://www.veritonic.com/ "Veritonic The Sonic Truth")] - acoustic ecology - employee performance metrics - wearables - hearables - recruitment - banking - insurance - gender vocal training [[1](https://github.com/project-spectra "Project Spectra")] As with all forms of machine learning, questions of efficacy, access, privacy, bias, fairness and transparency arise with every use case. But machine listening also demands to be treated as an epistemic and political system in its own right, that increasingly enables, shapes and constrains basic human possibilities, that is making our auditory worlds knowable in new ways, to new institutions, according to new logics, and is remaking (sonic) life in the process.