首页   新闻  文摘   行业   产品  技术  厂商  标准  BBS  导航  搜索
呼叫中心 | CRM | 统一通信 | 企业通信 | VoIP | 视像通讯 | 语音应用 | 热点专题

 



Computer Speech : Recognition, Compression, Synthesis (Springer Series in Information Sciences, 35)

by Manfred R. Schroeder


Table of Contents

Introduction
Speech: Natural and Artificial
Voice Coders
Voiceprints for Combat and for Fighting
Crime
The Electronic Secretary
The Human Voice as a Key
Clipped Speech
Frequency Division
The First Circle of Hell: Speech in the Soviet Union
Linking Fast Trains to the Telephone Network
Digital Decapitation
Man into Woman and Back
Reading Aids for the Blind
High-Speed Recorded Books
Spectral Compression for the Hard-of-Hearing
Restoration of Helium Speech
Noise Suppression
Slow Speed for Better Comprehension
Multiband Hearing Aids and Binaural Speech Processors
Improving Public Address Systems
Raising Intelligibility in Reverberant Spaces
Conclusion
A Brief History of Speech
Animal Talk
Wolfgang Ritter von Kempelen
From Kratzenstein to Helmholtz
Helmholtz and Rayleigh
The Bells: Alexander Melville and Alexander Graham Bell
Modern Times
The Vocal Tract
Articulatory Dynamics
The Vocoder and Some of Its Progeny
Formant Vocoders
Correlation Vocoders
The Voice-Excited Vocoder
Center Clipping for Spectrum Flattening
Linear Prediction
Subjective Error Criteria
Neural Networks
Wavelets
Conclusion
Speech Recognition and Speaker Identification
Speech Recognition
Dialogue Systems
Speaker Identification
Word Spotting
Pinpointing Disasters by Speaker
Identification
Speaker Identification for Forensic Purposes
Dynamic Programming
Markov Models
Shannon's Outguessing Machine--A Hidden
Markov Model Analyzer
Hidden Markov Models in Speech Recognition
Neural Networks
The Perceptron
Multilayer Networks
Backward Error Propagation
Kohonen Self-Organizing Maps
Hopfield Nets and Associative Memory
Whole Word Recognition
Robust Speech Recognition
The Modulation Transfer Function
Speech Compression
Vocoders
Digital Simulation
Linear Prediction
Linear Prediction and Resonances
The Innovation Sequence
Single Pulse Excitation
Multipulse Excitation
Adaptive Predictive Coding
Masking of Quantizing Noise
Instantaneous Quantizing Versus Block
Coding
Delays
Code Excited Linear Prediction (CELP)
Algebraic Codes
Efficient Coding of Parameters
Waveform Coding
Transform Coding
Audio Compression
Speech Synthesis
Model-Based Speech Synthesis
Synthesis by Concatenation
Prosody
Speech Production
Sources and Filters
The Vocal Source
The Vocal Tract
Radiation from the Lips
The Acoustic Tube Model of the Vocal Tract
Discrete Time Description
The Speech Signal
Spectral Envelope and Fine Structure
Unvoiced Sounds
The Voiced--Unvoiced Classification
The Formant Frequencies
Hearing
Historical Antecedents
Thomas Seebeck and Georg Simon Ohm
More on Monaural Phase Sensitivity
Hermann von Helmholtz and Georg von Bekesy
Thresholds of Hearing
Pulsation Threshold and Continuity Effect
Anatomy and Basic Capabilities of the Ear
The Pinnae and the Outer Ear Canal
The Middle Ear
The Inner Ear
Mechanical to Neural Transduction
Some Astounding Monaural Phase Effects
Masking
Loudness
Scaling in Psychology
Pitch Perception and Uncertainty
Binaural Hearing--Listening with Both Ears
Directional Hearing
Precedence and Haas Effects
Vertical Localization
Virtual Sound Sources and Quasi-Stereophony
Binaural Release from Masking
Binaural Beats and Pitch
Direction and Pitch Confused
Pseudo-Stereophony
Virtual Sound Images
Philharmonic Hall, New York
The Proper Reproduction of Spatial Sound Fields
The Importance of Lateral Sound
How to Increase Lateral Sounds in Real Halls
Summary
Basic Signal Concepts
The Sampling Theorem and Some Notational
Conventions
Fourier Transforms
The Autocorrelation Function
The Convolution Integral and the Delta Function
The Cross-Correlation Function and the
Cross-Spectrum
A Bit of Number Theory
The Hilbert Transform and the Analytic Signal
Hilbert Envelope and Instantaneous Frequency
Causality and the Kramers--Kronig Relations
Anticausal Functions
Minimum-Phase Systems and Complex Frequencies
Allpass Systems
Dereverberation
Matched Filtering
Phase and Group Delay
Heisenberg Uncertainty and The Fourier Transform
Prolate Spheroidal Wave Functions and Uncertainty
Time and Frequency Windows
The Wigner--Ville Distribution
The Cepstrum: Measurement of Fundamental Frequency
Line Spectral Frequencies
A. Acoustic Theory and Modeling of the Vocal Tract
Introduction
Acoustics of a Hard-Walled, Lossless Tube
Field Equations
Time-Invariant Case
Formants as Eigenvalues
Losses and Nonrigid Walls
Discrete Modeling of a Tube
Time-Domain Modeling
Frequency-Domain Modeling, Two-Port Theory
Tube Models and Linear Prediction
Notes on the Inverse Problem
Analytic and Numerical Methods
Empirical Methods
B. Direct Relations Between Cepstrum and
Predictor Coefficients
Derivation of the Main Result
Direct Computation of Predictor
Coefficients from the Cepstrum
A Simple Check
Connection with Algebraic Roots and Symmetric Functions
Connection with Statistical Moments and
Cumulants
Computational Complexity
An Application of Root-Power Sums to Pitch
Detection
References
General Reading
Selected Journals
A Sampling of Societies and Major Meetings
Glossary of Speech and Computer Terms
Name Index
Subject Index
The Author

 


发表评论


  ·Dialogic IP呼叫中心及增值业务主题研讨会  [11月27 成都]  
  ·面对严峻经济形势,如何降低联络中心成本  [11月26-28 上海 北京] 
  ·招聘:亿迅(中国) 拓敏信息 易谷网络 盈联信息 商路通 怡海软件

  ·《2008中国呼叫中心产业发展研究报告》    免费下载简本  
  ·最新资料:《企业呼叫中心建设指南》 《企业通信案例及方案大全》
  ·免费索取:《多媒体交换机资料》   技术前沿资料:《IP、无线和视频方案》

  ·东进Seegoe Enterprise/Office呼叫中心产品介绍
  ·新太科技企业呼叫中心解决方案
  ·TTS在线演示:InterPhonic 5.5系统

            


企业会员
华瑞中鹏 井星科技 Voxeo
FDS 上海盈联 易宝通讯
加入办法 ->





CTI论坛推荐
·鼎晟DS-iTouch联络中心
·新太科技企业呼叫中心解决方案
·上海维卡推出VN系列电话语音卡
·CTstage 5i客户联络中心-适用大规模分散网点
·三友亚星:上海红孩子电话营销和客服系统
·什么是IP分布式呼叫中心
·语音合成:InterPhonic 5.5在线演示系统
·东进技术:Seegoe Enterprise/Office呼叫中心
   
相关链接
CTI论坛周刊 融合通信专栏
行业案例汇编 免费发布新闻
管理员俱乐部 服务与营销论坛

热 点 专 栏
|业界新闻|论坛文摘|行业应用|产品展示|技术天地|厂商汇总|免责声明|咨询服务|公司简介|联系方法|广告服务|企业会员|

编辑投稿信箱      如何查找厂商联系方法

电话:010-82012787,82079677   传真:010-62041062
呼叫中心建设及运营管理咨询服务:优胜资讯(010)87768798 87768726