ปี พ.ศ. 2555 |
1 |
Toward summarization of communicative activities in spoken conversation |
ปี พ.ศ. 2554 |
2 |
HMM-based speech synthesis using an acoustic glottal source model |
3 |
Investigating Non-Uniqueness in the Acoustic-Articulatory Inversion Mapping |
4 |
Automatic Speech Recognition for ageing voices |
5 |
Cross-lingual automatic speech recognition using tandem features |
ปี พ.ศ. 2553 |
6 |
Speech-driven animation using multi-modal hidden Markov models |
7 |
Hierarchical Bayesian Language Models for Conversational Speech Recognition |
8 |
Power Law Discounting for N-Gram Language Models |
9 |
Recognition and Understanding of Meetings |
10 |
A Digital Microphone Array for Distant Speech Recognition |
11 |
Ageing voices: The effect of changes in voice parameters on ASR performance |
12 |
Evaluating speech synthesis intelligibility using Amazon Mechanical Turk |
13 |
Augmentation of adaptation data |
14 |
Transforming Voice Source Parameters in a HMM-based Speech Synthesiser with Glottal Post-Filtering |
ปี พ.ศ. 2552 |
15 |
Hierarchical Reinforcement Learning for Spoken Dialogue Systems |
16 |
Digital Microphone Array - Design, Implementation and Speech Recognition Experiments |
17 |
Modelling Speech Dynamics with Trajectory-HMMs |
18 |
Speech Recognition Using Augmented Conditional Random Fields |
19 |
A Parallel Training Algorithm for Hierarchical Pitman-Yor Process Language Models |
20 |
Age Recognition for Spoken Dialogue Systems: Do We Need It? |
21 |
Evaluation of a hierarchical reinforcement learning spoken dialogue system |
22 |
Meeting decision detection: multimodal information fusion for multi-party dialogue understanding |
23 |
HMM-based Speech Synthesis with an Acoustic Glottal Source Model |
24 |
Robust Speaker-Adaptive HMM-based Text-to-Speech Synthesis |
25 |
Speaker normalisation for large vocabulary multiparty conversational speech recognition |
26 |
Automatic recognition of multiparty human interactions using dynamic Bayesian networks |
ปี พ.ศ. 2551 |
27 |
Global Inference for Sentence Compression: An Integer Linear Programming Approach |
28 |
Automatic determination of sub-word units for automatic speech recognition |
29 |
Predicting Tongue Shapes from a Few Landmark Locations |
30 |
Combining Spectral Representations for Large Vocabulary Continuous Speech Recognition |
31 |
Recognition of Dialogue Acts in Multiparty Meetings using a Switching DBN |
32 |
Interpretation of Multiparty Meetings: The AMI and AMIDA Projects |
33 |
Using Participant Role in Multiparty Meetings as Prior Knowledge for Nonparametric Topic Modeling |
34 |
Glottal Spectral Separation for Parametric Speech Synthesis |
35 |
Unsupervised Language Model Adaptation Based on Topic and Role Information in Multiparty Meetings |
36 |
Pitch adaptive features for LVCSR |
37 |
Acoustic-Articulatory Modelling with the Trajectory HMM |
38 |
Longitudinal study of ASR performance on ageing voices |
39 |
Recognition and Understanding of Meetings: Overview of the European AMI and AMIDA Projects |
40 |
A Cascaded Broadcast News Highlighter |
ปี พ.ศ. 2550 |
41 |
Hierarchical dialogue optimization using semi-markov decision processes. |
42 |
DBN based joint dialogue act recognition of multiparty meetings |
43 |
Automatic dialogue act recognition using a dynamic Bayesian network |
44 |
Towards an improved modeling of the glottal source in statistical parametric speech synthesis |
45 |
Improved Average-Voice-based Speech Synthesis Using Gender-Mixed Modeling and a Parameter Generation Algorithm Considering GV |
46 |
Recognition and interpretation of meetings: The AMI and AMIDA projects |
47 |
Modeling prosodic features in language models for meetings. |
48 |
Term-weighting for summarization of multi-party spoken dialogues |
ปี พ.ศ. 2548 |
49 |
Applying Vocal Tract Length Normalization to Meeting Recordings |
50 |
Content-based access to spoken audio |
51 |
Accessing the spoken word |
52 |
Maximum entropy segmentation of broadcast news |
53 |
Speech and crosstalk detection in multi-channel audio |
54 |
Speaker verification using sequence discriminant support vector machines |
55 |
Automatic summarization of voicemail messages using lexical and prosodic features |
56 |
Evaluating Automatic Summaries of Meeting Recordings |
57 |
Extractive summarization of meeting recordings. |
58 |
Transcription of conference room meetings: an investigation |
59 |
Multistream dynamic Bayesian network for meeting segmentation |
ปี พ.ศ. 2547 |
60 |
Acoustic Space Dimensionality Selection and Combination using the Maximum Entropy Principle |
61 |
Dynamic Bayesian Networks for Meeting Structuring |
62 |
From text summarisation to style-specific summarisation for broadcast news |
63 |
Multi-Stream Segmentation of Meetings |
ปี พ.ศ. 2546 |
64 |
Evaluation of extractive voicemail summarization. |
65 |
Multi-class Extractive Voicemail Summarization |
66 |
Are extractive text summarisation techniques portable to broadcast news? |
67 |
Feature selection for the classification of crosstalk in multi-channel audio |
68 |
Exploring the style-technique interaction in extractive summarization of broadcast news. |
69 |
Audio information access from meeting rooms. |
70 |
SVMSVM: Support vector machine speaker verification methodology. |
71 |
Statistical Language Modelling |
ปี พ.ศ. 2545 |
72 |
ASR System Modeling for Automatic Evaluation and Optimization of Dialogue Systems. |
73 |
Evaluation of Kernal Methods for Speaker Verification and Identification |
74 |
Connectionist Speech Recognition of Broadcast News |
ปี พ.ศ. 2544 |
75 |
Extractive Summarization of Voicemail using Lexical and Prosodic Feature Subset Selection |
76 |
Punctuation annotation using statistical prosody models. |
77 |
The role of prosody in a voicemail summarization system |
78 |
An advanced integrated architecture for wireless voicemail retrieval. |
79 |
The THISL SDR system at TREC-9. |
ปี พ.ศ. 2543 |
80 |
Indexing and retrieval of broadcast news |
81 |
Information extraction from broadcast news |
82 |
Variable word rate N-grams |
83 |
The THISL SDR system at TREC-8 |
84 |
Transcription and Summarization of Voicemail Speech |
85 |
Sentence Boundary Detection in Broadcast Speech Transcripts |
86 |
Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions |
ปี พ.ศ. 2542 |
87 |
Integrated transcription and identification of named entities in broadcast speech. |
88 |
The SPRACH/LaSIE system for named entity identification in broadcast news. |
89 |
An Overview of the SPRACH System for the Transcription of Broadcast News |
90 |
Start-synchronous search for large vocabulary continuous speech recognition. |
91 |
Named entity tagged language models. |
92 |
Topic-based mixture language modelling. |
93 |
The THISL broadcast news retrieval system. |
94 |
Recognition, indexing and retrieval of British broadcast news with the THISL system. |
95 |
The THISL system for indexing and retrieval of broadcast news. |
96 |
Statistical annotation of named entities in spoken audio. |
97 |
Retrieval of broadcast news documents with the THISL system. |
98 |
A latent-variable modelling approach to the acoustic-to-articulatory mapping problem. I |
99 |
Confidence measures from local posterior probability estimates |
ปี พ.ศ. 2541 |
100 |
Confidence Measures Derived from an Acceptor HMM |
101 |
Acoustic Confidence Measures for Segmenting Broadcast News |
102 |
The THISL Spoken Document Retrieval System |
103 |
Confidence Measures for Evaluating Pronunciation Models |
104 |
Experimental Evaluation of Latent Variable Models for Dimensionality Reduction |
105 |
Retrieval of Broadcast News Documents with the THISL System |
106 |
The THISL Spoken Document Retrieval System |
107 |
Dimensionality reduction of electropalatographic data using latent variable models |
ปี พ.ศ. 2540 |
108 |
Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems. |
109 |
Confidence measures for hybrid HMM/ANN speech recognition. |
110 |
Document space models using latent semantic analysis. |
ปี พ.ศ. 2539 |
111 |
The 1995 ABBOT LVCSR system for multiple unknown microphones |
112 |
Efficient evaluation of the LVCSR search space using the NOWAY decoder |
113 |
Phone deactivation pruning in large vocabulary continuous speech recognition |
ปี พ.ศ. 2538 |
114 |
Efficient search using posterior phone probability estimates. |
115 |
The 1994 Abbot hybrid connectionist-HMM large vocabulary recognition system. |
116 |
Recent improvements to the Abbot large vocabulary CSR system. |
117 |
Speaker-Adaptation for Hybrid HMM-ANN Continuous Speech Recognition System |
ปี พ.ศ. 2537 |
118 |
IPA: improved phone modelling with recurrent neural networks |
119 |
Connectionist probability estimators in HMM speech recognition |
120 |
Connectionist model combination for large vocabulary speech recognition |
121 |
Learning temporal dependencies in connectionist speech recognition |
ปี พ.ศ. 2536 |
122 |
Bayesian regularisation methods in a hybrid MLP-HMM system. |
ปี พ.ศ. 2535 |
123 |
CDNN: a context dependent neural network for continuous speech recognition |
124 |
Connectionist probability estimation in the DECIPHER speech recognition system |
125 |
Improving statistical speech recognition |
ปี พ.ศ. 2534 |
126 |
Probability estimation by feed-forward networks in continuous speech recognition. |
ปี พ.ศ. 2532 |
127 |
Radial basis function network for speech pattern classification |
ปี พ.ศ. 2531 |
128 |
Unstable connectionist networks in speech recognition |
129 |
A connectionist approach to speech recognition using peripheral auditory modelling |