東北大学大学院工学研究科伊藤・能勢研究室

国際会議(査読あり) : ～2013年

■Speech Recognition under Noisy Environments using Multiple Microphones Based on Asynchronous and Intermittent Measurements
Kohei Machida, Akinori Ito
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific, pp. 1-4
APSIPA ASC 2013

■Evaluation of Sinusoidal Modeling for Polyphonic Music Signal
Yuki Igarashi,Masashi Ito,Akinori Ito
IIH-MSP 2013 2013/10

■Acoustic Features and Auditory Impressions of Death Growl and Screaming Voice
Keizo Kato,Akinori Ito
IIH-MSP 2013 2013/10

■Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal
Yohei Abe, Akinori Ito
IIH-MSP 2013 2013/10

■Packet Loss Recovery of G.729 speech using discriminative model and N-gram
Takeshi NAGANO, Akinori ITO
IIH-MSP 2013 2013/10

■Estimation of User's State During a Dialog Turn with Sequential Multi-modal Features
Yuya Chiba, Masashi Ito, Akinori Ito
HCI International 2013 - Posters' Extended Abstracts, Part II, Vol. 29 pp.572-576
HCI International 2013 2013/6

■ASAHI: OK for failure: a robot for supporting daily life, equipped with a robot avatar
Yutaka Hiroi, Akinori Ito
Human-Robot Interaction (HRI), pp. 141-142
HRI 2013 2013/3

2012年

■Spoken document retrieval by discriminative modeling in a high dimensional feature space
Proceedings of International Conference on Acoustics, Speech and Signal Processing, (2012), 5153-5156
Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito

■Effect of Robot Height on Comfortableness of Spoken Dialog
Proceedings of 5th International Conference on Human System Interaction, (2012) CD-ROM
Yutaka Hiroi, Takayuki Nakayama, Hisanori Kuroda, Shinji Miyake and Akinori Ito

■Estimation of User's Internal State before the User's - First Utterance Using Acoustic Features and Face Orientation
Proceedings of 5th International Conference on Human System Interaction, (2012) CD-ROM
Yuya Chiba, Masashi Ito and Akinori Ito

■Effect of linguistic contents on human estimation of internal state of dialog system users
Proceedings of Interdisciplinary Workshop on Feedback Behavior in Dialogs, 74-78
Yuya Chiba, Masashi Ito, Akinori Ito

■Packet Loss Concealment of VoIP Under Severe Loss Conditions
Proceedings of International Symposium on Wireless Personal Multimedia Communication, (2012)　486-487
Akinori Ito and Takeshi Nagano

■A Spoken Dialogue System Using Virtual Conversational Agent with Augmented Reality
Shinji Miyake, Akinori Ito
APSIPA ASC, PS.1-SLA.3 Speech Recognition (I), Dec(2012)

■A Japanese Lyrics Writing Support System for Amateur Songwriter
Chihiro Abe, Akinori Ito
APSIPA ASC, PS.2-SLA.4 Audio & Music Processing (I), Dec(2012)

■Recognition of Utterances with Grammatical Mistakes based on Optimization of Language Model towards Interactive CALL Systems
Takuya Anzai, Akinori Ito
APSIPA ASC, OS.15-SLA.7 Speech Recognition (II), Dec(2012)

■A Packet Loss Recovery of G.729 Speech Under Severe Packet Loss Condition
Takeshi Nagano, Akinori Ito
APSIPA ASC, PS.5-SLA.18 Speech Coding and Processing and Recognition, Dec(2012)

2011年

■Utterance Classification for Combination of Multiple Simple Dialog Systems
Proc. ISPAW, (2011), 171-176
Seongjun Hahm, Akinori Ito, Awano Kentaro, Masashi Ito, and Shozo Makino

■Bit Rate Reduction of the MELP Coder Using Lempel-Ziv Segment Quantization
Minoru KOHATA, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO
Proceedings of International Conference on Acoustics, Speech and Signal Processing, (2011), 5240-5244

■Round-Robin Duel Discriminative Language Models in One-Pass Decoding with On-the-fly Error Correction
Takanobu Oba, Takaaki Hori, Akinori Ito, Atsushi Nakamura
Proceedings of International Conference on Acoustics, Speech and Signal Processing, (2011), 5588-5591

■Evaluation of Abnormal Sound Detection using Multi-stage GMM in various Environments
Proc. Interspeech, (2011), 301-304
Akinori Ito, Akihito Aiba, Masashi Ito, and Shozo Makino

■Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition
Proc. Interspeech, (2011), 1465-1468
Ryo Masumura, Seongjun Hahm, and Akinori Ito

■Language model expansion using webdata for spoken document retrieval
Proc. Interspeech, (2011), 2133-2136
Ryo Masumura, Seongjun Hahm, and Akinori Ito

■Manipulating vocal signal in mixed music sounds using small amount of side information
Proc. Int. Conf. IIH-MSP, (2011), 298-301
Yuto Sasaki, and Akinori Ito

■Find out what a user is doing before the first utterance: discrimination of user's internal state using non-verbal information
Proc. APSIPA ASC, (2011)
Yuya Chiba, and Akinori Ito

■A System for Evaluating Singing Enthusiasm for Karaoke
Proc. ISMIR, (2011), 31-36
Ryunosuke Daido, Seong-Jun Hahm, Masashi Ito, Shozo Makino, and Akinori Ito

2010年

■ASPECT-MODEL-BASED REFERENCE SPEAKER WEIGHTING
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (2010), 4302-4305
Seongjun Hahm, Yuichi Ohkawa, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino

■Document Expansion using Relevant Web Documents for Spoken Document Retrieval
Proceedings of 6th International Conference on Natural Language Processing and Knowledge Engineering, (2010), 612-619
Ryo Masumura, Akinori Ito, Yu Uno, Masashi Ito, Shozo Makino

■Multiple description coding for MP3 coded sound signal
Journal of Information Hiding and Multimedia Signal Processing, (2010), Volume 1, Number 4, 269-285
Ho-seok Wey, Akinori Ito, Takuma Okamoto, Yoiti Suzuki

■An HMM‐based segment quantizer and its application to low bit rate speech coding
Proceedings of International Congress on Acoustics, (2010)
Motoyuki Suzuki, Masashi Adachi, Minoru Kohata, Akinori Ito, Shozo Makino, Fuji Ren

■EVALUATION OF HEAD SIZE OF AN INTERACTIVE ROBOT USING AN AUGMENTED REALITY
Proceedings of International Symposium on Robotics and Applications, (2010)
Yutaka Hiroi, Shuhei Hisano, Akinori Ito

■An Effect of Formant Amplitude in Vowel Perception
Proceedings of Interspeech, (2010), 2490-2493
Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano

■A Query-by-Humming Music Information Retrieval from Audio Signals based on Multiple F0 Candidates
Proceedings of International Conference on Audio, Language and Image Processing, (2010)
Akinori Ito, Yu Kosugi, Shozo Makino, Masashi Ito

■A SPOKEN DIALOG SYSTEM BASED ON AUTOMATICALLY-GENERATED EXAMPLE DATABASE
Proceedings of International Conference on Audio, Language and Image Processing, (2010)，732-736
Akinori Ito, Takahiro Morimoto, Masashi Ito, Shozo Makino

■Grammatical error detection from English utterances spoken by Japanese
Proceedings of 2nd Asian-Pacific Signal and Information Processing Association Annual Summit and Conference, (2010), 482-485
Takuya Anzai, Seongjun Hahm, Akinori Ito, Masashi Ito and Shozo Makino

■Speech Recognition Based on Tree-Structured Clustering and Aspect Model in Multiple Noise Environments
Proceedings of 2nd Asian-Pacific Signal and Information Processing Association Annual Summit and Conference, (2010), 454-457
Seong-Jun Hahm, Yuichi Ohkawa, Motoyuki Suzuki, Masashi Ito, Shozo Makino and Akinori Ito

■Evaluation of head size of an interactive robot using augmented reality
Proceedings of International Symposium on Robotics and Automation, (2010), CD-ROM
Yutaka Hiroi, Shuhei Hisano, Akinori Ito

2009年

■INFORMATION HIDING FOR G.711 SPEECH BASED ON SUBSTITUTION OF LEAST SIGNIFICANT BITS AND ESTIMATION OF TOLERABLE DISTORTION
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (2009), 1409-1412.
Akinori Ito, Shun'ichiro Abe, Yoiti Suzuki

■Detection of Abnormal Sound Using Multi-stage GMM for Surveillance Microphone
Proceedings of International Conference of Information Assurance and Security (2009), 733-736.
Akinori Ito, Akihito Aiba, Masashi Ito, Shozo Makino

■A Band Extension of G.711 Speech with Low Computational Cost for Data Hiding Application
Proceedings of 5th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, (2009), 491-494.
Akinori Ito, Hironori Handa, Yoiti Suzuki

■Data Hiding is a Better Way for Transmitting Side Information for MP3 Bitstream
Proceedings of 5th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, (2009), 495-498.
Akinori Ito, Shozo Makino

■Relative importance of formant and whole-spectral cues for vowel perception
Proceedings of Interspeech, (2009), 124-127.
Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano

■Evaluation of English Intonation based on Combination of Multiple Evaluation Scores
Proceedings of Interspeech, (2009), 596-599.
Akinori Ito, Tomoaki Konno, Masashi Ito and Shozo Makino

■Detailed description of triphone model using SSS-free algorithm
Proceedings of Interspeech, (2009), 1399-1402.
Motoyuki Suzuki, Daisuke Homma, Akinori Ito and Shozo Makino

■Multiple Description Coding of Flash Video based on Adaptive Allocation of DCT Coefficients
Proceedings of 1st Asian-Pacific Signal and Information Processing Association Annual Summit and Conference, (2009), 453-456.
Akinori Ito, Takuya Kuraishi, Masashi Ito and Shozo Makino

■Multiple Description Coding for Wideband Audio Signal Transmission
Proceedings of International Conference on Network Infrastructure and Digital Content, (2009), 769-773.
Hoseok WEY, Akinori ITO, Yoiti SUZUKI

■Relevant Document Retrieval using a Spoken Document
Proceedings of International Symposium on Communications and Information Technologies, (2009), 1483-1488.
Akinori Ito, Yu Uno, Ryo Masumura, Masashi Ito and Shozo Makino

2008年

■Are Bigger Robots Scary? -The Relationship Between Robot Size and Psychological Threat-
Proceedings of International Conference on Advanced Intelligent Mechatronics, (2008) 546-551.
Yutaka Hiroi, Akinori Ito

■An Unsupervised Language Model Adaptation Based on Keyword Clustering and Query Availability Estimation
Proceedings of International Conference on Audio, Language and Image Processing, (2008), 1412-1418.e
Akinori Ito, Yasutomo Kajiura, Shozo Makino and Motoyuki Suzuki

■Packet Loss Concealment for MDCT-based Audio Codec using Correlation-Based Side Information
Proceedings of 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, (2008) 612-615.
Akinori Ito, Kiyoshi Konno, Shozo Makino, Motoyuki Suzuki

■Discrimination of Task-Related Words for Vocabulary Design of Spoken Dialog Systems
Proceedings of Interspeech, (2008), 207-210.
Akinori Ito, Toyomi Meguro, Shozo Makino and Motoyuki Suzuki

■A Fast Speaker Adaptation Method using Aspect Model
Proceedings of Interspeech, (2008)， 1221-1224.
Seongjun HAHM, Akinori ITO, Shozo MAKINO and Motoyuki SUZUKI

■Recognition of English Utterances with Grammatical and Lexical Mistakes for Dialogue-based CALL System
Proceedings of Interspeech, (2008), 2819-2822.
Akinori Ito, Ryohei Tsutsui, Shozo Makino and Motoyuki Suzuki

■Intonation Evaluation of English Utterances using Synthesized Speech for Computer-Assisted Language Learning
Proceedings of International Conference on Natural Language Processing and Knowledge Engineering, (2008), 202-208.
Tomoaki Konno, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino

2007年

■A PORTABLE SPOKEN DIALOG SYSTEM FOR AN AUTONOMOUS ROBOT
Proceedings of Japan-China Joint Conference on Acoustics, (2007), CD-ROM.
Shozo Makino, Akinori Ito, Motoyuki Suzuki and Takashi Konashi

■PACKET LOSS CONCEALMENT OF AN AUDIO STREAM BY TIME DOMAIN AND FREQUENCY DOMAIN MULTIPLE DESCRIPTION
Proceedings of Japan-China Joint Conference on Acoustics, (2007), CD-ROM.
Akinori Ito, Toshiyuki Sakai, Motoyuki Suzuki and Shozo Makino

■Application of Multiple Description (MD) scalar quantization to speech codec
Proceedings of Japan-China Joint Conference on Acoustics, (2007), CD-ROM.
Ho-seok WEY, Ryouichi NISHIMURA, Akinori ITO, Maori KOBAYASHI, Yoiti SUZUKI

■Automatic Evaluation System of English Prosody for Japanese Learner's Speech
Proceedings of 5th International Conference on Education and Information Systems, Technologies and Applications, (2007), CD-ROM.
Motoyuki Suzuki, Tatsuki Konno, Akinori Ito and Shozo Makino

■REDUCTION METHOD OF SIDE INFORMATION FOR PACKET LOSS CONCEALMENT BASED ON SPECTRUM STRIPE CODING
Proceedings of 19th International Congress of Acoustics, (2007), CD-ROM.
Motoyuki Suzuki, Toshiyuki Sakai, Akinori Ito, Shozo Makino

■DETECTION AND DIRECTION ESTIMATION OF CALLING VOICE
Proceedings of 19th International Congress of Acoustics, (2007), CD-ROM.
Akinori Ito, Kota Kitadate, Motoyuki Suzuki, Shozo Makino

■Increasing Correlation using a Few Bits for Multiple Description Coding
Proceedings of 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing, (2007), 259-262.
Akinori Ito and Shozo Makino

2006年

■EVALUATION OF MULTIPLE PLSA ADAPTATION BASED ON SEPARATION OF TOPIC AND STYLE WORDS
Proceedings of 9th Western-Pacific Acoustic Conference, (2006), CD-ROM.
Akinori ITO, Naoto KURIYAMA, Motoyuki SUZUKI, Shozo MAKINO

■PACKET LOSS CONCEALMENT OF AUDIO STREAM BASED ON MULTIPLE DESCRIPTION BY SPECTRUM STRIPING
Proceedings of 9th Western-Pacific Acoustic Conference, (2006), CD-ROM.
Motoyuki Suzuki, Toshiyuki Sakai, Jie Liu, Akinori Ito, Shozo Makino

■A User Simulator based on VoiceXML for evaluation of spoken dialog systems
Proceedings of Interspeech, (2006), 1045-1048.
Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino

■Unsupervised language model adaptation based on automatic text collection from WWW
Proceedings of Interspeech, (2006), 2202-2205.
Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito and Shozo Makino

■Music Information Retrieval from a Singing Voice Based on Verification of Recognized Hypotheses
Proceedings of 7th International Conference on Music Information Retrieval, (2006), 168-171.
Motoyuki Suzuki, Toru Hosoya, Akinori Ito and Shozo Makino

■Multiple description coding of an audio stream by optimum recovery transform
Proceedings of 2nd International Conference on Intelligent Information Hiding and Multimedia Signal Processing, (2006), 19-22.
Akinori Ito and Shozo Makino

2005年

■Pronunciation Error Detection Method Based on Error Rule Clustering Using a Decision Tree
Proceedings of 9th European Conference on Speech Communication and Technology, (2005), 173-176.
Akinori Ito, Yen-Ling Lim, Motoyuki Suzuki and Shozo Makino

■Construction Method of Acoustic Models Dealing with Various Background Noises Based on Combination of HMMs
Proceedings of 9th European Conference on Speech Communication and Technology, (2005), 973-976.
Motoyuki Suzuki, Yusuke Kato, Akinori Ito and Shozo Makino

■Internal Noise Suppression for Speech Recognition by Small Robots
Proceedings of 9th European Conference on Speech Communication and Technology, (2005), 2685-2688.
Akinori Ito, Takashi Kanayama, Motoyuki Suzuki and Shozo Makino

■LYRICS RECOGNITION FROM A SINGING VOICE BASED ON FINITE STATE AUTOMATON FOR MUSIC INFORMATION RETRIEVAL
Proceedings of the 6th International Conference on Music Information Retrieval, (2005), 532-535.
Toru Hosoya, Motoyuki Suzuki, Akinori Ito and Shozo Makino

■Smile and Laughter Recognition using Speech Processing and Face Recognition from Conversation Video
Proceedings of International Conference on Cyberworlds, (2005), 437-444.
Akinori Ito, Xinyue Wang, Motoyuki Suzuki, Shozo Makino

■A New Design Concept of Robotic Interface for the Improvement of User Familiarity
Proceedings of SPIE, 6042, (2005), doi:10.1117/12.664685.
Yutaka Hiroi, Eiji Nakano, Takayuki Takahashi, Akinori Ito, Koji Kotani and Nobuo Takatsu

2004年

■A dialogue-based CALL system for Japanese conversation
Proceedings of the 18th International Congress on Acoustics, (2004), III-2015 - III-2018.
Oh Pyo Kweon, Motoyuki Suzuki, Akinori Ito and Shozo Makino

■Language Modeling using Stochastic Switching N-gram
Proceedings of the 18th International Congress on Acoustics, (2004), V-3697 - V-3700.
Takeshi NAGANO, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO

■Language Modeling by an Ergodic HMM based on an N-gram
Proceedings of the 18th International Congress on Acoustics, (2004), V-3701 - V-3704.
Takeshi NAGANO, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO, Masaharu KATO, Masaki KOHDA

■A spoken dialog system based on automatic grammar generation and template-based weighting for autonomous mobile robots
Proceedings of International Conference on Spoken Language Processing, (2004), CD-ROM.
Takashi KONASHI, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO

■Noise Adaptive Spoken Dialog System based on Selection of Multiple Dialog Strategies
Proceedings of International Conference on Spoken Language Processing, (2004), CD-ROM.
Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki and Shozo Makino

■A Japanese dialogue-based CALL system with mispronunciation and grammar error detection
Proceedings of International Conference on Spoken Language Processing, (2004), CD-ROM.
Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki and Shozo Makino

■Speaker Adaptation Method for CALL Systems Using Bilingual Speakers' Utterances
Proceedings of International Conference on Spoken Language Processing, (2004), CD-ROM.
Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino

■COMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM
Proceedings of the 5th International Conference on Music Information Retrieval, (2004), 297-302.
Akinori Ito, Sung-Phil Heo, Motoyuki Suzuki, Shozo Makino

2003年

■A portable spoken dialog system for autonomous robots
Proceeding of 1st International Workshop on Language Understanding and Agents for Real-world Interaction, (2003), 79-84.
Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino

■An Optimized Multi-Duration HMM for Spontaneous Speech Recognition
Proceeding of European Conference on Speech Communication and Technology, (2003), 485-488.
Yuichi Ohkawa, Akihiro Yoshida, Motoyuki Suzuki, Akinori Ito, Shozo Makino

■Error Tolerant Melody Matching Method in Music Information Retrieval
Proceeding of 1st International Workshop on Adaptive Multimedia Retrieval (AMR 2003), Lecture Note in Computer Science, 3094, (2004), 212-217.
Sung-Phil Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino and Hyun-Yeol Chung

■Analysis of pronunciation errors in Japanese speech uttered by Korean towards development of Japanese CALL system
Proceedings of Oriental COCOSDA, (2003), 185-192.
Oh Pyo Kweon, Motoyuki Suzuki, Akinori Ito and Shozo Makino

■Three Dimensional Continuous DP Algorithm for Multiple Pitch Candidates in Music Information Retrieval System
Proceedings of 4th International Symposium on Music Information Retrieval (2003), 235-236.
Sung-Phil HEO, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO

■A Patient Care Service Robot System Based on a State Transition　Architecture
Proceedings of the 2nd International Conference on　Mechatronics and Information Technology, (2003), 231-236.
Yutaka HIROI, Eiji NAKANO, Takayuki TAKAHASHI, Shozo MAKINO, Akinori　ITO, Koji KOTANI, Nobuo TAKATSU and Tadahiro OHMI

2002年

■Continuous Speech Recognition Consortium —an Open Repository for CSR Tools and Models —
Proceedings of IEEE International Conference on Language Resources and Evaluation, (2002), 1438-1441.
Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito , Katsunobu Itou, Kiyohiro Shikano

東北大学大学院工学研究科通信工学専攻伊藤・能勢研究室

国際会議(査読あり) : ～2013年

リンク

お問い合わせ

東北大学大学院 工学研究科 通信工学専攻 伊藤・能勢研究室

国際会議(査読あり) : ～2013年

リンク

お問い合わせ

東北大学大学院工学研究科通信工学専攻伊藤・能勢研究室