IST Proposal MobiNews Meeting - June 10th, 2003 “Automatic and Personalised Compilation of Broadcast News with Audio Playback on Mobile Devices” François.

Slides:



Advertisements
Présentations similaires
INTERNATIONAL ENTREPRENEURSHIP Pasqualino Mare, Projectmanager KC Handel 4 Novembre 2008 Paris,
Advertisements

New opportunities offered by APHLIS 3 Les nouvelles opportunities qui soffrent avec APHLIS 3 JRC.
OUR LAND – OUR WEALTH, OUR FUTURE, IN OUR HANDS Second Regional Preparation Workshop for the GEF Strategic Investment Program for Sustainable Land Management.
Département fédéral de lintérieur DFI Office fédéral de la statistique OFS Implementing the economic classification revision (NACE / ISIC) in the Business.
Targets of the approach
THALES Communications Les informations contenues dans ce document sont la propriété exclusive du Groupe THALES. Elles ne doivent pas être divulguées sans.
Le sondage LibQUAL à HEC Montréal Une première expérience réussie qui sintègre au processus de planification stratégique de la bibliothèque Le sondage.
(Nom du fichier) - D1 - 01/03/2000 FTR&D/VERIMAG TAXYS : a tool for the Development and Verification of RT Systems a joint project between France Telecom.
Échanger connaissances et techniques sur les routes et le transport routier 1 The PIARC Website.
Copyright © 2010 Systematic Présentation des enjeux Europe et International 1 Jean-Luc Beylat, Vice-Président International Systematic.
1 DGRI / Département des affaires européennes et internationales 7 ème PCRDT, TIC: appel SME Initiative on Digital Content and Languages Frédéric Laurent.
(Nom du fichier) - D1 - 01/03/2000 France Télécom R&D Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation.
(Nom du fichier) - D1 - 01/03/2000 France Télécom R&D Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation.
Branche Développement Cnet La communication de ce document est soumise à autorisation du Cnet © France Télécom - (Nom du fichier) - D1 - 11/01/2014 Diffusion.
Thales Communications
Gérard CHOLLET Fusion Gérard CHOLLET GET-ENST/CNRS-LTCI 46 rue Barrault PARIS cedex 13
Revenir aux basiques !. 1 Revenir aux basiques Processus Nécessité daméliorer la Maîtrise les Offres et Projets: lanalyse des causes racines montre un.
Inforoute Santé du Canada Les défis de linteropérabilité en e-santé Mike Sheridan, Chef de lexploitation 19 mai 2006.
interaction in the .LRN platform
RECOMMENDATIONS ON EXPORT MARKETING FOR GEORGIAN WINES Tbilisi – November 27, 2007.
1 Initiatives involving the social partners in Europe on climate change and employment policies Denmark : The experience of the Lindoe Offshore Renewable.
Laboratoire Composants Optique Hyperfréquences et numerique:
Status report SOLEIL April 2008
1 Découverte des Outils SI de Cadence Ecole dElectronique Numérique IN2P3 Roscoff 2006 Découverte des Outils dAnalyse dIntégrité du Signal de Cadence ®
Coopération/Distribution DEA Informatique Nancy. Content 4 Introduction - Overview 4 Coordination of virtual teams : –explicit interaction model –explicit.
TP2 ... MVC ? JList JLabel JSlider ImageLibrary Contrôleur Vue Modèle
1Chaire de commerce électronique RBC Groupe Financier HEC Montréal Is e-Commerce different ? Commercer en ligne : Est-ce différent ? Sylvain Sénécal Is.
Minimisation Techniques 1 Assimilation Algorithms: Minimisation Techniques Yannick Trémolet ECMWF Data Assimilation Training Course March 2006.
Université Des Sciences Et De La Technologie DOran Mohamed Boudiaf USTO République Algérienne Démocratique et Populaire Département de linformatique Projet.
Defence R&D Canada R et D pour la défense Canada Novel Concepts for the COP of the Future Denis Gouin Alexandre Bergeron-Guyard DRDC Valcartier.
(Nom du fichier) - D1 - 01/03/2000 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document.
TM.
Defence Research and Development Canada Recherche et développement pour la défense Canada Canada 11-1.
* Google Confidential and Proprietary Khaled KOUBAA Public Policy & Gov't Relations Manager - North Africa Google, Inc. Research, Innovation and Entrepreneurship.
DELF Le 12 au 15 avril POURQUOI DELF? Official French language diplomas (DELF-DALF) - Why take the DELF and the DALF ? The Diplôme dEtudes en Langue.
Assessment and the new secondary curriculum S. Barfoot.
EUROPEAN ASSOCIATION OF DEVELOPMENT RESEARCH AND TRAINING INSTITUTES ASSOCIATION EUROPÉENNE DES INSTITUTS DE RECHERCHE ET DE FORMATION EN MATIÈRE DE DÉVELOPPEMENT.
AFNOR NF Z – "Online Consumer Reviews
The EMPREINTE Project Juillet - octobre 2004
TortoiseSVN N°. Subversion : pour quoi faire ? Avoir un espace de stockage commun – Tous les étudiants du SIGLIS ont un espace svn commun Partager vos.
Seite 1 Présentation Guinée Réunion Task Force CQ/SQI, Eschborn CONCOURS QUALITE IN GUINEA Context and perennity Dr Mohamed Lamine.
INVESTMENT CLIMATEDEVELOPMENT IMPACT EVALUATION INITIATIVE Piloting the Entreprenant Status: In search of a successful formalization model BENIN Impact.
PURCHASING PHASE REVIEW Cornerstones of Purchase baseline
Laboratoire de Bioinformatique des Génomes et des Réseaux Université Libre de Bruxelles, Belgique Introduction Statistics.
My4n-news TEACHER EDU Des ressources numériques en ligne pour enseigner langlais avec lactualité 28 mars 2013.
Contribution du projet PARIS Christian Pérez Réunion LEGO LIP, ENS Lyon 10 février 2006.
Consortium québécois sur la découverte du médicament Facilitating creative partnerships in biopharmaceutical research OTTAWA - December 1 st, 2009 Max.
Présentation dun modèle dinterface adaptative dun système de diagnostique et dintervention industriel: ADAPTS (Adaptive Diagnostics And Personalized Technical.
1 ISBN John Wiley and sons. 2 IntroductionIntroduction Chapter 1.
Ce document est la propriété d EADS CCR ; il ne peut être communiqué à des tiers et/ou reproduit sans lautorisation préalable écrite d EADS CCR et son.
Systèmes distribués Le futur des systèmes dinformation est: Networked Diverse Numerous Mobile Ubiquitous Systèmes multiagents Middlewares: CORBA JINI HLA.
Rebecca Kent and Stacey Mahoney Key Stage 3 Story Telling Triple Literacy Project Croesyceiliog School.
Marketing électronique Cours 5 La personnalisation.
Le Baromètre Zone Cours : un environnement pour la micro-évaluation de ressources pédagogiques* Jacques Raynauld Olivier Gerbé HEC Montréal, MATI Montréal.
1 Diffusion du savoir et mobilisation des connaissances Bilan de la réunion des partenaires du Domaine Justice, Police et Sécurité à Ottawa (14 novembre.
Thematic Alignment of Static Documents with Meeting Dialogs Dalila Mekhaldi Diva Group Department of Computer Science University of Fribourg.
"Man Machine Interaction" MEMODULES as tangible shortcuts to multimedia information Omar ABOU KHALED, Rolf INGOLD, Denis LALANNE.
INDICATOR DEFINITION An indicator describes the manifestation of a process of change resulting from the pursuit of an action. Un indicateur décrit la manifestation.
Réseaux de nouvelle génération et Internet : propositions pour le futur Alistair URIE Membre du Board d’ETSI Président du groupe de réflexion d’ETSI sur.
16-Oct-00SL-BI and QAP Presented to QAWG on 23/10/2000Slide 1 Quality Assurance in SL/BI Jean-Jacques GRAS (SL-BI)
Branche Développement Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire.
VTHD PROJECT (Very High Broadband Network Service): French NGI initiative C. GUILLEMOT FT / BD / FTR&D / RTA
«MASTER MANAGEMENT ET INGENIERIE ECONOMIQUE» Spécialité: Projet innovation conception, option gestion de la connaissance Module: Communautés virtuelles,
KM-Master Course, 2004 Module: Communautés virtuelles, Agents intelligents C3: Collaborative Knowledge construction & knowledge sharing Thierry NABETH.
All Rights Reserved © Alcatel-Lucent 2006, ##### Kick off ECOSCELLS Project 9 November 2009 Université D’Avignon.
Quelle heure est-il? What time is it ?.
Belgian Breast Meeting Senator F. Roelants du Vivier 13th october.
Reveal-This Ou comment générer des métadonnées utiles automatiquement.
Session 3: Implementation experience: Selection of measures based on Cost-effectiveness Analysis Introduction: summary of relevant results of the questionnaire.
Transcription de la présentation:

IST Proposal MobiNews Meeting - June 10th, 2003 “Automatic and Personalised Compilation of Broadcast News with Audio Playback on Mobile Devices” François CAPMAN, PhD Research Engineer, Technologies Radio & Signal Unit francois.capman@fr.thalesgroup.com Tel : +33 (0) 1 46 13 29 63 Fax : +33 (0) 1 46 13 25 55 June 10th, 2003

MobiNews Workshop Agenda 10h00 - 10h15 Agenda, objectives of the meeting 10h15 - 10h30 Presentation of MobiNews IST proposal , current status 10h30 - 11h30 Presentation of each organisation 1 (5mn/10mn) 11h30 - 11h45 Break 11h45 - 12h15 Presentation of each organisation 2 (5mn/10mn) 12h15 - 12h45 Definition of contributions and overall structure of the project 12h45 - 13h45 Lunch 13h45 - 15h15 Detailed structure of the project, description of work-packages 15h15 - 15h45 Other topics (additional partners, ...) 15h45 - 16h00 Further steps, planning for the proposal 16h00 - 16h30 Discussion - Conclusion

IST Objectives (2nd Call) Call 2: publication 17/6 2003, closing 15/10 2003 – would have an indicative budget of around 525 MEuros (80 % pre-distributed). Objectives covered in Call 2 Advanced displays Optical, opto-electronic, & photonic functional components Open development platforms for software and services Cognitive systems Embedded systems Applications and services for the mobile user and worker (60 MEuros) Cross-media content for leisure and entertainment (55 MEuros) GRID-based Systems for solving complex problems Improving Risk management eInclusion  Specific Targeted Research Project (STREP) : 2.5 / 3.0 MEuros (Funding)

IST Objectives (2nd Call) 2.3.2.7 Cross-media content for leisure and entertainment Objective: To improve the full digital content chain, covering creation, acquisition, management and production, through effective multimedia technologies enabling multi-channel, cross-platform access to media, entertainment and leisure content in the form of film, music, games, news and alike. It will accelerate take up in B2B, B2C and C2C, currently hampered by insufficient productivity, convergence and high cost. Focus is on: – Developing technologies supporting the creation of new, compelling forms of content for interactive, creative or artistic consumption. Research should aim at advancing imaging technologies and audio-visual representation, multi-dimensional immersive environments and experience portals, as well as virtual, augmented and mixed reality technologies featuring higher levels of quality and accuracy. Device adaptivity and contextualisation, personalisation and (emotive) feedback, and ability to capture real-time, multimodal and multisensorial input will be embedded as needed. – Developing integrated content programming environments allowing to retrieve content from different sources, types and locations, and to store, compress and categorise it, with a view to realising programming appropriate to a particular audience and delivery channel, including interactive TV, e-cinema, radio, online games and music.

IST Objectives (2nd Call) 2.3.2.6 Applications and Services for the Mobile User and worker Objective: To foster the emergence of rich landscape of innovative applications and services for the mobile user and worker and to support the use and development of new work methods and collaborative work environments. These should be based on interoperable mobile, wireless technologies and the convergence of fixed and mobile communication infrastructures. Such applications and services will enable new business models, new ways of working, improved customer relations and government services in any context. The target applications and services will be capable of being seamlessly accessed and provided anywhere, anytime and in any context. Focus is on: – The integration of technologies into a wide range of innovative mobile and multimodal applications and services including workplace designs that enhance creativity and productivity. (Intelligent, adaptive and self-configuring services that deploy wearable interfaces and enable automatic context-sensitivity, user profiling and personalisation in a trusted and secure environment as well as multi-lingual and multi-cultural presentation, and multiple modes of interaction) – Addressing the major hurdles for the deployment of applications and services for the mobile user.

MobiNews Proposal Targeted Application Expected Features Automatic compilation of broadcast news (audio, text) with audio playback on mobile devices (2.5G, 3G). Access to personally selected text and audio news from a service/source provider using Multimedia Messaging Service (MMS) transmission protocol. Expected Features Fast and reliable access to synthetic newscast on a regular basis (daily, weekly, …) or upon request. Access to various identified sources within the same compilation, using scheduled programme. Automatic server-based generation of the synthetic newscast, with MMS WAP 2.0 Low-cost transmission towards mobile devices. User-defined profile for automatic download Enhanced Man Machine Interface (MMI) for queries’ submission, key-word-based search, ...

MobiNews Proposal Technical Objectives Audio data and Text data Structuring: automatic / semiautomatic segmentation (speaker tracking, scheduled programme, …) classification, discrimination (speech, music, jingles, …) transcription and information retrieval (word-spotting, key-words, …) automatic summarisation Very Low Bit Rate (VLBR) Wide-Band speech compression (with optional scalable audio stage). Text-To-Speech (TTS) synthesis for audio display of the transmitted text component (optional voice conversion, style / prosody mimicking). Software optimisation (complexity and memory) of VLBR decoder and TTS modules for embedded solutions on mobile devices (downloadable as plug-ins). Enhanced interface for mobile products (Natural Language Processing (NLP), …) Demonstrator with MMS link between a PC-based server and a handheld mobile terminal.

MobiNews Proposal

MobiNews Proposal

VLBR compression for MobiNews Targeted duration: 10 to 15 minutes in one single MMS  VLBR between 800 and 1200 bits/sec

MobiNews Work Packages Definition of Work Packages WP 1 Project management WP 2 Analysis of the needs, analysis of the market, dissemination WP 3 Broadcast radio news databases (specifications, collect, recordings) WP 4 Audio and text data structuring WP 5 Very-Low Bit Rate (VLBR) compression for synthetic newscast WP 6 Text-To-Speech (TTS) synthesis for mobile devices WP 7 MMS-based demonstrator (Server and mobile applications, MMI, …) WP 8 Evaluation methodology, field trials, analysis

MobiNews Consortium Thales Communications (France) L.I.A. (France) E.N.S.T. (France) E.S.I.E.E. (France) Elan Speech (France) Brno University of Technology (Czech Republic) Multitel (Belgium) INESC-ID (Portugal) PT Inovação, Voice services and platforms Dept (Portugal) Radio France Multimedia (France) Belga Press Agency (Belgium) Portuguese Radio/TV (Portugal) ???

Presentation of organisations  General Presentation and Potential Contributions to MobiNews 1 - Gwenaël Guilmin (Thales Communications) 2 - Bertrand Ravera : RNRT project proposal Mobi-Info 2 - Corinne Fredouille (L.I.A.) 3 - Maurice Charbit (E.N.S.T.) 4 - Geneviève Baudoin (E.S.I.E.E.) 5 - Jacques Toën (ELAN SPEECH) 6 - Petr Motlicek (BRNO University of Technology) 7 - Stéphane Deketelaere (MULTITEL) 8 - Isabel Trancoso (INESC-ID) 9 - Nuno Beires (PT INOVACAO) 10 - Caroline Roy (RADIO France MULTIMEDIA)

Contributions Thales Communications: E.N.S.T.: E.S.I.E.E.: Speech segmentation / classification Very-Low Bit Rate speech compression using parametric approaches optimisation of VLBR for a mobile plug-in E.N.S.T.: voice conversion using improved HNM synthesis, joint-optimisation of speech units for coding and synthesis E.S.I.E.E.: Very Low Bit Rate speech compression using recognition/synthesis Very Low Bit Rate speech compression using parametric approaches voice conversion BRNO University of Technology:

Contributions ELAN SPEECH: INESC-ID, and L.I.A.: MULTITEL: distributed architecture (mobile/server) for speech synthesis optimisation for a mobile plug-in voice personalization, voice conversion INESC-ID, and L.I.A.: audio data structuring MULTITEL: Man-Machine Interface, Natural Language Processing PT INOVACAO: MMS synthetic newscast packaging MMS-based demonstrator Radio France Multimedia, and Belga Press Agency (+ Portuguese TV/rad) specifications news content provider evaluation

MobiNews: WORKPLAN WP 2: Analysis of the market, … needs, dissemination: WP2.1: Analysis of the market: existing services WP2.2: Analysis of the needs: limitations of the existing services WP2.3: Dissemination: valorisation of the outcome of the project, standardisation, ...

MobiNews: WORKPLAN WP 3: Broadcast radio news databases WP3.1: Audio databases (collect, recordings, annotation, meta-data, …) WP3.2: Text databases (collect, annotation, meta-data, …) WP3.3: Service specifications (features, user acceptance, …)

MobiNews: WORKPLAN WP 4: Audio and Text data Structuring WP4.1: Low-level segmentation speech/non speech discrimination (silence, noise, pause, speech, music, jingle, …) speaker characterisation (identification, tracking, segmentation, clustering, …) WP4.2: High-level segmentation speech-to-text transcription story segmentation, topic detection, tracking and classification WP4.3: Customisation text summarisation, audio summarisation constrained summarisation (profile-driven, queries-driven, duration, multi-sources, …) meta-data information evaluation methodology (reference human-built summaries, quiz scores, …)

MobiNews: WORKPLAN WP 5: VLBR Speech / Audio compression WP5.1: Segmental-based parametric compression of synthetic newscast audio stream analysis and segmentation optimised compression of structured messages scalable solutions (bit-rate and bandwidth) WP5.2: Compression based on natural speech units indexing optimised HNM-based speech synthesis speaker-independent mode (speaker adaptation, voice conversion) joint-optimisation of units for both synthesis and coding compression of synthesis units for memory storage optimisation

MobiNews: WORKPLAN WP 6: Text-To-Speech synthesis for mobile devices WP6.1: Voice conversion / customisation WP6.2: Optimisation for mobile terminals complexity reduction memory storage distributed software architecture

MobiNews: WORKPLAN (Man Machine Interface) WP 7: User-centred design of the MMI (Man Machine Interface) WP7.1: Server-based application optimised entries for the definition of user profile, user queries, ... WP7.2: Mobile embedded application design of an efficient mobile interface with emphasis on the ease-of-use and the acceptability (= usability)

MobiNews: WORKPLAN WP 8: MMS-based demonstrator WP8.1: Server-based applications module for data structuring module for audio compression MMS packaging WP8.2: Mobile devices embedded applications MMS de-packaging optimised plug-in for text-to-speech synthesis optimised plug-in for audio decompression

MobiNews: WORKPLAN WP 9: Evaluation methodology, Field trials, Analysis WP9.1: Evaluation methodologies audio quality for speech synthesis and compression evaluation of synthetic newscast (summarisation) evaluation of MMI (queries, profile, …) WP9.2: Field trials and analysis quiz score methods … ?

Administrative Issues the project proposal will include: A1 form: proposal acronym, proposal number, proposal title, estimated duration (30 months ?), key word codes, abstract (co-ordinator) A2 form: participant submission form (for each participant) A3 form:financial information (co-ordinator) B part: non-anonymous description of scientific/technological objectives