Call for Papers – SET International Journal of Broadcast Engineering (SET IJBE) – 2nd Edition

Chair: João Vandoros, Consultant

João Vandoros, graduated in Electrical Engineering from Mackenzie Presbyterian University and a Telecommunications Specialist from the University of Campinas, has been active in the video distribution and contribution market since 2000, where he began his career at the Mackenzie Digital TV Laboratory. He worked at the TVA and Band companies between 2006 and 2013. Since 2013, as a consultant, he has been involved in various projects, with a focus on those carried out at GfK, Eurovision, and Mackenzie - where he currently collaborates on the evaluation of the physical layer for TV 3.0 through RNP.

A Larger Scope of CMAF Usage for Video Delivery

The emergence of the CMAF file format transformed OTT video delivery with its versatile, ISO BMFF-based architecture, surpassing rival formats in interoperability. Initially, CMAF was utilized for B2C packaging (end-users and players). However, with the advent of the live media ingest protocol, it is set to dominate the B2B video delivery scope. This protocol enables seamless video exchange between various video processing entities (encoders, packagers, CDN, cloud services) through synchronized data and metadata fragments from the original stream. The technology gains immense interest as it offers a modern alternative to the traditional TS format for first mile delivery. The paper covers the protocol’s key features, benefits, and implementation architecture.

Speaker: Robin Hérin, Director of Standardization at Ateme

Robin Hérin is a Director of Standardization within the CTO Office at Ateme, where he helps the Research & Innovation team develop future technologies for video processing and delivery, and drive partnership projects and standardization. Now in his ninth year at Ateme and his career, Robin has previously worked in both South & North America as a Solutions Architect before moving to NY and focusing on expanding Ateme’s footprint in the North-East, including the first Ateme POCs & deployments in both ATSC 1.0 & 3.0. Robin holds a master’s degree in mechanical engineering from the Université de Technologie de Compiègne (France).

An Overview of Audio Technologies, Immersion and Personalization Features envisaged for the TV3.0

In 2021 the Forum SBTVD accomplished the phase 2 of the TV3.0 project, comprised of a series of tests of the technologies proposed for this next generation of TVD system in Brazil. The tests were conducted by research groups of Brazilian universities. Particularly referring to the audio coding layer of the system, we carried out at the University of São Paulo 13 groups of tests, as prescribed in a public Call, and could assess the technologies capabilities and versatility in providing a series of new features for next generation of audio in the digital broadcasting system. This paper summarizes the main results of this testing and evaluation phase, and brings an overview of the stimulating new features that content producers and audience would have available to create and consume immersive and personalized services at home.

Speaker: Regis Rossi A. Faria, School of Arts, Sciences and Humanities - University of Sao Paulo

Regis Rossi Alves Faria is professor at the School of Arts, Sciences and Humanities and at the School of Communications and Arts at the University of São Paulo. He works in the interdisciplinarity between arts and sciences, in the areas of audio engineering, sound and music computing, addressing issues related to sound creation and reception using technological resources, and developing systems for sound and music. He coordinates the Laboratory of Audio and Music Technology at USP (LATM-EACH/USP), is a researcher at LabArteMídia (ECA/USP) and with the Sonology Research Center (ECA/USP). He is member of the ABNT Audio, Image, Multimedia and Hypermedia Coding Study Committee and an audio expert representing the Brazilian Association of Technical Standards at ISO-MPEG.

An Overview of Audio Technologies, Immersion and Personalization Features envisaged for the TV3.0

Co-speaker: Almir Almas, Professor at School of Communications and Arts - University of São Paulo

Associate Professor of the Department of Film, Radio and Television and Researcher of the Program of Postgraduate Studies in Media and Audiovisual Processes; General Coordinator of the Research Group LabArteMídia and Obted Observatory of ECA/University of São Paulo. PhD in Communication and Semiotics. Filmmaker/Videoartist/VJ; Artist of the Cobaia Art Collective. Member of the Board of the Brazilian Society of Television Engineering (SET). Member of the Brazilian Digital Terrestrial Television System Forum (FORUM SBTVD). Author of Televisão digital terrestre: sistemas, padrões e modelos (Digital terrestrial television: systems, standards and models), among other books and articles.

Harmonized support for immersive audio interactivity in SBTVD TV 2.5 and TV 3.0

MPEG-H Audio enables highly efficient immersive audio transmission with advanced accessibility, interaction, personalization, and adaptation features. MPEG-H Audio utilizes audio objects and metadata to allow viewers to interact with content, creating a personalized listening experience. Broadcasters can enable or disable interactivity options and set limits for viewer interactions. This paper details Fraunhofer IIS’ proposal for TV 3.0 Project on new high-level immersive audio interactivity APIs for Ginga-NCL and Ginga CC WebServices. These APIs allow broadcasters to develop and deliver multimedia applications to control such advanced audio features, using their own visual identity, graphical design and even the multimodal interaction expected to be supported in TV 3.0. The paper also discusses the implications of audio stream metadata changes, which occurs when the content includes different audio scenes. The workhighlightsthat such API proposal for TV 3.0 is harmonized with current TV 2.5 specifications and, given the increasing number of TV 2.5 receivers supporting MPEG-H audio, these APIs could also be proposed for being included in current standards.

Speaker: Marcelo F. Moreno, Associate Professor, Federal University of Juiz de Fora (UFJF) | Technical Module of the SBTVD Forum

Associate Professor at the Federal University of Juiz de Fora (UFJF), Brazil, with a Ph.D. in Computer Science from PUC-Rio and expertise in multimedia systems and computer networks. He was a Visiting Professor at the International Audio Laboratories Erlangen (FAU/Fraunhofer IIS) in 2022–2023. He co-edited ITU-T Recommendation H.761 (“NCL and Ginga-NCL”) and has contributed to several international standards, having chaired ITU-T working groups for over a decade. Since 2015, he has coordinated the Application Coding Working Group of the Brazilian Digital TV System Forum (SBTVD), where he also serves as editor of ABNT standards for TV 2.5 and TV 3.0. His research bridges academic innovation and standardization, with a focus on application-oriented broadcasting, second-screen integration, audience measurement, and privacy-aware media platforms for next-generation digital TV.

Harmonized support for immersive audio interactivity in SBTVD TV 2.5 and TV 3.0

Co-speaker: Adrian Murtaza, Senior Manager, Technology and Standards - Fraunhofer IIS

Adrian Murtaza received his M.Sc. degree in Communication Systems from the École Polytechnique Fédérale de Lausanne, Switzerland in 2012 with a thesis on “Backward Compatible Smart and Interactive Audio Transmission”. Upon graduation he joined Fraunhofer IIS, where he works as a Senior Research Engineer. Adrian Murtaza joined MPEG in 2013 and since then contributed to development of various audio technical standards in MPEG-D and MPEG-H. He serves as Fraunhofer’s Senior Standards Manager in a number of industry standards bodies, including DVB, ATSC, SBTVD, CTA and SCTE, and is the co-author of multiple specifications in those groups. More recently he focused on specification of Next Generation Audio delivery and transport in ATSC 3.0 systems and MPEG-2 Transport Stream based DVB systems, as well as on enabling of MPEG-H Audio services in different broadcast and streaming ecosystems.

Academia’s R&D progress on TV 3.0 Application Coding

SBTVD Forum’s TV 3.0 Project aims to develop the next-generation broadcasting technologies for Brazil. Television has played a crucial social and cultural role in the country, and any technological evolution could lead to significant societal changes. Currently in Phase 3, the project carries out an R&D effort on application coding support, gathering 40 researchers from Academia. This paper focuses on the methodology, progress and early achievements of the Academia R&D team on addressing the high-priority application-coding requirements for TV 3.0. Focus groups and opinion polls have been contributing to social studies related to the application-based TV experience. The group proposes architectural changes, new user interfaces and a persistent media player for enhanced interactivity and full broadcaster control over all audiovisual content. Moreover, efforts are made to evaluate audio and video codecs, including the adopted MPEG-H audio technology, to identify the need for new APIs in a harmonized fashion with current SBTVD standards. The team also focuses on extensibility support and accessibility requirements, particularly on forwarding captioning and sign language to mobile devices. Work is in progress for sensory effects, immersive content, and multimodal interaction, utilizing the adopted NCL 4.0 and Guaraná proposals to enable advanced mulsemedia applications and 360° scenes. A demonstration of TV 3.0 use case prototypes will take place during SET Expo 2023 to showcase and discuss the progress and the findings of the Academia R&D team.

Speaker: Marcelo F. Moreno, Associate Professor, Federal University of Juiz de Fora (UFJF) | Technical Module of the SBTVD Forum

Associate Professor at the Federal University of Juiz de Fora (UFJF), Brazil, with a Ph.D. in Computer Science from PUC-Rio and expertise in multimedia systems and computer networks. He was a Visiting Professor at the International Audio Laboratories Erlangen (FAU/Fraunhofer IIS) in 2022–2023. He co-edited ITU-T Recommendation H.761 (“NCL and Ginga-NCL”) and has contributed to several international standards, having chaired ITU-T working groups for over a decade. Since 2015, he has coordinated the Application Coding Working Group of the Brazilian Digital TV System Forum (SBTVD), where he also serves as editor of ABNT standards for TV 2.5 and TV 3.0. His research bridges academic innovation and standardization, with a focus on application-oriented broadcasting, second-screen integration, audience measurement, and privacy-aware media platforms for next-generation digital TV.

The Use of Artificial IntelligenceEnablingScaleAudioDescription in BrazilianTelevision: a workflow proposal

Recently, Artificial Intelligence (AI) technologies have been making their way into diverse fields of knowledge, significantly impacting many academic and business spheres. One of the applications that can benefit from AI is the inclusion of people with disabilities in audiovisual content, where the scalability of certain processes can bring new accessibility opportunities. In this work, we show what a basic workflow of an audio description for drama audiovisual content looks like, and from that, we propose a new workflow for generating audio descriptions for visually impaired people using synthetic voices created with AI models. The proposed workflow, besides allowing for the generation of audio on a larger scale compared to a traditional workflow, enables a greater coverage of the target audience by considerably reducing production time. It also allows multiple people to work on the same project without losing its sound identity, which is very important for the consumer of this type of service. With this proposal, we believe that accessibility on Brazilian television can be expanded and reach a much larger number of people.

Speaker: Luiz Kruszielski, Audio Producer - Globo

Luiz Fernando Kruszielski graduated in Sound Production (UFPR - Brazil) and have a master's and a doctorate degree in Sound and the Environment at Tokyo University of the Arts (Japan). He worked as a professional sound designer since 2003, and from 2013, he started to work at Globo TV Network as a researcher for sound technologies and later became Sound Producer, were he worked in more than 10 series and telenovelas.

Globo’s Ultimate Operational Challenge: a creative full based workflow editing in cloud

In 2022, the famous and epic soap opera “Chocolate com Pimenta” in Brazil became a good surprise for TV Globo: the post-production chain accomplished a simple, productive, and economical workflow. A cloud based, remote and collaborative editing produced the entertainment content in an innovative way. Globo, a free-to-air television network, saw in this path an excellent opportunity to thrive technologically and to offer spectators a unique experience. Globo’s objective was to provide a special edition of the soap opera through Globoplay, an online video on demand platform, and by Open TV. Having the team in charge located at Post-Production Center of Estúdios Globo, the material was edited collaboratively in HD (XDCAM codec) directly connected to cloud. The process was successfully achieved and has helped to maintain Globo into the future of technological innovations.  And besides, the business model of this unique approach was very attractive for Globo, solving the market need for a cloud-native solution, notably when the Post-Production Center deals with big files and assets routinely, enabling the increase of the content production to a next level (cost benefit). Globo interprets this initiative not as a mere technological advance, but also as a step taken toward a more concrete operational level. Through all the effortful work, Globo, its customers, and everybody involved in this project could come together to cheer and look forward for a brand-new efficient future.

 

Speaker: Priscila David, Product Owner of Post-Production projects in the Media Solutions area at Globo

Priscila David was born in Rio de Janeiro. She holds a Bachelor's degree in Telecommunications Engineering and an MBA in Strategic People Management. She is currently undertaking a specialization program in Information Technology Management. Priscila authored the poster “4K and 4K-HDR VOD in Rio’s 2016 Olympic Games,” published by IBC in 2017, and the article “Globo’s Ultimate Operational Challenge: a creative workflow editing in cloud,” published by SET in 2023. She has been working at Globo for 18 years and, for the past three years, has held the position of Specialist Product Owner for Post-Production projects in the Media Solutions area.

Globo’s Ultimate Operational Challenge: a creative full based workflow editing in cloud

Co-speaker: Ariza Bertelli, Media Solution Analyst, Globo

Ariza Bertelli was born in Minas Gerais, Brazil in 2000. She is majoring in Electrical Engineering with an emphasis on Robotics and Industrial Automation. She was a member of IEEE from 2019 to 2021. She received the award for the 3 best education project at RNR: “Talking with the hands” in 2020. She has been working at Globo for 1 year as a media solutions intern, and in June 2023, as a Media Solution Analyst.  

Tool in Python for predicting wireless network signals in an indoor environment using neural networks based on mapping and measurements in the field

This work aims to present the advances and applications of Artificial Neural Networks (ANNs) using the Python programming language. ANNs have proven to be a powerful tool in the field of artificial intelligence, imitating the functioning of the human brain to solve complex problems and perform tasks that previously required extremely elaborate algorithms.
In addition, the importance of Python as one of the most popular and accessible languages for the development of neural networks will be emphasized. Through robust libraries, it is possible to create, train and evaluate neural networks efficiently and intuitively.

Speaker: Breno Batista Nascimento Silva, Undergraduate student - Electrical Engineering faculty , Mackenzie University

Electricity has always been a passion, so much so that at the age of 15 he entered an electronics technician at Senai Guarulhos, where he decided which college he would follow. I joined the Electrical Engineering faculty in 2019, through the Mackenzie philanthropic scholarship, where there were several opportunities for personal and professional development. And it was during a class during the pandemic that he became interested in the opportunity to carry out a scientific initiation with Professor Edson Tafeli on Artificial Neural Networks (ANN's) in Python. Currently Accounting Measurement Analyst at the Câmara de Comercialização de Energia (CCEE).