{"id":134207,"date":"2024-12-13T11:21:27","date_gmt":"2024-12-13T11:21:27","guid":{"rendered":"https:\/\/showbizztoday.com\/index.php\/2024\/12\/13\/podtile-facilitating-podcast-episode-browsing-with-auto-generated-chapters\/"},"modified":"2024-12-13T11:21:27","modified_gmt":"2024-12-13T11:21:27","slug":"podtile-facilitating-podcast-episode-browsing-with-auto-generated-chapters","status":"publish","type":"post","link":"https:\/\/showbizztoday.com\/index.php\/2024\/12\/13\/podtile-facilitating-podcast-episode-browsing-with-auto-generated-chapters\/","title":{"rendered":"PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters"},"content":{"rendered":"<div>\n<div class=\"published-date\">\n<div class=\"icon-holder\">\n                                                <img decoding=\"async\" src=\"https:\/\/research.atspotify.com\/wp-content\/themes\/spotify\/images\/icon.png\" alt=\"\"\/>\n                                            <\/div>\n<p><span class=\"date\">October 24, 2024<\/span> Published by Azin Ghazimatin and Ekaterina Garmash, Gustavo Penha, Kristen Sheets, Martin Achenbach, Oguz Semerci, Remi Galvez, Divya Narayanan, Ofeliya Kalaydzhyan, Ann Clifton, Paul N. 
Bennett, Claudia Hauff, Mounia Lalmas<\/p>\n<\/p><\/div>\n<div class=\"img-holder\">\n                                            <img src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO.png\" class=\"attachment-post-thumbnail size-post-thumbnail wp-post-image\" alt=\"RS066 PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters\" decoding=\"async\" fetchpriority=\"high\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO.png 1200w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO-250x131.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO-700x368.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO-768x403.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/RS066-PODTILE_-Facilitating-Podcast-Episode-Browsing-with-Auto-generated-Chapters-WITHOUT-LOGO-120x63.png 120w\" sizes=\"(max-width: 1200px) 100vw, 1200px\"\/><figcaption\/>\n                                        <\/div>\n<p>Listeners often find it challenging to navigate long podcast episodes because of their duration. This makes it difficult for them to understand the overall structure and locate specific sections of interest. A useful tool to address this issue is podcast chapterization, where the content is divided into segments labeled with titles and timestamps. 
Although podcast creators can provide chapters with their episodes, this is rarely done.\u00a0<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"756\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image3-3-700x756.png\" alt=\"\" class=\"wp-image-5876\" style=\"width:528px;height:auto\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image3-3-700x756.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image3-3-250x270.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image3-3-120x130.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image3-3.png 750w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>Examples of podcast chapters.<\/em><\/p>\n<p>To extend the benefits of chapterization to more podcasts in our catalog, we have developed a machine learning-based chapterization model. This model is trained in a supervised way using creator-provided chapterizations of their podcast episodes. The podcast domain presents unique research challenges compared to earlier work on chapterization and semantic segmentation. Podcasts are often conversational and lack a specific structure, with speakers sometimes diverging from the main topic for short periods. Additionally, episode transcripts are typically long, requiring efficient processing methods. 
In this post, we describe our solution for automating podcast chapterization that addresses these challenges.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"326\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6-700x326.png\" alt=\"\" class=\"wp-image-5877\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6-700x326.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6-250x116.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6-768x358.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6-120x56.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image6.png 1277w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>Comparison of typical podcast structure (left) and typical Wikipedia structure (right).<\/em><\/p>\n<h2 class=\"wp-block-heading\">PODTILE<\/h2>\n<p>We employ large language models and develop PODTILE, an LLM-based model that simultaneously generates chapter boundaries and titles for the input transcript.\u00a0\u00a0<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"283\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-700x283.png\" alt=\"\" class=\"wp-image-5878\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-700x283.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-250x101.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-768x311.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-1536x621.png 1536w, 
https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5-120x49.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image5.png 1602w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>Podcast chapterization with PODTILE.<\/em><\/p>\n<p>We use LongT5 with a 16k input token limit as our base LLM. This LLM is useful for capturing long-distance dependencies in podcast episodes. For transcripts longer than the 16k input limit, we split the text into smaller chunks and process them independently, which can lead to a loss of global context essential to understanding the full structure.<\/p>\n<p>Predicting chapters from local chunks alone can therefore reduce prediction accuracy. To address this limitation, we enrich each chunk with <strong>global contextual <\/strong>cues to help preserve overall coherence. Specifically, we leverage <strong>static context<\/strong>, including metadata like episode titles and descriptions, and <strong>dynamic context<\/strong>, which maintains a record of previously generated chapter titles. This dynamic context acts as a working memory to inform the generation of future chapters. 
We include both types of information as text in the input to the PODTILE model.\u00a0<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"400\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-700x400.png\" alt=\"\" class=\"wp-image-5879\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-700x400.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-250x143.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-768x439.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-1536x877.png 1536w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1-120x69.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image2-1.png 1595w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>PODTILE\u2019s input and output format. The static context contains the episode\u2019s title and description, and the dynamic context consists of the earlier chapter titles.<\/em>\u00a0<\/p>\n<h2 class=\"wp-block-heading\">Evaluation and Findings<\/h2>\n<p>We evaluated PODTILE on an internal podcast dataset using title and boundary accuracy metrics commonly used in text segmentation tasks. Specifically, PODTILE demonstrated an 11% improvement in title accuracy compared to the strongest baseline. Moreover, we found that for very long podcasts, which required chunking due to the model\u2019s input length restrictions, the metric improvements were nearly double those of shorter podcasts that didn&#8217;t need chunking. 
This finding underscores the effectiveness of our modeling in capturing global (static and dynamic) context.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"143\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4-700x143.png\" alt=\"\" class=\"wp-image-5880\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4-700x143.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4-250x51.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4-768x157.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4-120x25.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image4.png 1330w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>Comparison of the amount of improvement in long transcripts vs. shorter ones that don&#8217;t need chunking<\/em>.<\/p>\n<p>We also conducted a qualitative comparison of chapter titles generated by PODTILE with those from the baseline. 
We found that, by leveraging static and dynamic context, PODTILE\u2019s titles are more informative.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"230\" src=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1-700x230.png\" alt=\"\" class=\"wp-image-5881\" srcset=\"https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1-700x230.png 700w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1-250x82.png 250w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1-768x253.png 768w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1-120x39.png 120w, https:\/\/storage.googleapis.com\/research-production\/1\/2024\/10\/image1-1.png 1298w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\"\/><\/figure>\n<\/div>\n<p class=\"has-text-align-center\"><em>Comparison of chapter titles generated by PODTILE against those from the baseline. PODTILE\u2019s chapter titles are more informative.<\/em><\/p>\n<p>In April 2024, we began a limited roll-out of our chapterization model.\u00a0<\/p>\n<p>We anticipated that, by broadening the availability of chapters, high-quality auto-generated chapters would lead to an increase in engagement. In fact, we observed an 88.12% increase in chapter-initiated plays in the first month of the roll-out.<\/p>\n<p>We also conducted an experiment to assess the impact of indexing chapter titles on search effectiveness. Using the TREC podcast dataset, designed for short segment retrieval and summarization, and using BM25 as the sparse retrieval method, we compared the performance of indexing episode descriptions alone against descriptions enriched with chapter titles. 
The results showed a 24% increase in R@50 when chapter titles were included in the episode description, demonstrating that chapter titles effectively summarize transcripts, which in turn improves retrieval effectiveness.<\/p>\n<h2 class=\"wp-block-heading\">Conclusions<\/h2>\n<p>We introduced PODTILE, a solution for supervised podcast chapterization that effectively models the global context of the episode. PODTILE addresses the challenges of extreme length, long-distance dependencies, and low structuredness. PODTILE outperforms state-of-the-art baselines in offline evaluation, with particularly notable improvement for extremely long podcasts. The deployed solution has significantly increased our catalog coverage, and analysis of user interaction data highlighted its value for less popular shows. Additionally, we evaluated PODTILE\u2019s usefulness in other downstream tasks: in offline evaluation, we showed that adding chapters to episode descriptions increases episode search quality.<\/p>\n<p>For more information, please refer to our paper:<br \/><a href=\"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3627673.3680081\" target=\"_blank\" rel=\"noopener\">PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters<\/a>\u00a0<br \/>Azin Ghazimatin and Ekaterina Garmash, Gustavo Penha, Kristen Sheets, Martin Achenbach, Oguz Semerci, Remi Galvez, Divya Narayanan, Ofeliya Kalaydzhyan, Ann Clifton, Paul N. Bennett, Claudia Hauff, Mounia Lalmas<br \/>CIKM 2024<\/p>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>October 24, 2024 Published by Azin Ghazimatin and Ekaterina Garmash, Gustavo Penha, Kristen Sheets, Martin Achenbach, Oguz Semerci, Remi Galvez, Divya Narayanan, Ofeliya Kalaydzhyan, Ann Clifton, Paul N. Bennett, Claudia Hauff, Mounia Lalmas Listeners often find it challenging to navigate long podcast episodes because of their duration. 
This makes it difficult for [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":134209,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[38],"tags":[6770,6769,6771,1568,6768,1202,6767],"class_list":{"0":"post-134207","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-spotify","9":"tag-browsing","10":"tag-chapters","11":"tag-episode","12":"tag-facilitating","13":"tag-podcast","14":"tag-podtile"},"_links":{"self":[{"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/posts\/134207","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/comments?post=134207"}],"version-history":[{"count":0,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/posts\/134207\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/media\/134209"}],"wp:attachment":[{"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/media?parent=134207"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/categories?post=134207"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/showbizztoday.com\/index.php\/wp-json\/wp\/v2\/tags?post=134207"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}