TGDK.2.3.1.pdf
In multimedia annotation, it is often desirable to reference specific segments of a document because of its richness and multimodality, yet no universal representation for such references exists. This significantly hampers the use of multimedia content in knowledge graphs, where each document is modeled as one large, atomic information container. Unstructured data - such as text, audio, images, and video - can commonly be decomposed into its constituent parts, since such documents rarely convey only a single semantic concept; it is therefore reasonable to decompose these previously atomic documents into logical segments. To be processable by the knowledge graph stack, however, the atomic nature of multimedia content must be broken by providing a mechanism to address media segments. This paper proposes a Unified Segmentation Model capable of describing arbitrary segmentations on any media document type. The work begins with a formal analysis of multimedia and segmentation, exploring segmentation operations and how to describe them. Building on this analysis, it then develops a practical scheme for expressing segmentations in Uniform Resource Identifiers (URIs). By making segments of multimedia content referencable, this approach breaks their atomic nature and turns them into first-class citizens within knowledge graphs. The proposed model is implemented as a proof of concept in the MediaGraph Store, a multimedia knowledge graph storage and querying engine.
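To illustrate the general idea of addressing a media segment through a URI, the following sketch parses a temporal fragment in the style of the W3C Media Fragments URI syntax (e.g. `#t=10,20`). This is an assumption for illustration only: the abstract does not spell out the paper's own URI scheme, and the function name and example URL are hypothetical.

```python
from urllib.parse import urlsplit, parse_qsl

def parse_temporal_fragment(uri: str):
    """Extract a (start, end) pair of seconds from a media URI
    fragment such as '#t=10,20'.

    Illustrative sketch following W3C Media Fragments URI syntax,
    not the Unified Segmentation Model's actual scheme.
    """
    fragment = urlsplit(uri).fragment
    for key, value in parse_qsl(fragment):
        if key == "t":
            start, _, end = value.partition(",")
            # An omitted start defaults to 0; an omitted end is open-ended.
            return (float(start) if start else 0.0,
                    float(end) if end else None)
    return None  # no temporal fragment present

print(parse_temporal_fragment("http://example.org/video.mp4#t=10,20"))
```

A URI carrying such a fragment still identifies the whole document at the protocol level; the fragment is interpreted client-side to select the segment, which is one reason a richer, storage-level segmentation model is needed for knowledge graph processing.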