 




<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Buzz’s Blog: On Web 3.0 and the Semantic Web &#187; continuous data</title>
	<atom:link href="http://itknowledgeexchange.techtarget.com/semantic-web/tag/continuous-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://itknowledgeexchange.techtarget.com/semantic-web</link>
	<description>Defining the necessary skills for future software professionals</description>
	<lastBuildDate>Sun, 16 Dec 2012 04:42:23 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>The Challenge of Complex Media in a Relational World, Part 2</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-2/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-2/#comments</comments>
		<pubDate>Tue, 20 Apr 2010 19:13:52 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[continuous data]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[namespaces]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-2/</guid>
		<description><![CDATA[In the previous posting of this blog, we looked at SQL-based relational databases and why they are not well suited to managing advanced forms of media, like images, language, video, and sound. Searching by semantics. Here, we look closely at one specific issue related to managing complex media: How to categorize and search advanced forms [...]]]></description>
				<content:encoded><![CDATA[<p>In the <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/">previous posting</a> of this blog, we looked at SQL-based relational databases and why they are not well suited to managing advanced forms of media, like images, language, video, and sound.  </p>
<p><strong>Searching by semantics. </strong></p>
<p>Here, we look closely at one specific issue related to managing complex media: How to categorize and search advanced forms of media by their “meaning” or “semantics”.  This is extraordinarily difficult, and in fact, in general, it is impossible.  This is why we usually rely on relatively low-level heuristics and can only simulate search-by-semantics in simplistic ways.</p>
<p>Consider a library of soundless video clips.  Let’s assume there are many thousands of them, and they vary in length from seconds to hours.  First of all, the only clips we can afford to download and actually view in real time are the ones that are only seconds or minutes in length, and we can do this only if we are somehow able to limit the search space to a small handful of candidates.  Keep in mind that a video can consist of twenty to forty images per second.</p>
<p>So what do we do?</p>
<p><strong>Searching previews.</strong></p>
<p>We could search tiny samples of our video clips, perhaps taken from the beginning, the middle, and the end of each clip, but this doesn’t actually well, either.  We need something that can scale, that is automated.</p>
<p><strong>Searching tags.  </strong></p>
<p>The dominant technique is to extract information concerning low level attributes of the video clips (such as their format and pixel count) automatically, and then have experts add more tagging information by using  widely adopted, formal <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/">namespaces</a>.  We might use a geography namespace to mark clips as having rivers and mountains in them.  </p>
<p>These two forms of tagging information might be encoded together using the very popular <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-dublin-core-and-the-metadata-object-description-schema-a-look-at-namespaces/">MPEG-7</a> language.  This creates a very indirect way of searching video clips.  We don’t actually search them.  We search the hierarchically constructed MPEG-7 tag sets that describe the videos.  This at least allows us to use SQL in a reasonably straightforward way to do the searching. </p>
<p><strong>Searching for specific images.  </strong></p>
<p>There is very good technology for processing images for fixed pixel-based subcomponents like individual faces.  We can also search for video clips that have any faces at all in them.  </p>
<p>In general, it’s easier to search for things made by people because they tend to be more angular and regular in shape.  These include specific buildings and types of aircraft.  </p>
<p><strong>Searching for colors and shapes.</strong></p>
<p>We can also search for more abstract subcomponents of images, like polygons, circles, and the like.  Despite the fact that video images are pixel-based (or “raster”), there is good technology for isolating the lines that form the boundaries of subcomponents. </p>
<p>And we can look for colors and compare the relative location and dominance of various colors, like images where 63% of them are a particular shade of orange.  </p>
<p><strong>Searching for change over time.<br />
</strong><br />
We can also search for pattern changes in the series of images that make up a video clip.  </p>
<p>But none of this has much to do with the real meaning or semantics of images and the video clips they form.  Taking this next step is huge challenge.  </p>
<p><strong>Semantics.</strong></p>
<p>How can look for a setting sun or a ball moving across a tennis court, without knowing the details of the sunset or the particular tennis court in advance?  </p>
<p>We can use the colors and shapes approach to look for a big orange ball descending below a possibly-jagged horizontal line.  We could look for a small, white or yellow spherical object move across a big green rectangle.</p>
<p>One way to raise the bar a bit is to use domain-specific knowledge about the images being processed.  It’s a whole lot easier to spot that tennis court if we know that’s what we’re looking for.  Then we can fill our searching software with lots of detailed information about the various sorts of tennis courts.  We can also more easily isolate the tennis court in a larger image if we know it’s there somewhere.  This gives us an extra edge, so we can perhaps find the court, even if it turns out to be brown and not green, or if the surrounding terrain is almost the same color as the court.</p>
<p>We of course never get away from searching by heuristics that only simulate the process of determining the true meaning of a series of images.  We can never truly search by semantics.</p>
<p><strong>But we can do something else: we can get humans into the loop and train our software to do a better job.  We’ll look at this next.</strong></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Challenge of Complex Media in a Relational World, Part 1</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/#comments</comments>
		<pubDate>Tue, 13 Apr 2010 17:57:19 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[blob data]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[PostgreSQL]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[SQL Server]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/</guid>
		<description><![CDATA[Relational databases: the dominant technology. Relational database management systems, such as MySQL, Oracle, MS SQL Server, DB2, and Postgresql, support the relational model. A database is broken up into tables, and each table consists of rows. Each row is a series of values. A row in a table called Insured Drivers in a motor vehicle [...]]]></description>
				<content:encoded><![CDATA[<p><strong>Relational databases: the dominant technology.</strong></p>
<p>Relational database management systems, such as MySQL, Oracle, MS SQL Server, DB2, and Postgresql, support the relational model.  A database is broken up into tables, and each table consists of rows.  Each row is a series of values.  A row in a table called Insured Drivers in a motor vehicle database might consist of:</p>
<p>Fred, 2010 Toyota Prius, State Farm Insurance, 1112233444.</p>
<p>1112233444 might be a unique identifier that the government assigns to each driver.  This would be the “primary key” for the table Insured Drivers.  The point is that human names are not at all unique, and so in relational databases, we introduce artificial keys in order to disambiguate queries.  We still need the value Fred in the row because we want to know how to address him with a letter or email.</p>
<p><strong>Problems with relational databases.</strong></p>
<p>There are a few critical points to note with this approach.  First, such a simple way of representing data allows the database to quickly deliver large sets of rows from this table to the memory of a computer, so that they can be effectively searched in bulk.  We might want to know the names of all people who drive a Toyota Prius and are insured by State Farm, for example.</p>
<p>Another thing is that we might like to be able to put more complex items in a row.  We might want to have another value in a row, one that gives a driver’s address.  But an address has a few parts to it, and is not itself a simple value like a name or a car model or the name of an insurance company.  </p>
<p>It is important to also note, however, that relational databases do indeed support the creation of more complex values, such as an address.  But the more complex values we put in rows in tables, the harder it is to read in a large number of rows at once.  </p>
<p>In fact, we could create a value that represents a very complex object, one that refers to rows in other tables.  For example, we might want to replace the value Fred with a reference to a row in another table called Licensed Drivers, because there is a lot we might want to know about Fred, other than just his name.  But then it would become very difficult to read in lots of rows of a single table quickly.  </p>
<p>It might be that if we follow a link to another table that describes drivers, these rows might themselves have links in them, thus allowing a value in a row to actually consist of an object, like we would in Java or C++.  And in general, these links between tables could be chained together, and extend arbitrarily far.  Do we chase all of these linked references down for every row of Insured Drivers, or do we not follow any of these links so we can read in a large number of rows?  Then we would worry later about getting more information on each driver.</p>
<p>Importantly, relational databases are still very much the dominant database technology in use in businesses and other organizations, as well as on the Web.  We need to keep in mind that we have already aggressively extended them by supporting values that have internal structure (like addresses) and with the ability to create complex objects (like drivers).  How far do we go in extending them?  </p>
<p><strong>Where we stand today.</strong></p>
<p>Indeed, the extensions we have already made to relational databases have created a serious optimization problem.</p>
<p>But it’s worse than that.  Here’s something else to consider.  Relational databases were born into a world where flat business data was pretty much the only game in town.  However, relational databases are being asked to manage far more sophisticated forms of data, like photos and video clips and voice tracks.  There are a couple of problems that crop up.  First, a row with a video clip as a field could be huge.  We might only be able to read in a single row at a time and this could make searching an entire table intractable.  Worse, how do we even search for rows that contain certain pieces of video?  How can we search for all video clips that show Fred getting into a car accident?</p>
<p><strong>Where to go from here.</strong></p>
<p>In previous postings of this blog we have looked at <a href="http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-what-is-it-why-do-we-care/">media databases</a>, and in particular, at techniques that can be used to <a href="http://itknowledgeexchange.techtarget.com/semantic-web/tag/dublin-core/">tag</a> complex forms of blob and continuous media (like photos and video clips).  What’s important to note, though, is that there is a major dilemma right now in the world of database software.  Can we continue to shoehorn more and more complex forms of data into relational databases, or do we need to throw in the towel and start over?</p>
<p><strong>More on this next time&#8230;</strong></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/the-challenge-of-complex-media-in-a-relational-world-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Parallel Worlds of Media Databases and Media Metadata</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/the-parallel-worlds-of-media-databases-and-media-metadata/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/the-parallel-worlds-of-media-databases-and-media-metadata/#comments</comments>
		<pubDate>Wed, 17 Feb 2010 00:38:03 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[3D modeling]]></category>
		<category><![CDATA[blob data]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[MODS]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[namespaces]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[the Metadata Object Description Schema]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/the-parallel-worlds-of-media-databases-and-media-metadata/</guid>
		<description><![CDATA[Searching traditional business data: straight-forward. Managing advanced forms of media, such as images, sound, video, natural language text, and animated models have been discussed a number of times in this blog in the past.  Traditional information systems, such as relational databases, have been engineered largely to handle the sorts of data we have in business applications, [...]]]></description>
				<content:encoded><![CDATA[<p><strong>Searching traditional business data: straight-forward.</strong></p>
<p>Managing advanced forms of <a href="http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/">media</a>, such as images, sound, video, natural language text, and animated models have been discussed a number of times in this blog in the past.  Traditional information systems, such as relational databases, have been engineered largely to handle the sorts of data we have in business applications, primarily simple numeric and character string data.  To the SQL database programmer, the nice part is that the data speaks for itself.  If a field is called Name, and the value is Buzz King, the semantics of &#8220;Buzz King&#8221; is pretty obvious, and it can be processed in a largely automatic fashion.  The same goes for a field called Age, with a value of &#8220;97&#8243;.  </p>
<p><strong>Searching advanced media: far, far more difficult.</strong></p>
<p>But modern media is far more complex than this.  &#8221;Blob&#8221; data like images, and continuous data, like sound, video, and natural language text, are very difficult to search and interpret automatically.  There are two approaches that have been taken to resolve this dilemma.  </p>
<p><strong>Tagging: the simple approach.</strong></p>
<p>The first is <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-dublin-core-and-the-metadata-object-description-schema-a-look-at-namespaces/">tagging</a>.  Descriptive terms, often taken from large, shared vocabularies, at attached to pieces of media.  These vocabularies can be very domain-specific, dedicated to areas like medicine, law, and engineering.  </p>
<p><strong>Intelligent processing software: the second approach.</strong></p>
<p>The second technique is the automatic processing of pieces of media using image processing, natural language, and other highly intelligent software.  These applications are very sophisticated and understood only by experts.  And, these applications often demand a lot of processing time, and this makes bulk processing impossible.  It’s also true that the results can be haphazard.  Some pieces of media can be interpreted precisely, others not so precisely &#8211; and dramatic mistakes are frequent.  A tennis court might be mistaken for an airplane runway.  There&#8217;s a huge trust factor involved in cranking up image or sound processing software or natural language software.  </p>
<p>Often, we can provide feedback so that these applications can learn, over time, the way we want media to be interpreted.  We can help the software learn the difference between a tennis player and a member of a ground crew on a small runway.  All of this is hugely expensive, in terms of the cost of developing the software, and in terms of the physical resources needed to run the software.</p>
<p><strong>A middle ground?  Not really.</strong></p>
<p>So, is there some middle ground?  Something simple, yet more &#8220;intelligent&#8221;?  Yes, and the answer is to take a sophisticated approach to what otherwise might be very simple tagging techniques.  However, the core problem with tagging remains: we search and process tags &#8211; and not the actual data.  It is an indirect, but fast process.  The goal is to come as close as we can to simulating the results of such things as image processing, but to do it with a simple, yet comprehensive, accurate tag-based technology.</p>
<p>We&#8217;ve looked at some of the solutions that have been proposed.  They include <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-dublin-core-and-the-metadata-object-description-schema-a-look-at-namespaces/">Dublin Core, MODS, and MPEG-7</a>.  The first is very simplistic.  The second is more sophisticated, in that the terminology used is broader and far more precise.  The third is very aggressive in that it supports the complex structuring of tag data elements.  </p>
<p><strong>So, what are we really doing?</strong></p>
<p>In essence, we build a hierarchy of metadata and then instantiate it for every piece of media we want to catalogue and later search.  What we are doing is creating a parallel database, one where every piece of blob or continuous data is accompanied by a possibly very large tree of structured tagging information.  The parallel database has its own schema and an instance of it is created for every piece of media in the original media database.</p>
<p>The end result?  Instead of creating some sort of media-centric query language, like an SQL-for-video, we give up on trying to search the media database itself.  The query language remains largely ignorant of the nature of blob and continuous media.  We can continue to refine and expand the schema of the parallel database until search results are satisfactory.</p>
<p><strong>More later&#8230;</strong></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/the-parallel-worlds-of-media-databases-and-media-metadata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Media Management Workflow: the world of plugins and format standards</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/media-management-workflow-the-world-of-plugins-and-format-standards/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/media-management-workflow-the-world-of-plugins-and-format-standards/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 19:09:01 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[3D animation]]></category>
		<category><![CDATA[3D modeling]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[media applications]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/media-management-workflow-the-world-of-plugins-and-format-standards/</guid>
		<description><![CDATA[Animation workflow. I teach an introductory 3D animation course at my university. The primary application we use is Autodesk Maya, the gold standard of 3D modeling and animation applications. Maya is a vast application that can take many years to learn and that requires a lot of skill to use effectively. So trying to learn [...]]]></description>
				<content:encoded><![CDATA[<p><span><strong>Animation workflow.<br />
</strong><br />
I teach an introductory 3D animation course at my university.  The primary application we use is Autodesk Maya, the gold standard of 3D modeling and animation applications.  Maya is a vast application that can take many years to learn and that requires a lot of skill to use effectively.  So trying to learn Maya in a semester isn’t realistic.  </p>
<p>As it turns out, the do-it-in-a-semester approach is even more challenging &#8211; because creating a complete animated video, something my students are required to do, involves other complex applications.  At a minimum, scenes rendered into video with Maya have to be edited with a video editing application and a soundtrack synchronized with the video.</p>
<p>The complexities of workflow in animation projects are far more extensive than this, and we will look at this in a future posting of this blog.</p>
<p>Today, we focus on the favored technology for providing application-to-application workflow: plugins.  Basically, the world of animation and media apps is exploding with them.  Here’s what the landscape looks like.</p>
<p><strong>Making applications fragile: plugins.  </p>
<p></strong>Since they might focus much of their career on a single application, animation, photography, video, and audio professionals can panic at the thought of any substantive changes in the application that drives their daily work.  One major reason is workflow.  Even minor updates to an application can lead to broken plugins.  This is because vendors who sell plugins that allow an application to import and export from other applications are often far behind the version schedule of the two applications.  This is true even when the plugin vendor sells one of the two applications.  Also plugins tend to lag in bug fixing also, as plugin vendors struggle to understand the internal workings of applications they do not sell.</p>
<p><strong>The one-of-a-kind nature of app-to-app communication.</strong>  </p>
<p>Master each plugin can be a huge investment in itself; this is because complex forms of data are typically being passed between one app and another.  And interfaces between various applications tend to be one-of-a-kind, and only partly undocumented.  Plugins can also demand that the users  program complex configurations.  </p>
<p>Moving 3D models between applications, moving scenes between a modeling application and a renderer, if not done very painstakingly, can lead to nasty results.  Often, some of the information that the user knows must be configured doesn’t seem to appear in the menus or palettes of the plugin.  It might also be true that plugins for two different renderers conflict with each other.</p>
<p><strong>The various kinds of plugins.</p>
<p></strong>Plugins also come in many different flavors, interacting with import/export applications very differently.  Here are a handful of sorts of plugins:</p>
<p><strong>1. external app-to-app connections: import/export of minimally-compatible standards.</p>
<p></strong>These are very common, and typically don’t involve any external plugins to be installed.  They require that information (such as models or scenes or complete videos) be exported, stored in the file system of the computer, and then separately imported into the second application.  Many 3D modeling applications do this by using the OBJ format.  But this sort of import/export tends to be highly error prone, as various pieces of information get dropped or damaged.</p>
<p><strong>2. app-to-app connections: richer standards for saving files.</p>
<p></strong>There are a number of popular and emerging standards that are much richer, however, such as the FBX format (from Autodesk, the folks who sell Maya), which can accurately store a large amount of information about an animated scene.  Sometimes a plugin must be installed on one app or the other, but many cutting edge applications come right out of the box supporting richer data standards.</p>
<p>Another example is the Collada XML exchange format that can be used to insert models into game engines so they can be used in interactive games. </p>
<p><strong>3</strong>. <strong>Bringing one app into the GUI interface of another app.<br />
</strong><br />
These plugins can greatly ease the hassle of moving data around.  The drawing program Google SketchUp can have third party plugins installed that allows it to render to external applications.  This sort of plugin will create a menu or palette in one or the other application.  (I happen to use the IMSI renderer with SketchUp.)<br />
</span><br />
<span>Plugins can, in this way, effectively extend the capabilities of one application with some or all of the capabilities of a second app.  The 3D application Vue, which is used for building environments and settings (like deserts or oceans), can be used from within Maya by using a plugin supplied by the Vue folks.  </p>
<p><strong>4. Plugins that allow the re-synching of assets from an outside application.</strong></p>
<p>Poser, the popular application for modeling humans and critters, can be used to import models into Maya, and not only is the Maya GUI extended with Poser options, but Poser can be told periodically (from within Maya) to re-sync models from Poser.  <strong></strong></span></p>
<p><span><strong>5. Extracting focused assets from one application and importing them into another. </p>
<p></strong>Sometimes the focus of a plugin is to extract highly specialized pieces of information that don’t make up any sort of standard piece of media, like models or images or video.  For example, Smart Sound provides a plugin for their application SonicFire, so that sound markers from Apple’s Final Cut video editor can be used within SonicFire to time a soundtrack within SonicFire to a video created with Final Cut.</p>
<p><strong>6. The hardwired connection between Adobe’s Soundbooth and Premiere Pro.  </p>
<p></strong>These perhaps hardly qualify as plugins at all, but Adobe has created a standardized format for representing audio so that their sound editor SoundBooth and their video editor Premiere Pro can work together fluidly.  Users end up adopting the Adobe format as a primary audio format within their overall workflow.</p>
<p><strong>We will look more at media workflow&#8230;</strong></span></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/media-management-workflow-the-world-of-plugins-and-format-standards/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The imposing heterogeneity of media applications</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/the-imposing-heterogeneity-of-media-applications/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/the-imposing-heterogeneity-of-media-applications/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 20:58:53 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[3D animation]]></category>
		<category><![CDATA[3D modeling]]></category>
		<category><![CDATA[advanced Web apps]]></category>
		<category><![CDATA[automating Web searches]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[media applications]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[the Metadata Object Description Schema]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[video containers]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[Web development]]></category>
		<category><![CDATA[Web development frameworks]]></category>
		<category><![CDATA[XML Schema]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/the-imposing-heterogeneity-of-media-applications/</guid>
		<description><![CDATA[This blog is dedicated to the discussion of emerging web technologies. Today, we look at a the rapidly growing world of media applications, and their impact on the Semantic Web. The problem of searching for media assets. We’ve already looked at advanced media, in particular video, audio, and animation data, in previous blog postings. In [...]]]></description>
				<content:encoded><![CDATA[<p>This blog is dedicated to the discussion of emerging web technologies.  Today, we look at a the rapidly growing world of media applications, and their impact on the <a href="http://itknowledgeexchange.techtarget.com/semantic-web/what-do-we-mean-by-semantic-web/">Semantic Web</a>.  </p>
<p><strong>The problem of searching for media assets.</strong></p>
<p>We’ve already looked at advanced media, in particular video, audio, and animation data, in previous blog postings.  In particular, we’ve looked at the subtle and complex nature of <a href="http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-the-problem-of-subtle-semantics/">media asset semantics</a>.  We’ve seen that interpreting a piece of video, for example, is far, far more difficult than interpreting an integer or character field.  Since the goal of the Semantic Web effort is to make the searching of the web highly automated, advanced media is becoming a huge and critical research and development focus for the builders of next-generation web development applications.</p>
<p>Just how do we provide an environment where media assets can be searched in a mostly automatic fashion, so that a human does not have to painfully paw through hundreds or thousands (or millions) of video chunks to find the right one?  We’ve looked at <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-world-of-advanced-applications-what-are-they/">emerging technologies</a> for marking up advanced media information, and for making it usable in a variety of web applications.  We’ve also looked at the dramatic challenge presented by <a href="http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/">mega apps</a> to would-be users; the interfaces to these applications are truly massive and cannot present to the user the way in which they are meant to be used. </p>
<p><strong>The problem of proprietary formats.<br />
</strong><br />
One specific, and very difficult problem, is the massive heterogeneity, not just of media formats, compression technologies, and container technologies, but of the applications themselves.  If we are going to automate the searching of complex modeling, video, audio, and other media assets, we’re going to have to address a key question: since many media apps make use of their own proprietary data formats, how are we going to provide automated ways of searching media assets that are stored in these formats?</p>
<p><strong>The problem of highly imperfect generic formats.</strong></p>
<p>There are indeed many existing, as well as soon-to-emerge, standards for importing and exporting data between powerful media applications, but transformations in and out of these formats are often “lossy”, in that information is lost or changed.  In fact, locating and downloading assets that are in supposedly-generic form is often very frustrating, because these assets end up not performing well.  They can be difficult to edit and reuse.  3D animation models regularly blow up when animators try to import them into animation applications and the manipulate them.  A hawk may look like a hawk until you try to render it with its wings flapping, and suddenly it’s a blob of geometric garbage.</p>
<p><strong>One possible direction.</strong></p>
<p>So, what do we do about the fact that many media assets must be manipulated by the original applications that created them?  How can we facilitate reuse?  It’s extremely unrealistic to expect users to master perhaps dozens of video or audio or animation applications.  Filtering assets according to their file extensions is a good idea, and it is a well established practice.  </p>
<p>But what we really need is a globally-known site that either literally or conceptually centralizes the massive network of import/export relationships, along with information about the relative success of these mappings.  Are they ever lossy?  If so, can they be fixed?  What series of applications might we want an asset to be imported/exported through so that in the end it is in a usable format, given the applications that the user owns and has mastered?</p>
<p>There is much to be done.  Right now, searching for and reusing media assets is a painstaking, trial-and-error-prone process.</p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/the-imposing-heterogeneity-of-media-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Multimedia: The Problem of Subtle Semantics</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-the-problem-of-subtle-semantics/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-the-problem-of-subtle-semantics/#comments</comments>
		<pubDate>Sat, 03 Oct 2009 21:12:46 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[3D animation]]></category>
		<category><![CDATA[3D modeling]]></category>
		<category><![CDATA[advanced Web apps]]></category>
		<category><![CDATA[automating Web searches]]></category>
		<category><![CDATA[blob data]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[rich internet apps]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[smart search engines]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[web applications]]></category>
		<category><![CDATA[Web development]]></category>
		<category><![CDATA[Web development frameworks]]></category>
		<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-the-problem-of-subtle-semantics/</guid>
		<description><![CDATA[The challenge of the Semantic Web. We’ve looked at the emerging Semantic Web technology in the previous postings of this blog. The idea is to have a far, far smarter Web, one where the process of finding and interpreting and making use of far flung information can be largely automated. This is in sharp contrast [...]]]></description>
				<content:encoded><![CDATA[<p><strong>The challenge of the Semantic Web.</strong></p>
<p>We’ve looked at the emerging Semantic Web technology in the previous postings of this blog.  The idea is to have a far, far smarter Web, one where the process of finding and interpreting and making use of far flung information can be largely automated.  This is in sharp contrast with today’s Web, where these things have to be done in a painful, extremely time-consuming fashion.</p>
<p>So that is the key challenge.  It has to do with searching the kinds of information that are important to us in our daily lives.  This information, as it turns out, is very difficult to process automatically.  Why is this?</p>
<p><strong>The complexity of modern multimedia.</strong></p>
<p>I teach a very basic 3D animation class to mostly computer science students.  We use Maya, arguably the most popular 3D animation application, one that is used in the making of many animated features.  The interesting thing about animation is that it is truly multimedia.  It can give us a lot of insight into what we need the new Web to do for us.</p>
<p>That’s because the number and diversity of applications that are used for drawing, documenting, modeling, animating, motion capture, texturing, video rendering, video editing, video conversion and compression, sound editing, in even small projects, can be very impressive.  Correspondingly, the wide variety and complexity of media formats involved in an animation project can be overwhelming.  </p>
<p>What happens in an animation project?  The workflow might begin with vector storyboard drawings to break the story down into scenes. In a typical animation project, 3D models in a variety of proprietary formats are used.  Models must be transformed as they are exported from one application and imported into the next. Multiple video renders of animated models are made, and they must be edited together, along with multiple sound files.  Multiple video and audio formats might be used. 2D images are used for textures; photographs of butterfly wings can be used to make an animated butterfly very realistic, and a checkerboard image made with Photoshop can be used to make a Linoleum floor.  And along the way, a variety of note taking, screen capture, and conferencing software might be used to facilitate group communication.</p>
<p>There is also a heavy focus on reuse in an animation project.  Building every model, editing every texture, creating every environment and background, recording every sound from scratch is frequently intractable.  If existing assets cannot be tailored and reused, the project would be far too expensive and time consuming, and would demand too wide a variety of professionals to always be available.  This raises the multimedia stakes, as assets of widely differing forms must be constantly reconfigured and used in concert in new ways.</p>
<p>But what’s the real problem?  We aren’t all trying to produce complex animated videos.  But very interestingly, in our everyday lives we essentially face the animator’s challenge when we try to find and use information on the Web.  That’s because we’re often looking for things whose meaning, whose interpretation, demands focused human thought.  We are looking not for business data, but for pieces of media, and the problem is that today, most of our searching has to be based on tags or brief textual descriptions that are associated with pieces of media, and not on the true meaning of the media itself.  </p>
<p><strong>The needs of the business world are not our needs.<br />
</strong><br />
It’s the subjective nature of media assets &#8211; this is what is at the heart of the problem facing us.  Existing technology for searching the web is based on keywords and very short pieces of text.  </p>
<p>There is other technology, though, under active development, stuff that serves as the information storage backbone of most commercial websites.  It’s the technology that has for decades been used in-house (not on the Web) by businesses when they process large databases.  But this stuff was designed to handle traditional business data forms, like integers, character strings, real numbers, dates, timestamps, and full text.  </p>
<p>There is more, though.  All of the major database management systems, along with tools for building and searching advanced websites are being retrofitted (or in some cases, built from the ground up) to manage more than keywords and text, more than standard business data.  </p>
<p>But up to  now, the focus has not been on supporting the kinds of information you and I are most interested in.  The focus has been on extending database and Web technology to support xml documents, as well as more complex data objects, like those inside a Java program, as well as other forms of data found inside programs. This includes arrays and lists and short pieces of textual data, like the names of diseases.  </p>
<p>In other words, we’ve been busy extending our support of the business world, so they can store complex business data in databases and make that information  processable over the Web.  You and I have largely been left out.</p>
<p><strong>Finally, we are attacking our needs.</strong></p>
<p>But there now many ongoing efforts to extend database and Web technology to make it useful to us.  The new focus is on supporting blob and continuous media like images, video, and audio.  This is extremely hard to do.</p>
<p>Why?  Because the strongest means by which we deduce the meeting of business data is by looking at its internal structure and the terms that are used to describe that structure.  A relational table named Prescriptions, with a character attributes Patient Name, Doctor’s Name, and Medication, and with a numeric attribute Dosage, is pretty easy to interpret.  </p>
<p>But what do we do with a photograph, which is just a grid of pixels with no internal structure?  Or a long series of images, along with a sound track, put together to form a piece of video?  </p>
<p>The U.S. military has been pumping money into image processing for several decades, and so all is not lost.  There is a vast body of mathematical research and software development that allows us to write programs that can find a particular face in a crowd and search satellite photos for airplane runways.  But in general, we cannot at this time write a program that can process an arbitrary photo or video clip and tell us what it <strong></strong><em>means</em><strong>.  </strong>That means we can’t quickly search vast media database for useful pieces of information.</p>
<p>The goal behind the Semantic Web effort is to build a new generation  of websites whose information can be searched automatically, and where information from multiple sites can be automatically integrated.  To do this with numeric and character based data is quite doable.  But when it comes to multimedia, like images and sound and video and 3D models and engineering designs, well, we have a long way to go. The meaning &#8211; in other words, the semantics &#8211; of these forms of data are complex and subtle, and highly dependent upon an individual’s interpretation of that media.</p>
<p><strong>So, we see that we have only just begun our journey to create the new Web.</strong></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/multimedia-the-problem-of-subtle-semantics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mega Media Apps: A Huge Challenge for Web 3.0</title>
		<link>http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/</link>
		<comments>http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/#comments</comments>
		<pubDate>Mon, 13 Apr 2009 03:27:13 +0000</pubDate>
		<dc:creator>Roger King</dc:creator>
				<category><![CDATA[3D animation]]></category>
		<category><![CDATA[3D modeling]]></category>
		<category><![CDATA[blob data]]></category>
		<category><![CDATA[codecs]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[Maya]]></category>
		<category><![CDATA[media applications]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[video containers]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[web applications]]></category>

		<guid isPermaLink="false">http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/</guid>
		<description><![CDATA[What Are Web 2.0 and Web 3.0 Apps? In our continuing series on Web 2.0/3.0 and Semantic Web technology, we&#8217;ve discussed one particularly impressive Web 2.0 app: Evernote. The challenge is to get the best of both worlds: the interactive performance of a desktop application, and the use-it-from-anywhere convenience of the Web. Many Web applications [...]]]></description>
				<content:encoded><![CDATA[<p><strong>What Are Web 2.0 and Web 3.0 Apps?</strong></p>
<p>In our continuing series on <a href="http://itknowledgeexchange.techtarget.com/semantic-web/the-difference-between-web-2-and-the-semantic-web/" target="_blank">Web 2.0/3.0 and Semantic Web</a> technology, we&#8217;ve discussed one particularly impressive Web 2.0 app: <a href="http://itknowledgeexchange.techtarget.com/semantic-web/a-look-at-a-web-20-app/" target="_blank">Evernote</a>. The challenge is to get the best of both worlds: the interactive performance of a desktop application, and the use-it-from-anywhere convenience of the Web. Many Web applications &#8211; such as Evernote &#8211; also ensure offline usability by providing both a desktop and webpage interface, and maintaining a local version of the database, which is periodically synched with the web-resident database.</p>
<p>But, as cleverly engineered as it is, and as useful as it is, Evernote is still a very simple application. What about big applications? What challenges face the developers of Web 3.0 applications, ones that will manipulate large databases of continuous data, and extra-large instances of blob data? (Video and sound are continuous; an image is blob data.)</p>
<p>Let&#8217;s consider one of the biggest media apps out there: Maya, the high-end 3D application that is widely used to make full length animated movies. (See http://autodesk.com for Maya.)</p>
<p>What&#8217;s the big problem? If an application like Maya was reengineered as a Web app along the lines of Evernote, would it be usable? Might it be intractable to be continuously moving complex animation data between the server and your client machine?</p>
<p><strong>3D Geometry: Just How Big Is It?</strong></p>
<p><a href="http://cdn.ttgtmedia.com/ITKE/uploads/blogs.dir/110/files/2009/04/snapz-pro-xscreensnapz0011.jpg"></a><a href="http://cdn.ttgtmedia.com/ITKE/uploads/blogs.dir/110/files/2009/04/snapz-pro-xscreensnapz0011.jpg"></a><a href="http://cdn.ttgtmedia.com/ITKE/uploads/blogs.dir/110/files/2009/04/snapz-pro-xscreensnapz0011.jpg"></a></p>
<p>Well, the problem is not the complex geometric models that an application like Maya must store and manipulate. 3D animation applications like Maya tend to support multiple ways of creating 3D shapes, and they do indeed tend to be very data-intensive. The first image at the bottom of this page shows a Maya screen with two spheres, one built with straight line geometry and one built with curved line geometry.</p>
<p>As it turns out, to make the straight line model smooth, you would need to use many more lines and vertices than I have in the the model in the image. But if you think about it, the straight line model uses the geodesic dome approach; it builds a 3D sphere out of many 2D polygons &#8211; which are flat. The more polygons, the smoother the model. In the other model, we use curved lines, and so the model looks much smoother, even with not that much detail. But the mathematics are complex.</p>
<p>You can image that a dense scene, with a very large number of detailed, 3D models of these sorts would contain a lot of data. But no, that&#8217;s not the problem. These models can be uploaded and download very quickly. They aren&#8217;t as big as you might image &#8211; because they are not continuous data. They are blobs, either binary or of code text, and are reasonably manageable.</p>
<p><strong>The Killer Problem: Video.</strong></p>
<p>The problem? It&#8217;s what Maya creates at the end of the design process, when Maya renders a scene so we can watch it. It renders video. And video, whether you are looking at video shot with your home camera, or at video rendered by Maya, or video I create when I capture desktop videos on how to use Maya and post it for my animation students, well, it&#8217;s big. Really big.</p>
<p>Video is the killer. Video makes a lot of mega apps, and even very simple apps that happen to create video, not scale. We could manage a modest number of modest-sized video segments via a web interface, but not big chunks of video. To make videos even worse, we usually have to add a sound track.</p>
<p>So, the lesson is that many or most applications that create and/or edit video in any form face this challenge.</p>
<p>This is why we use video <em>compression. </em> First, you need a <em>container</em>, which is a way of bundling the huge series of still images that make up the video, with the sound, as so that we can move it around as a single object. (Keep in mind that often consists of at least 25 frames, or still images, per second &#8211; and that makes for big pieces of continuous data.) Popular containers for small scale projects (such as animations that will be marketed via CDs) are .mov and .avi. The first is the Apple Quicktime standard, and the second is due to Microsoft.</p>
<p>Once you have a container, you need a <em>codec</em>, which is a way of compressing and decompression video, so that it isn&#8217;t so big when you move in over the Internet or store it on a small storage device. Codec actually stands for &#8220;code&#8221; and &#8220;decode&#8221;. It cannot be overstated how powerful a codec can be; I routinely turn gigabyte videos submitted by my students into less-than-100 megabyte videos. They can be uploaded to a website and then played, and at least in a small box on a web page, they look great.</p>
<p>But if you want quality, if you don&#8217;t want to lose detail, and if in particular, if you are going to display a video on a large display (or at the movie theatre), you often cannot compress it enough.</p>
<p>That&#8217;s it. That&#8217;s the problem, and it&#8217;s one of the biggest challenges facing the makers of Web 3.0 apps, which are supposed to fluidly manipulate video segments.</p>
<p><strong>A Far Bigger, Far More Universal Problem.</strong></p>
<p>But perhaps the old video challenge, the one that is constantly shoved in the face of next-generation web app developers, is a distraction, something that draws us away from the real problem, the one that kills many media apps, even when they are totally desktop-based. What is it? Take a look at the animation designer&#8217;s interface to Maya, in the second image at the bottom of this page.</p>
<p>The problem is the size and complexity of these apps. There are made up of multiple complex windows. They have menus, palettes, and lots of little boxes that contain detailed information. Keep in mind that you only see one of the Maya windows in the image below, at the bottom of the page, and it is already too dense for a single screen, even a large one. Looking more closely at the window in this image, note that there are several places on it that contain drop down menus. Many of these menu items lead to other drop down menus. Even the main menu at the top is changed frequently during the process of creating an animation project. The designer&#8217;s GUI as a whole changes during the process of using the app.</p>
<p>It is very hard to fathom the incredible complexity of an interface like Maya&#8217;s until you use it. Professional video editing applications are typically simpler, but are still very complex, especially if the application supports special effects and the insertion of text. Even applications intended for the average Joe, like Photoshop Elements, are often horrifically complex.</p>
<p><strong>The Bottom Line.</strong></p>
<p>The problem that faces developers of all sorts of next-generation apps that must manipulate animation or sound or video or images, or that format complex documents for publication (like Adobe InDesign), or support the development of complex web pages (like Adobe Dreamweaver), is this: it is near-intractable or perhaps completely impossible to build an interface that explains to the user the process of using the application. Little wizards or chunks of documentation that contain &#8220;recipe&#8221; steps, don&#8217;t come within a thousand light-years of conveying how to use that app as a whole.</p>
<p>That&#8217;s it. True Web 3.0 applications would convey not just a vast, deeply embedded toolset, but the way the tools should be used. That&#8217;s the big challenge.</p>
<p>By the way, if you want to see a handful of videos made by my introductory animation students, go to my website at <a href="http://buzzking.squarespace.com" target="_blank">http://buzzking.squarespace.com</a> and look at the right column, near the bottom of the page.</p>
<p><a href="http://itknowledgeexchange.techtarget.com/semantic-web/files/2009/04/snapz-pro-xscreensnapz002.jpg"><img src="http://itknowledgeexchange.techtarget.com/semantic-web/files/2009/04/snapz-pro-xscreensnapz002-300x191.jpg" alt="" width="300" height="191" class="alignnone size-medium wp-image-85" /></a></p>
<p><a href="http://itknowledgeexchange.techtarget.com/semantic-web/files/2009/04/snapz-pro-xscreensnapz001.jpg"><img src="http://itknowledgeexchange.techtarget.com/semantic-web/files/2009/04/snapz-pro-xscreensnapz001-300x175.jpg" alt="" width="300" height="175" class="alignnone size-medium wp-image-87" /></a></p>
<!-- wpms-network-global-inserts -->]]></content:encoded>
			<wfw:commentRss>http://itknowledgeexchange.techtarget.com/semantic-web/mega-media-apps-a-huge-challenge-for-web-30/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
