August 28, 2004

Idea: Driving Through Virtual Soundscapes

This is an idea for a new way to navigate interactively through large audio sets, such as collections of thousands of music tracks, and to automatically or interactively learn and evolve interesting trajectories through such spaces.

For each track, choose a representative location (by default, the middle of the track) and cut a 2.5-second soundbite from it. This soundbite is the audio "icon" for the track; it should convey the nature of the track.
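As a rough sketch of the default rule above, the helper below works out which slice of a track becomes its icon. The function name `icon_window` and the `center_fraction` parameter are my own illustrative assumptions; actually cutting the audio would need a real audio library, which is left out here.

```python
def icon_window(track_seconds, icon_seconds=2.5, center_fraction=0.5):
    """Return (start, end) in seconds for a track's audio icon.

    center_fraction=0.5 is the default "middle of the track" rule;
    other values are a hypothetical way to pick a different spot.
    """
    # A track shorter than the icon is used whole.
    if track_seconds <= icon_seconds:
        return 0.0, float(track_seconds)
    center = track_seconds * center_fraction
    start = center - icon_seconds / 2
    # Clamp so the window never runs past either end of the track.
    start = max(0.0, min(start, track_seconds - icon_seconds))
    return start, start + icon_seconds
```

For a 200-second track this picks the window from 98.75 to 101.25 seconds.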

Next, tile a virtual 2D space with audio icons. There are many possible tiling algorithms. One would place tracks with similar BPMs and keys closer to one another; others might tile by artist, genre, recording date, album, popularity rating, or several of these measures combined.
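Here is one very simple tiling, offered only as a sketch: sort the tracks by BPM and key, then lay them out row by row on a near-square grid, reversing every other row so that consecutive tracks stay next to each other. The dictionary fields (`title`, `bpm`, `key`) and the function name are assumptions for illustration. A one-dimensional sort like this only keeps similar tracks close along the snake path, so it is a starting point rather than a good layout.

```python
import math

def tile_tracks(tracks, sort_key=lambda t: (t["bpm"], t["key"])):
    """Lay tracks out on a near-square grid after sorting them.

    Returns ({(row, col): title}, grid_width).
    """
    ordered = sorted(tracks, key=sort_key)
    width = math.ceil(math.sqrt(len(ordered))) if ordered else 0
    grid = {}
    for i, track in enumerate(ordered):
        row, col = divmod(i, width)
        # Snake every other row so consecutive tracks remain adjacent.
        if row % 2 == 1:
            col = width - 1 - col
        grid[(row, col)] = track["title"]
    return grid, width

# Hypothetical example data.
tracks = [
    {"title": "a", "bpm": 120, "key": "Am"},
    {"title": "b", "bpm": 90, "key": "C"},
    {"title": "c", "bpm": 100, "key": "G"},
    {"title": "d", "bpm": 130, "key": "Em"},
]
grid, width = tile_tracks(tracks)
```

Passing a different `sort_key` (artist, genre, recording date) gives the other tilings mentioned above.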

Next, create a virtual vehicle that can move through this audio space. This vehicle is the "audio cursor" -- whatever it is located on in the audio space is played. The listener moves the audio cursor with a navigation input device (such as a mouse, trackpad, or the iPod trackwheel). In particular, it can be turned left or right in the virtual 2D landscape, and its velocity can be changed with something functioning like a gas pedal. It might also be able to go forward or in reverse. An ideal interface would be a car-cockpit input device like those now sold for use with PC racing games (steering wheel, pedals, shifter, etc.).
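To make the vehicle concrete, here is a minimal sketch of an audio cursor with a heading and a speed. The class and method names are hypothetical, the units (grid cells per second, radians) are assumptions, and the world is assumed to wrap at its edges. Wiring this to a real mouse, trackwheel, or steering wheel is not shown.

```python
import math
from dataclasses import dataclass

@dataclass
class AudioCursor:
    x: float = 0.0
    y: float = 0.0
    heading: float = 0.0  # radians; 0 points along +x
    speed: float = 0.0    # cells per second; negative means reverse

    def steer(self, delta_radians):
        """Turn left or right by changing the heading."""
        self.heading = (self.heading + delta_radians) % (2 * math.pi)

    def accelerate(self, delta_speed):
        """The 'gas pedal': raise or lower the speed."""
        self.speed += delta_speed

    def step(self, dt, width, height):
        """Advance the cursor by dt seconds, wrapping at the edges."""
        self.x = (self.x + math.cos(self.heading) * self.speed * dt) % width
        self.y = (self.y + math.sin(self.heading) * self.speed * dt) % height

    def current_cell(self):
        """Grid cell whose clip should be playing."""
        return int(self.x), int(self.y)

cursor = AudioCursor()
cursor.accelerate(2.0)
cursor.step(1.0, width=10, height=10)
```

After that one-second step the cursor sits over cell (2, 0), so the clip tiled there would play.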

Next, let the user "drive" through the audio landscape.

Randomly choose a starting point in the space and start playing the clip there. As the listener changes direction (via the trackwheel), playback crossfades between the current clip and the adjacent clip in the new direction. Until the user has fully turned onto the adjacent clip, both clips play, with volume and fade weight adjusted in proportion to the direction of travel. The effect is that the clip you are turning away from moves to its correct surround-sound location as its volume fades out, while the clip you are turning towards moves to its correct surround-sound location as its volume fades in.
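A sketch of the fade weights, under my own assumptions: `turn_fraction` measures how far the heading has rotated from the outgoing clip's direction toward the incoming clip's direction, and `crossfade_gains` turns that into two volume levels. The linear option matches the "in proportion" wording above; the equal-power curve is a common audio alternative that I have swapped in as the default, not something the idea requires. Surround-sound positioning is not covered.

```python
import math

def turn_fraction(heading_deg, from_deg, to_deg):
    """Fraction (0..1) of the turn from one clip's direction to the next."""
    # Signed shortest arcs, so turns across 0/360 degrees work too.
    span = (to_deg - from_deg + 180) % 360 - 180
    moved = (heading_deg - from_deg + 180) % 360 - 180
    return moved / span if span else 1.0

def crossfade_gains(fraction, equal_power=True):
    """Return (outgoing_gain, incoming_gain) for a turn fraction."""
    t = min(1.0, max(0.0, fraction))
    if equal_power:
        # Keeps perceived loudness roughly constant through the fade.
        return math.cos(t * math.pi / 2), math.sin(t * math.pi / 2)
    return 1.0 - t, t

halfway = turn_fraction(22.5, from_deg=0, to_deg=45)
linear = crossfade_gains(halfway, equal_power=False)
```

Halfway through a 45-degree turn, the linear version plays both clips at half volume.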

For simplicity, every location in audio space has 8 directions around it, corresponding to the 8 clips considered adjacent to it. The universe also wraps around at its edges. The same clip can be in many places in the space at once (so that it can be linked logically from other clips). In other words, there is no restriction that a clip can only exist in one location (although this could be enforced if desired).
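The 8-direction, wrap-around neighborhood is easy to sketch with modular arithmetic. The names below are illustrative assumptions; a separate mapping from cell to clip would let the same clip appear in several cells.

```python
# The 8 compass offsets around a cell (dx, dy).
DIRECTIONS = [(-1, -1), (0, -1), (1, -1),
              (-1, 0),           (1, 0),
              (-1, 1),  (0, 1),  (1, 1)]

def neighbors(x, y, width, height):
    """Return the 8 cells adjacent to (x, y) on a grid that wraps around."""
    return [((x + dx) % width, (y + dy) % height) for dx, dy in DIRECTIONS]

corner = neighbors(0, 0, width=4, height=3)
```

From the corner cell (0, 0) of a 4 by 3 grid, the first neighbor is (3, 2): stepping up and to the left wraps to the opposite corner.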

Next, run genetic algorithms on the landscape to organize the clips in the best ways (thematically, or by genre, or according to mood, etc.). Imagine that this virtual audio landscape actually has geography and topography -- corresponding to genres (location) and ratings or bpm (elevation).
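To give a feel for what evolving a layout could look like, here is a deliberately stripped-down sketch. It is not a full genetic algorithm: there is no population and no crossover, only mutation (swapping two cells) and selection (keeping the child if it is no worse). The fitness measure, the sum of BPM differences between neighboring cells, and all names are my own assumptions; `bpms` must contain exactly `width * height` values.

```python
import random

def layout_cost(grid, width, height):
    """Sum of BPM differences between each cell and its right/down neighbors."""
    cost = 0
    for y in range(height):
        for x in range(width):
            here = grid[y * width + x]
            right = grid[y * width + (x + 1) % width]      # wraps horizontally
            down = grid[((y + 1) % height) * width + x]    # wraps vertically
            cost += abs(here - right) + abs(here - down)
    return cost

def evolve_layout(bpms, width, height, generations=2000, seed=0):
    """Mutation-and-selection search for a smoother layout."""
    rng = random.Random(seed)
    best = list(bpms)
    best_cost = layout_cost(best, width, height)
    for _ in range(generations):
        child = best[:]
        i, j = rng.sample(range(len(child)), 2)
        child[i], child[j] = child[j], child[i]  # mutation: swap two cells
        cost = layout_cost(child, width, height)
        if cost <= best_cost:                    # selection: keep if no worse
            best, best_cost = child, cost
    return best, best_cost

# Hypothetical BPM values for a 3 by 3 grid.
start = [128, 90, 140, 100, 174, 95, 120, 85, 160]
layout, cost = evolve_layout(start, width=3, height=3)
```

Because a child is only kept when it is no worse, the final cost can never exceed the starting cost. Other fitness measures (genre, mood, ratings) would slot into `layout_cost`.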

Also imagine that users could create roadways through this space. Roadways evolve over time as a path is traveled more frequently. These roadways correspond to "playlists." As listeners travel through the space they leave imprints that decay slowly over time. The more frequently users traverse a location in space, the "deeper" or "stronger" the imprint at that location. As users navigate, give them feedback about the intensity of the imprint at their current location, as well as at each adjacent location around them. Ideally users should be able to sense both the cumulative imprint intensity at any location and the rate of change of that intensity, and perhaps a momentum measurement that combines the two. This feedback should influence the listener's direction as they traverse the space: based on imprints, users should tend to navigate from the current track to the more highly imprinted adjacent tracks, thus reinforcing the most popular directions at each junction in space.

Alternatively, links could be established between each track and its adjacent tracks in order to separate the measure of track popularity from the measure of direction popularity. To accomplish this, let there be a directional link in each direction between each pair of adjacent tracks in space, and adjust the weight of each link according to how frequently it is traversed. It is then possible to adjust link weights automatically as a function of node (track) states, so that the links come to favor the more popular tracks. These methods of evolving paths through space are similar to ant-derived "scent-trail" collaborative rating schemes.
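A minimal sketch of the scent-trail bookkeeping, with several assumptions of my own: imprints decay exponentially with a configurable half-life, each traversal adds 1.0 to both the destination track's imprint and the directed link, and `suggest` simply picks the most-traveled outgoing link. The class and method names are hypothetical.

```python
import math
from collections import defaultdict

class TrailMap:
    def __init__(self, half_life=7.0):
        # half_life is in whatever time unit decay() is called with.
        self.decay_rate = math.log(2) / half_life
        self.node_imprint = defaultdict(float)  # track -> popularity
        self.link_weight = defaultdict(float)   # (src, dst) -> direction popularity

    def traverse(self, src, dst):
        """Record one listener moving from src to dst."""
        self.link_weight[(src, dst)] += 1.0
        self.node_imprint[dst] += 1.0

    def decay(self, elapsed):
        """Fade every imprint and link weight after `elapsed` time units."""
        factor = math.exp(-self.decay_rate * elapsed)
        for table in (self.node_imprint, self.link_weight):
            for key in table:
                table[key] *= factor

    def suggest(self, src, candidates):
        """Return the adjacent track with the strongest link from src."""
        return max(candidates, key=lambda dst: self.link_weight.get((src, dst), 0.0))

trails = TrailMap(half_life=1.0)
trails.traverse("a", "b")
trails.traverse("a", "b")
trails.traverse("a", "c")
favorite = trails.suggest("a", ["b", "c", "d"])
```

Keeping node imprints and link weights in separate tables is what lets track popularity and direction popularity be measured independently.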

Automatically save the trajectory that a user takes in each driving/listening session as a playlist. Allow users to name their playlists. At every junction between 2 tracks, search all playlists for any that contain a matching sequence traversing those nodes. Show the names of any playlists found graphically, as alternative paths to take from the current node. In other words, at any node the user can see which playlists intersect with that node and then choose to "follow the path" of any playlist they are interested in from that point.
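The junction search can be sketched as a scan for the two-track sequence inside every saved playlist. The function name and the dictionary shape (playlist name mapped to an ordered list of track ids) are assumptions for illustration; a large collection would want an index rather than a linear scan.

```python
def playlists_through(playlists, prev_track, current_track):
    """Find playlists containing prev_track immediately followed by current_track.

    Returns {playlist_name: tracks that follow the junction}, i.e. the
    path the listener would take by choosing to follow that playlist.
    """
    matches = {}
    for name, tracks in playlists.items():
        for i in range(len(tracks) - 1):
            if tracks[i] == prev_track and tracks[i + 1] == current_track:
                matches[name] = tracks[i + 2:]
                break  # first match per playlist is enough
    return matches

# Hypothetical saved sessions.
playlists = {
    "Road Trip": ["a", "b", "c", "d"],
    "Chill": ["b", "a", "c"],
    "Loop": ["x", "a", "b"],
}
options = playlists_through(playlists, "a", "b")
```

Here a listener who has just driven from "a" to "b" is offered "Road Trip" (continuing to "c" and "d") and "Loop" (which ends at this junction), but not "Chill".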

To further enhance the listening experience, consider combining all of the above with 3D visualization or immersive VR simulation, in which the user is literally navigating in a shared persistent 2D or 3D audio space, along with perhaps millions of other users. You could then travel with your friends, or go to a place where lots of others were gathered (a live performance or new release perhaps), or even follow someone (a "DJ") on a tour or trajectory. You could also choose your direction in the landscape using both auditory and visual cues -- this is especially important for providing "distance vision" in the audioscape (not practical through audio due to cacophony issues!) -- so while you can only hear the track you are playing and the track (if any) you are leaving or turning from, you can see visual representations of certain qualities of more distant tracks around you -- perhaps elevation, color, texture, weather, light, and other features of a simulated landscape. You could then intuit that in a given direction the listening experience changes in a qualitative way, such as getting more groovy, or more peaceful, or moving towards a certain mood or genre, etc.

Listed below are links to weblogs that reference Idea: Driving Through Virtual Soundscapes:

» A Tour Through Sound from teknos
Idea: Interactive Audio Spatial Navigation Absolutely brilliant idea from Nova. Read it, and come back. Okay- so here's some additions. You could also add a way for "drivers" to add meta-content to improve connections - or to suggest connections that [Read More]

» Hating on Multimedia from unmediated
Maciej Ceglowski has posted an "audioblogging manifesto" (transcript here) that is worth a listen. His basic point, that dictation-style audioblog posts and talking-head-style videoblog posts are boring, a waste of time, and antithetical to the nature ... [Read More]

Comments

"Next, run genetic algorithms on the landscape to organize the clips in the best ways" seems a little strange. Given some set of clips S with various properties and rankings p so that for each x in S there is a set of p, wouldn't simple sorting suffice, such as the type that occurs in everyday database transactions suffice? Genetic algorithms are slow, and have terrible performance as the number of items in S increases, and the number of possible permutations of parameters to optimize increases non-linearly.

Would love to know your thoughts on DHTs and semantic free references? See the link below-
http://nms.lcs.mit.edu/projects/sfr/

Well, you could use a joystick to navigate, and that would enable you to have an audio-vehicle that could drive, fly and dive, if you really want! Oh heck, I'll even throw in a prototype quantum teleportation drive we are working on so you can hyperwarp through wormholes in space-time to distant locations instantly. I wasn't going to add one of these because they're still experimental, but my boys tell me they can have one ready for alpha testing by next week. Use at your own risk.

Whilst it's normally a good idea to base an interface on a skill that the user already knows, driving has some limitations. Apart from the mechanics of the interface (what hardware? controls?), the essence of driving is A to B. Linear. Ignoring the surroundings. Making a turn just moves from one linear track to another. Music is an inherently pan-dimensional space.

Something like a 3d-Defender or simulated hot air ballooning might be more fun, and apposite to exploration of the music space. Or ditch the R and make it DIVING!
