Instructional Technologies: May 2014

Monday, May 12, 2014

Modernizr: A JS lib that detects HTML5 and CSS3 features in the user’s browser

Modernizr download & documentation

Modernizr is a JavaScript library that detects HTML5 and CSS3 features in the user’s browser.

Why use Modernizr?

Taking advantage of cool new web technologies is great fun, until you have to support browsers that lag behind. Modernizr makes it easy for you to write conditional JavaScript and CSS to handle each situation, whether a browser supports a feature or not. It’s perfect for doing progressive enhancement easily.

How it works

Modernizr runs quickly on page load to detect features; it then creates a JavaScript object with the results, and adds classes to the htmlelement for you to key your CSS on. Modernizr supports dozens of tests, and optionally includes YepNope.js for conditional loading of external .js and .css resources.

Check out the full list of features that Modernizr detects, or learn more about conditional resource loading with Modernizr.

Capturing Audio & Video in HTML5

Original post link: http://www.html5rocks.com/en/tutorials/getusermedia/intro/

By Eric Bidelman

Published: February 22nd, 2012Updated: October 29th, 2013Comments: 159

Introduction

Audio/Video capture has been the "Holy Grail" of web development for a long time. For many years we've had to rely on browser plugins (Flash or Silverlight) to get the job done. Come on!

HTML5 to the rescue. It might not be apparent, but the rise of HTML5 has brought a surge of access to device hardware. Geolocation (GPS), the Orientation API(accelerometer), WebGL (GPU), and the Web Audio API (audio hardware) are perfect examples. These features are ridiculously powerful, exposing high level JavaScript APIs that sit on top of the system's underlying hardware capabilities.

This tutorial introduces a new API, navigator.getUserMedia(), which allows web apps to access a user's camera and microphone.

The road to getUserMedia()

If you're not aware of its history, the way we arrived at the getUserMedia() API is an interesting tale.

Several variants of "Media Capture APIs" have evolved over the past few years. Many folks recognized the need to be able to access native devices on the web, but that led everyone and their mom to put together a new spec. Things got so messy that the W3C finally decided to form a working group. Their sole purpose? Make sense of the madness! The Device APIs Policy (DAP) Working Group has been tasked to consolidate + standardize the plethora of proposals.

I'll try to summarize what happened in 2011...

Round 1: HTML Media Capture

HTML Media Capture was the DAP's first go at standardizing media capture on the web. It works by overloading the <input type="file"> and adding new values for the accept parameter.

If you wanted to let users take a snapshot of themselves with the webcam, that's possible with capture=camera:

<input type="file" accept="image/*;capture=camera">

Recording a video or audio is similar:

<input type="file" accept="video/*;capture=camcorder">
<input type="file" accept="audio/*;capture=microphone">

Kinda nice right? I particularly like that it reuses a file input. Semantically, it makes a lot of sense. Where this particular "API" falls short is the ability to do realtime effects (e.g. render live webcam data to a <canvas> and apply WebGL filters). HTML Media Capture only allows you to record a media file or take a snapshot in time.

Support:

Android 3.0 browser - one of the first implementations. Check out this video to see it in action.
Chrome for Android (0.16)
Firefox Mobile 10.0
iOS6 Safari and Chrome (partial support)

Round 2: device element

Many thought HTML Media Capture was too limiting, so a new spec emerged that supported any type of (future) device. Not surprisingly, the design called for a new element, the <device> element, which became the predecessor togetUserMedia().

Opera was among the first browsers to create initial implementations of video capture based on the <device> element. Soon after (the same day to be precise), the WhatWG decided to scrap the <device> tag in favor of another up and comer, this time a JavaScript API called navigator.getUserMedia(). A week later, Opera put out new builds that included support for the updated getUserMedia()spec. Later that year, Microsoft joined the party by releasing a Lab for IE9supporting the new spec.

Here's what <device> would have looked like:

<device type="media" onchange="update(this.data)"></device>
<video autoplay></video>
<script>
  function update(stream) {
    document.querySelector('video').src = stream.url;
  }
</script>

Support:

Unfortunately, no released browser ever included <device>. One less API to worry about I guess :) <device> did have two great things going for it though: 1.) it was semantic, and 2.) it was easily extendible to support more than just audio/video devices.

Take a breath. This stuff moves fast!

Round 3: WebRTC

The <device> element eventually went the way of the Dodo.

The pace to find a suitable capture API accelerated thanks to the larger WebRTC(Web Real Time Communications) effort. That spec is overseen by the W3C WebRTC Working Group. Google, Opera, Mozilla, and a few others have implementations.

getUserMedia() is related to WebRTC because it's the gateway into that set of APIs. It provides the means to access the user's local camera/microphone stream.

Support:

getUserMedia() has been supported since Chrome 21, Opera 18, and Firefox 17.

Getting started

With navigator.getUserMedia(), we can finally tap into webcam and microphone input without a plugin. Camera access is now a call away, not an install away. It's baked directly into the browser. Excited yet?

Feature detection

Feature detecting is a simple check for the existence ofnavigator.getUserMedia:

function hasGetUserMedia() {
  return !!(navigator.getUserMedia || navigator.webkitGetUserMedia ||
            navigator.mozGetUserMedia || navigator.msGetUserMedia);
}

if (hasGetUserMedia()) {
  // Good to go!
} else {
  alert('getUserMedia() is not supported in your browser');
}

You can also use Modernizr to detect getUserMedia to avoid the vendor prefix dance yourself:

if (Modernizr.getusermedia){
  var gUM = Modernizr.prefixed('getUserMedia', navigator);
  gUM({video: true}, function( //...
  //...
}

Gaining access to an input device

To use the webcam or microphone, we need to request permission. The first parameter to getUserMedia() is an object specifying the details and requirements for each type of media you want to access. For example, if you want to access the webcam, the first parameter should be {video: true}. To use both the microphone and camera, pass {video: true, audio: true}:

<video autoplay></video>

<script>
  var errorCallback = function(e) {
    console.log('Reeeejected!', e);
  };

  // Not showing vendor prefixes.
  navigator.getUserMedia({video: true, audio: true}, function(localMediaStream) {
    var video = document.querySelector('video');
    video.src = window.URL.createObjectURL(localMediaStream);

    // Note: onloadedmetadata doesn't fire in Chrome when using it with getUserMedia.
    // See crbug.com/110938.
    video.onloadedmetadata = function(e) {
      // Ready to go. Do some stuff.
    };
  }, errorCallback);
</script>

OK. So what's going on here? Media capture is a perfect example of new HTML5 APIs working together. It works in conjunction with our other HTML5 buddies,<audio> and <video>. Notice that we're not setting a src attribute or including<source> elements on the <video> element. Instead of feeding the video a URL to a media file, we're feeding it a Blob URL obtained from a LocalMediaStreamobject representing the webcam.

I'm also telling the <video> to autoplay, otherwise it would be frozen on the first frame. Adding controls also works as you'd expected.

If you want something that works cross-browser, try this:

navigator.getUserMedia  = navigator.getUserMedia ||
                          navigator.webkitGetUserMedia ||
                          navigator.mozGetUserMedia ||
                          navigator.msGetUserMedia;

var video = document.querySelector('video');

if (navigator.getUserMedia) {
  navigator.getUserMedia({audio: true, video: true}, function(stream) {
    video.src = window.URL.createObjectURL(stream);
  }, errorCallback);
} else {
  video.src = 'somevideo.webm'; // fallback.
}

Setting media constraints (resolution, height, width)

The first parameter to getUserMedia() can also be used to specify more requirements (or constraints) on the returned media stream. For example, instead of just indicating you want basic access to video (e.g. {vide: true}), you can additionally require the stream to be HD:

var hdConstraints = {
  video: {
    mandatory: {
      minWidth: 1280,
      minHeight: 720
    }
  }
};

navigator.getUserMedia(hdConstraints, successCallback, errorCallback);

...

var vgaConstraints = {
  video: {
    mandatory: {
      maxWidth: 640,
      maxHeight: 360
    }
  }
};

navigator.getUserMedia(vgaConstraints, successCallback, errorCallback);

For more configurations, see the constraints API

Selecting a media source

In Chrome 30 or later, getUserMedia() also supports selecting the the video/audio source using the MediaStreamTrack.getSources() API.

In this example, the last microphone and camera that's found is selected as the media stream source:

MediaStreamTrack.getSources(function(sourceInfos) {
  var audioSource = null;
  var videoSource = null;

  for (var i = 0; i != sourceInfos.length; ++i) {
    var sourceInfo = sourceInfos[i];
    if (sourceInfo.kind === 'audio') {
      console.log(sourceInfo.id, sourceInfo.label || 'microphone');

      audioSource = sourceInfo.id;
    } else if (sourceInfo.kind === 'video') {
      console.log(sourceInfo.id, sourceInfo.label || 'camera');

      videoSource = sourceInfo.id;
    } else {
      console.log('Some other kind of source: ', sourceInfo);
    }
  }

  sourceSelected(audioSource, videoSource);
});

function sourceSelected(audioSource, videoSource) {
  var constraints = {
    audio: {
      optional: [{sourceId: audioSource}]
    },
    video: {
      optional: [{sourceId: videoSource}]
    }
  };

  navigator.getUserMedia(constraints, successCallback, errorCallback);
}

Check out Sam Dutton's great demo of how to let users select the media source.

Security

Some browsers throw up an infobar upon calling getUserMedia(), which gives users the option to grant or deny access to their camera/mic. The spec unfortunately is very quiet when it comes to security. For example, here is Chrome's permission dialog:

If your app is running from SSL (https://), this permission will be persistent. That is, users won't have to grant/deny access every time.

Providing fallback

For users that don't have support for getUserMedia(), one option is to fallback to an existing video file if the API isn't supported and/or the call fails for some reason:

// Not showing vendor prefixes or code that works cross-browser:

function fallback(e) {
  video.src = 'fallbackvideo.webm';
}

function success(stream) {
  video.src = window.URL.createObjectURL(stream);
}

if (!navigator.getUserMedia) {
  fallback();
} else {
  navigator.getUserMedia({video: true}, success, fallback);
}

Basic demo

Taking screenshots

The <canvas> API's ctx.drawImage(video, 0, 0) method makes it trivial to draw <video> frames to <canvas>. Of course, now that we have video input viagetUserMedia(), it's just as easy to create a photo booth application with realtime video:

<video autoplay></video>
<img src="">
<canvas style="display:none;"></canvas>

<script>
  var video = document.querySelector('video');
  var canvas = document.querySelector('canvas');
  var ctx = canvas.getContext('2d');
  var localMediaStream = null;

  function snapshot() {
    if (localMediaStream) {
      ctx.drawImage(video, 0, 0);
      // "image/webp" works in Chrome.
      // Other browsers will fall back to image/png.
      document.querySelector('img').src = canvas.toDataURL('image/webp');
    }
  }

  video.addEventListener('click', snapshot, false);

  // Not showing vendor prefixes or code that works cross-browser.
  navigator.getUserMedia({video: true}, function(stream) {
    video.src = window.URL.createObjectURL(stream);
    localMediaStream = stream;
  }, errorCallback);
</script>

Applying Effects

CSS Filters

Using CSS Filters, we can apply some gnarly effects to the <video> as it is captured:

<style>
video {
  width: 307px;
  height: 250px;
  background: rgba(255,255,255,0.5);
  border: 1px solid #ccc;
}
.grayscale {
  +filter: grayscale(1);
}
.sepia {
  +filter: sepia(1);
}
.blur {
  +filter: blur(3px);
}
...
</style>

<video autoplay></video>

<script>
var idx = 0;
var filters = ['grayscale', 'sepia', 'blur', 'brightness',
               'contrast', 'hue-rotate', 'hue-rotate2',
               'hue-rotate3', 'saturate', 'invert', ''];

function changeFilter(e) {
  var el = e.target;
  el.className = '';
  var effect = filters[idx++ % filters.length]; // loop through filters.
  if (effect) {
    el.classList.add(effect);
  }
}

document.querySelector('video').addEventListener(
    'click', changeFilter, false);
</script>

Click the video to cycle through CSS filters

WebGL Textures

One amazing use case for video capture is to render live input as a WebGL texture. Since I know absolutely nothing about WebGL (other than it's sweet), I'm going to suggest you give Jerome Etienne's tutorial and demo a look. It talks about how to use getUserMedia() and Three.js to render live video into WebGL.

Using getUserMedia with the Web Audio API

One of my dreams is to build AutoTune in the browser with nothing more than open web technology!

Chrome supports live microphone input from getUserMedia() to the Web Audio API for real-time effects. Piping microphone input to the Web Audio API looks like this:

window.AudioContext = window.AudioContext ||
                      window.webkitAudioContext;

var context = new AudioContext();

navigator.getUserMedia({audio: true}, function(stream) {
  var microphone = context.createMediaStreamSource(stream);
  var filter = context.createBiquadFilter();

  // microphone -> filter -> destination.
  microphone.connect(filter);
  filter.connect(context.destination);
}, errorCallback);

Demos:

For more information, see Chris Wilson's post.

Conclusion

In general, device access on the web has been a tough cookie to crack. Manypeople have tried, few have succeeded. Most of the early ideas have never taken hold outside of a proprietary environment nor have they gained widespread adoption.

The real problem is that the web's security model is very different from the native world. For example, I probably don't want every Joe Shmoe web site to have random access to my video camera. It's a tough problem to get right.

Bridging frameworks like PhoneGap have helped push the boundary, but they're only a start and a temporary solution to an underlying problem. To make web apps competitive to their desktop counterparts, we need access to native devices.

getUserMedia() is but the first wave of access to new types of devices. I hope we'll continue to see more in the very near future!

Additional resources

Demos

Friday, May 2, 2014

Browser Kernels

Browser Kernels and Their Representative Browsers

Data source: http://en.wikipedia.org/wiki/List_of_web_browsers

Notable layout engines

Gecko is developed by the Mozilla Foundation.
KHTML is developed by the KDE project.
Presto is developed by Opera Software for use in Opera. Development stopped as Opera transitions to Blink.
Tasman was developed by Microsoft for use in Internet Explorer 5 for Macintosh.
Trident is developed by Microsoft for use in the Windows version of their web browser, from Internet Explorer 4 to the present time.
WebKit is a fork of KHTML by Apple Inc. used in Apple Safari, Chromium and Google Chrome.
Blink is a 2013 fork of WebKit by Google used in Chromium, Google Chrome and Opera.^[19]
Servo is an experimental web browser layout engine being developed cooperatively by Mozilla and Samsung.

Graphical

Current/maintained projects are in boldface.

Trident shells

Other software publishers have built browsers and other products around Microsoft's Trident engine. The following browsers are all based on that rendering engine:

360 Secure Browser
AOL Explorer (discontinued)
Bento Browser (built into Winamp)
GreenBrowser
Internet Explorer
MediaBrowser (discontinued)
MenuBox
MSN Explorer
NeoPlanet (discontinued)
NetCaptor (discontinued)
RealPlayer
SlimBrowser
Tencent Traveler
TomeRaider
UltraBrowser (discontinued)
WebbIE

Gecko-based

Mozilla Firefox (formerly Firebird and Phoenix)

Netscape Browser 8 to Netscape Navigator 9 (discontinued)
Iceweasel, Debian's Firefox rebrand
GNU IceCat, GNU's fork of Firefox
Pale Moon (Windows-only optimized browser based on Firefox)
Comodo IceDragon (Firefox-based web browser for Windows)

AT&T Pogo (discontinued; based on Firefox)
Flock (discontinued; was based on Firefox until version 2.6.1, and based on Chromium thereafter)
Swiftfox (discontinued; processor-optimised builds based on Firefox)
Swiftweasel (discontinued; processor-optimised builds based on Iceweasel)
xB Browser (discontinued; formerly XeroBank Browser and Torpark), portable browser for anonymous browsing, originally based on Firefox

Firefox for mobile (codenamed Fennec)
Skyfire (for mobile)
Camino for Mac OS X (formerly Chimera)^[20] (discontinued)
Conkeror, keyboard-driven browser
Mozilla Application Suite (discontinued)

SeaMonkey (successor to Mozilla Application Suite)
- Iceape Debian's Seamonkey rebrand
Classilla (an updated fork of the Suite to Mac OS 9)
Gnuzilla GNU's fork
Netscape (discontinued; Netscape 6 to 7, based on Mozilla)
Beonex Communicator (separate branch, based on Mozilla Application Suite) (discontinued)

Yahoo! Browser (or partnership browsers e.g. "AT&T Yahoo! Browser"; "Verizon Yahoo! Browser"; "BT Yahoo! Browser" etc.)
Galeon, GNOME's old default browser (discontinued)
K-Meleon for Windows

K-Ninja for Windows (discontinued; based on K-Meleon)
K-MeleonCCF ME for Windows (based on K-Meleon core, mostly written in Lua)

MicroB (for Maemo)
Minimo (for mobile, discontinued)

Gecko- and Trident-based

Browsers that use both Trident and Gecko include:

K-Meleon with the IE Tab extension
Mozilla Firefox with the IE Tab extension
Netscape Browser 8 (discontinued)

Webkit- and Trident-based

Baidu Browser
Maxthon (formerly known as MyIE2)
Sleipnir

Gecko-, Trident- and WebKit-based

Browsers that can use Trident, Gecko and WebKit include:

Avant Browser^[21]
Lunascape^[22]

KHTML-based

Konqueror
Konqueror Embedded

Presto-based

Internet Channel (for Wii console, Opera-based)
Nintendo DS Browser (Opera-based)
Opera (for releases up until 12.16^[23])

WebKit-based

Amazon Kindle (experimental)
Arora (discontinued)
BOLT browser (discontinued)
Chromium
- Google Chrome (based on fork Blink since Chrome v. 28)
- Opera^[24]
- Amigo
- Torch browser
- Comodo Dragon
- QIP Surf
- Epic
- Nichrome
- SRWare Iron
- Uran Browser
- Yandex browser
- RockMelt (discontinued)
Dolphin Browser (Android and Bada)
Dooble
Flock (discontinued) (version 3.0 and above)
iCab (version 4 uses WebKit; earlier versions used its own rendering engine)
Iris Browser (discontinued)
Konqueror (version 4 can use WebKit as an alternative to its native KHTML^[25])
Maxthon (version 3.0 and above)
Midori
Nintendo 3DS NetFront Browser NX
OmniWeb
OWB
QtWeb
QupZilla
Rekonq
Safari
Shiira (discontinued)
Steel for Android
Steam ingame browser

Ultralight^[26] for Android
Uzbl
Web
Web Browser for S60, used in all Nokia Symbian smartphones.
webOS, used in the Palm Pre, Palm Pixi, Pre 2, HP Veer, Pre 3 and TouchPad mobile devices
WebPositive, browser in Haiku
xombrero

For Java platform

Bitstream ThunderHawk
BOLT Browser (discontinued)
HotJava (discontinued)
Opera Mini
uZard Web
UCWEB
HtmlUnit (Graphical user interface-less)
Teashark

Specialty browsers

Browsers created for enhancements of specific browsing activities.

Current

Gollum browser (Created specially for browsing Wikipedia)
Image Xplorer (Designed only to view, download, and print images)
Kirix Strata (Designed for data analytics)
Miro (A media browser that integrates BitTorrent add-on)
Nightingale (open source audio player and web browser based on the Songbird (see below) media player source code)
SpaceTime (Search the web in 3D)
Wyzo (A media browser that integrates BitTorrent-like add-on)
Zac Browser (For children with autism, and autism spectrum disorders such as Asperger syndrome, pervasive developmental disorders (PDD), and PDD-NOS)

Discontinued

Ghostzilla (Blends into the GUI to hide activity)
Prodigy Classic (Executable only within the application)
Flock (To enhance social networking, blogging, photo-sharing, and RSS news-reading)
RockMelt (Designed to combine web browsing, and social activities such as Facebook and Twitter into a unified one window experience)
Songbird (browser with advanced audio streaming features and built in media player with library.)

Mosaic based

Mosaic was the first widely used web browser. The National Center for Supercomputing Applications (NCSA) licensed the technology and many companies built their own web browser on Mosaic. The best known are the first versions of Internet Explorer and Netscape.

Others

Abaco (for Plan 9 from Bell Labs)
Amaya
Arachne (for DOS)
Arena
Ariadna (AMSD Ariadna) (first Russian web browser, discontinued)
AWeb (AmigaOS)
Baidu Mobile Browser
Charon (for Inferno)
Dillo (Small, fast, free, minimalistic, and multi-platform)
DR-WebSpyder (for DOS, discontinued)
Embrowser (for DOS, discontinued)
Gazelle (from Microsoft Research, OS-like)
IBrowse (for AmigaOS)
Mothra (for Plan 9 from Bell Labs)
NetPositive (for BeOS)
NetSurf (An open source web browser for RISC OS and GTK+ written in C)
Planetweb browser (discontinued for Dreamcast)
Qihoo 360 mobile browsers
Phoenix, a browser based on tkWWW
tkWWW, based on Tcl
Voyager (for AmigaOS)

Mobile

Main article: Mobile browser

Text-based

Main article: Text-based web browser