Thursday, September 18, 2008

Searching Video Lectures

Monday, November 26, 2007

Searching Video Lectures

A tool from MIT finds keywords so that students can efficiently review lectures.

Researchers at MIT have released a video and audio search tool that solves one of the most challenging problems in the field: how to break up a lengthy academic lecture into manageable chunks, pinpoint the location of keywords, and direct the user to them. Announced last month, the MIT Lecture Browser website gives the general public detailed access to more than 200 lectures publicly available though the university's OpenCourseWare initiative. The search engine leverages decades' worth of speech-recognition research at MIT and other institutions to convert audio into text and make it searchable.

The Lecture Browser arrives at a time when more and more universities, including Carnegie Mellon University and the University of California, Berkeley, are posting videos and podcasts of lectures online. While this content is useful, locating specific information within lectures can be difficult, frustrating students who are accustomed to finding what they need in less than a second with Google.

"This is a growing issue for universities around the country as it becomes easier to record classroom lectures," says Jim Glass, research scientist at MIT. "It's a real challenge to know how to disseminate them and make it easier for students to get access to parts of the lecture they might be interested in. It's like finding a needle in a haystack."

The fundamental elements of the Lecture Browser have been kicking around research labs at MIT and places such as BBN Technologies in Boston, Carnegie Mellon, SRI International in Palo Alto, CA, and the University of Southern California for more than 30 years. Their efforts have produced software that's finally good enough to find its way to the average person, says Premkumar Natarajan, scientist at BBN. "There's about three decades of work where many fundamental problems were addressed," he says. "The technology is mature enough now that there's a growing sense in the community that it's time [to test applications in the real world]. We've done all we can in the lab."

Looking at lectures: MIT is offering a video search tool that can pinpoint keywords in audio and video lectures. Here, a search for “exoskeleton and gasoline” results in this video clip. The automated transcript of the lecture appears below the video.
Credit: MIT

A handful of companies, such as online audio and video search engines Blinkx and EveryZing (which has licensed technology from BBN) are making use of software that converts audio speech into searchable text. (See "Surfing TV on the Internet" and "More-Accurate Video Search".) But the MIT researchers faced particular challenges with academic lectures. For one, many lecturers are not native English speakers, which makes automatic transcription tricky for systems trained on American English accents. Second, the words favored in science lectures can be rather obscure. Finally, says Regina Barzilay, professor of computer Science at MIT, lectures have very little discernable structure, making them difficult to break up and organize for easy searching. "Topical transitions are very subtle," she says. "Lectures aren't organized like normal text."

To tackle these problems, the researchers first configured the software that converts the audio to text. They trained the software to understand particular accents using accurate transcriptions of short snippets of recorded speech. To help the software identify uncommon words--anything from "drosophila" to "closed-loop integrals"--the researchers provided it with additional data, such as text from books and lecture notes, which assists the software in accurately transcribing as many as four out of five words. If the system is used with a nonnative English speaker whose accent and vocabulary it hasn't been trained to recognize, the accuracy can drop to 50 percent. (Such a low accuracy would not be useful for direct transcription but can still be useful for keyword searches.)

The next step, explains Barzilay, is to add structure to the transcribed words. Software was already available that could break up long strings of sentences into high-level concepts, but she found that it didn't do the trick with the lectures. So her group designed its own. "One of the key distinctions," she says, "is that, during a lecture, you speak freely; you ramble and mumble."

To organize the transcribed text, her group created software that breaks the text into chunks that often correspond with individual sentences. The software places these chunks in a network structure; chunks that have similar words or were spoken closely together in time are placed closer together in the network. The relative distance of the chunks in the network lets the software decide which sentences belong with each topic or subtopic in the lecture.

The result, she says, is a coherent transcription. When a person searches for a keyword, the browser offers results in the form of a video or audio timeline that is partitioned into sections. The section of the lecture that contains the keyword is highlighted; below it are snippets of text that surround each instance of the keyword. When a video is playing, the browser shows the transcribed text below it.

Barzilay says that the browser currently receives an average of 21,000 hits a day, and while it's proving popular, there is still work to be done. Within the next few months, her team will add a feature that automatically attaches a text outline to lectures so users can jump to a desired section. Further ahead, the researchers will give users the ability to make corrections to the transcript in the same way that people contribute to Wikipedia. While such improvements seem straightforward, they pose technical challenges, Barzilay says. "It's not a trivial matter, because you want an interface that's not tedious, and you need to propagate the correction throughout the lecture and to other lectures." She says that bringing people into the transcription loop could improve the accuracy of the system by a couple percentage points, making user experience even better.

No comments:

Post a Comment

Great Minds Have Similar Thoughts

Champions aren't made in gyms, champions are made from something they have deep inside them - a desire, a dream, a vision. They have to have last-minute stamina, they have to be a little faster, they have to have the skill and the will. But the will must be stronger than the skill.
-Muhammad Ali

I'll be more enthusiastic about encouraging thinking outside the box when there's evidence of any thinking going on inside it.
- Terry Pratchett

Not to be absolutely certain is, I think, one of the essential things in rationality.
- Bertrand Russell

What we think, or what we know, or what we believe is, in the end, of little consequence. The only consequence is what we do.
Sometimes what's right isn't as important as what's profitable.
- Trey Parker and Matt Stone

There are only two kinds of people who are really fascinating: people who know absolutely everything, and people who know absolutely nothing.
- Oscar Wilde

Sometimes I lie awake at night, and I ask, "Where have I gone wrong?"/ Then a voice says to me, "This is going to take more than one night."
- Charles M. Schulz

There is nothing worse than aggressive stupidity.
- Johann Wolfgang von Goethe

The significance of man is that he is insignificant and is aware of it.
- Carl Becker

A lie can travel halfway around the world while the truth is putting on its shoes.
- Mark Twain

"If you know how to spend less than you get, you have the philosopher's stone." So said Benjamin Franklin more than 200 years ago. How much easier it is to be critical than to be correct.
- Benjamin Disraeli

Of course the game is rigged. Don't let that stop you--if you don't play, you can't win.
- Robert Heinlein

Ability will never catch up with the demand for it.
- Malcolm Forbes

No man remains quite what he was when he recognizes himself.
- Thomas Mann

No man needs a vacation so much as the man who has just had one.
- Elbert Hubbard

There is no pleasure in having nothing to do; the fun is in having lots to do and not doing it.
- Mary Wilson Little

Books to the ceiling,/ Books to the sky,/ My pile of books is a mile high./ How I love them! How I need them!/ I'll have a long beard by the time I read them.
- Arnold Lobel

Leif Ostling said in a statement that his comments about Germany had been "interpreted in a way that was not intended."

If a man will begin with certainties, he shall end in doubts; but if he will be content to begin with doubts he shall end in certainties.
- Sir Francis Bacon

"It's not the voting that's democracy, it's the counting."
- Tom Stoppard

Elections are won by men and women chiefly because most people vote against somebody rather than for somebody.
- Franklin P. Adams

Invention is the mother of necessity.
- Thorstein Veblen

Don't try to solve serious matters in the middle of the night.
- Philip K. Dick

Das Loreleylied

Ich weiß nicht was soll es bedeuten
Daß ich so traurig bin;
Ein Märchen aus alten Zeiten,
Das kommt mir nicht aus dem Sinn.

Die Luft ist kühl und es dunkelt,
Und ruhig fließt der Rhein;
Der Gipfel des Berges funkelt
Im Abendsonnenschein.

Die schönste Jungfrau sitzet
Dort oben wunderbar,
Ihr goldenes Geschmeide blitzet,
Sie kämmt ihr goldenes Haar.

Sie kämmt es mit goldenem Kamme
Und singt ein Lied dabey;
Das hat eine wundersame,
Gewaltige Melodei.

Den Schiffer, im kleinen Schiffe,
Ergreift es mit wildem Weh;
Er schaut nicht die Felsenriffe,
Er schaut nur hinauf in die Höh´.

Ich glaube, die Wellen verschlingen
Am Ende Schiffer und Kahn;
Und das hat mit ihrem Singen
Die Lore-Ley getan.

Heinrich Heine, 1823

CURRENT MOON
moon phases

Logistics Log

Thursday, September 18, 2008