主播大秀

Prototyping Weeknotes #98

Post categories: Prototyping,听Prototyping weeknotes,听Weeknotes

George Wright | 11:13 UK time, Tuesday, 6 March 2012

This week's weeknotes are brought to you by the letter 'R' and the sound of the new Magnetic Fields LP, and the highlighted project is 'Roar to Explore'

Roar to explore

Vicky S says "This still-work-in-progress project uses voice search as a听way for kids to independently explore content and learn about animals by making their sounds. It also allows us to explore interesting new UI approaches which we think aren't currently being examined by products available on the market today听

Where this came from

This interesting use case for CBeebies came from the Future Media UX&D team. It鈥檚 a search tool that allows young children to make the sound of an animal to get content about that animal. For instance, to get lions, roar like a lion.听

R&D got involved to help develop the thinking.听

Why we think this is interesting

Young children find it difficult to use a keyboard and mouse, they can鈥檛 easily use a search box and browsing doesn鈥檛 scale.

听We would like to understand if voice search could be a useful navigation tool for kids, and could help them explore content independently.

听There is a lot of research about voice and speech recognition but not involving kids expressing themselves with sounds, which also makes this work interesting.听

What we鈥檝e done so far

听Feasibility study & demo:听R&D explored the technical feasibility. Chris P, Chris L & Yves built an audio classification system using sample data of 30 children making 12 different animal sounds to see if a computer could be trained to accurately classify the sounds into the 12 classes.

听Findings

听

路听听听听听听听听听It is possible to train a classifier to detect animal noises

路听听听听听听听听听Real world performance of the system is considerably worse than in the lab.

路听听听听听听听听听Careful choice of the algorithms is necessary.

路听听听听听听听听 Web sites do not have access to the host computer鈥檚 microphone by default

UX research:听As our classifier demo isn鈥檛 robust enough to test with kids and the aims of the engineering research are different from the user experience research, we have split UX and engineering into two research tracks.听For the UX research we are developing ideas for an experience prototype, which aims to test the concept and gain a better understanding of the behaviours and needs of our audience.听

We began with some background research relating to information retrieval and behaviour in young children, and found the following:

路听 Young children are easily bored, emotionally driven, drawn to accessible info they understand/ recognize and find it hard to switch cognition between 2 places (i.e. looking at screen/looking at keyboard).

听路听听听听听听听听 They find metaphorical and highly textual interfaces very difficult.

听路听听听听听听听听 Formulating a search query is difficult for children, because they have little knowledge to 鈥榬ecall鈥� concepts or terms from their long-term memory.听

路听听听听听听听听 Young children tend not to plan out their searches but rather react to results they receive from the IR system - behaviour is a conversation. They are easily distracted and their actions are reactions to the information & interface.听

路听听听听听听听听 Generally, their search strategies are not analytical and do not aim precisely at one goal. Instead, they make associations while browsing. This is a trial-and-error strategy.听

路听听听听听听听听 They find the home page comforting to start a new journey from 鈥� once confidence and familiarity is built with the page.

听We did some sketching and thinking about how to develop and evaluate an experience prototype, thinking in particular about 3 things:听Initialisation of the task, type of primary feedback and aiding further exploration

听

Pilot study

We ran a quick pilot study with 4 & 5 yr olds, to test a prototype approach and evaluation method and to answer some initial questions.听听Can they make the sound of the animal they want to see from a selection on screen?听听Do they know (think) they are controlling the device with their voice?听听Do they want to explore further (after the first go) - If it holds their interest or their attention starts to wander.

听Working with small groups of 2 or 3 children at a time, we showed the children a selection of 4 animals on a TV monitor. Three were recognizable CBeebies characters and one was a generic animal from a live action natural history programme. We asked the children to make the sound of the animal they liked best/wanted to pick.听

We learned that kids interpret the animal sounds quite differently.This has an impact on how the system should be designed to give control to the user to make their own interpretations of sounds, rather than having to learn the 鈥榗orrect鈥� generic sound. That way, the kids can train the system to work for them. 听We also observed that kids touch the screen intuitively 鈥� even if the device they are using doesn鈥檛 afford touch (I was using a hybrid TV/computer monitor).听Finally, the prototype needs to be very convincing for this audience. They will not suspend disbelief and they are curious about how things work.听

In my test, I initiated the conversation with the children and gave them instructions. For the next iteration of our experience prototype, we would like to try different ways of initialising the task with children. We could ask the parent to direct the child, we could use an avatar to initiate a conversation and ask the child to mimic what they do and we could use a prop that affords sound input like a microphone to see which is most successful.

听

The aim of our research is to understand the following:

听

路听听听听听听听听 How can we encourage children to use their voice as a controller?

路听听听听听听听听 Do children understand cause and effect - (that their input has a direct effect on the results)?

路听听听听听听听听 Where is voice good/ better than other forms of input like touch or gesture?

听

听

Technical considerations & recommendations

Further engineering work that would need to be done beyond the initial feasibility study

路听听听听听听听Single user classifier

路听听听听Improve acoustic model

路听听听听Study impact of recording equipment on the result"

dddIn other project news:

Michael S

This week I have been primarily working on debugging and making sustainable

the social bookmarks system - ie twitter harvesting and /programmes correlation.

Whilst looking at throughput numbers, I've noticed that there are 100,000

tweets per day for 主播大秀 national TV and radio stations, and 75,000 of those

per day correlate back to specific times in specific programmes.

Chris Needham

Attended an HTML5 event at Google on Thursday, learning about WebRTC (), WebGL (), and Web Intents ().听

Prepared a presentation for the FI-Content plenary meeting that we are hosting.

Olivier

Web Audio standards: the W3C working group has been very active this听week, talking about expanding its scope to look at standardising MIDI in听the browser, exchanging thoughts and feedback about the specs.听Meanwhile, our project to start a 主播大秀-centric prototype which will test听and stretch the APIs got the green light, and we (ChrisL, Matt and I)听will be starting work in earnest next week

Vicky B

This week I've been continuing to analyse the data from the NoTube 'Social Web and TV' survey,听 and using the initial findings to structure a user research workshop which will explore various themes around NoTube's Beancounter user profiling service () in more depth. The survey is still open if you'd like to be part of our research:

Andrew N

This week Andrew's been continuing work on the FI-Content dashboard including trying to diagnose a problem that was causing it to crash Safari. It turned out it was due to the 主播大秀 network which made him sad.听At Google's HTML5 day Web Intents () looks very promising as a mechanism for decoupling common web actions e.g. "share this page" with the web service that performs it. If it takes off, it's another step towards a more flexible web of loosely connected 'apps'.听Interesting link:听"The Web Is a Customer Service Medium"听

Thought-provoking essay dismisses the notion of the web as a meta-medium that mimics TV, radio, print and asks "what is the question that the web as a medium is answering?"

Pete

Last Wednesday through Friday, I was helping oversee the Fusion Trainee Lab 2012 at 主播大秀 Academy: a brief was set on Wednesday and presented back Friday afternoon to the commissioners. As a mentor, I encouraged the teams to focus their ideas thinking, gave them tips around developing clear solutions and showed them ways to communicate their propositions simply and clearly.

FI Content: This week I have been guiding the development of the Dashboard, helping clarify the script for the lab study sessions next week and designing TV interface mockups to show the participants.

Yves

We ended our third work package this week, so we spent a bit of time writing deliverables and preparing for the review meeting, which went very well. I have been working this week on quick prototype to expose all the data we have generated within ABC-IP so far, which will hopefully become the basis for the UX work we are doing right now. The prototype is now mostly done, using Rails on top of a triple store accessed via SPARQL, and using RedStore for tests. I packaged all I needed for that prototype in my PPA (听)

Meeting with I&A people to talk them through the work we've done so far.听听Lots of review deadlines this week, so I spent some time reviewing papers for various workshops/conferences. Lots of very high quality papers in there - I hope they get through! Giving a Prolog crash-course to the engineering team today - should be fun!听

Chris L

This week Chris has been continuing work on the automatic segmentation听investigation. The approach he is taking is to determine regions of听similarity within a piece of audio using the C99 algorithm (听听)and then听evaluate the performance of the algorithm using the WindowDiff听metric ().

Initial results are not encouraging with the C99 algorithm听positing too many segments when compared to the ground truth data. He's听been looking at collapsing neighbouring segments if they are shorter听than a certain threshold to see if this improves the performance.听

Next week we are carrying out user evaluations on attitudes to data privacy with 12 participants, 6 in our London lab, and 6 in our Salford lab. This week we have been preparing the final structure of the sessions, and putting together the materials that the participants will use.听

听

Share this page

Comments Post your comment

Comment number 1.
At 13th Mar 2012, lee wrote:

Roar to Explore - great idea, but your sample set is *much* too small, as you have found out. You will need thousands of examples to be able to train an ANN or compute a median vector. Why not run a CBeebies competition for four animal noises?
Now you have the software, you can do an awful lot of fun things - uploading bird songs and having the birds named, for example, was spoken of years ago on 主播大秀 Earth but never funded: would be great to see it happen

Complain about this comment (Comment number 1)
Comment number 2.
At 14th Mar 2012, Chris Lowis wrote:

Lee - thank you for your comments! You're correct about the need for many examples. Actually we compared an ANN and a SVM classifier and settled on the SVM as it required less data to train. We also investigated the effect of the number of classes on the classification performance. Our findings will be presented as a paper in April at the 132nd AES convention, if you'd like to see an advance copy of that paper please let me know.

Around the time of the conference we'll open-source the source code of this prototype - I'd love to see people attempt something like you suggest with real bird sounds.

Complain about this comment (Comment number 2)

听

This entry is now closed for comments

Jump to more content from this blog

About this blog

This is the Research & Development blog, where researchers, scientists and engineers from 主播大秀 R&D share their work in developing the media technologies of the future.

For the latest updates across 主播大秀 blogs,
visit the Blogs homepage.

Subscribe to Research and Development

You can stay up to date with Research and Development via these feeds.

Research and Development Feed(RSS)

Research and Development Feed(ATOM)

If you aren't sure what RSS is you'll find useful.

Other Related 主播大秀 Blogs

Mothballed Blogs

主播大秀 R&D Main Site

R&D 主播大秀page Image

For a detailed breakdown of our activities, teams, locations and how we collaborate visit our main website. We also host videos on the main website without UK only distribution restrictions.

Prototyping Weeknotes #98

Comments Post your comment

Comment number 1.

Comment number 2.

About this blog

Subscribe to Research and Development

Other Related 主播大秀 Blogs

主播大秀 R&D Main Site

More from this blog...

Topical posts on this blog

Being Discussed Now

Archives

Categories

Latest contributors

主播大秀 navigation

主播大秀 links

主播大秀

Prototyping Weeknotes #98

Comments Post your comment

Comment number 1.

Comment number 2.

About this blog

Subscribe to Research and Development

Other Related 主播大秀 Blogs

主播大秀 R&D Main Site

More from this blog...

Topical posts on this blog

Being Discussed Now

Archives

Categories

Latest contributors

主播大秀 iD

主播大秀 navigation

主播大秀 links