<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dougma (dŭg·mə) n. &#187; work</title>
	<atom:link href="http://dougma.com/archives/category/work/feed" rel="self" type="application/rss+xml" />
	<link>http://dougma.com</link>
	<description>the truth according to Doug</description>
	<lastBuildDate>Fri, 23 Apr 2010 00:49:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0-beta1</generator>
		<item>
		<title>Paradigm Shifting</title>
		<link>http://dougma.com/archives/79</link>
		<comments>http://dougma.com/archives/79#comments</comments>
		<pubDate>Fri, 04 Apr 2008 03:56:17 +0000</pubDate>
		<dc:creator>doug</dc:creator>
				<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.dougma.com/archives/78</guid>
		<description><![CDATA[So there are two new computers in the house. One is my new MacBook Pro, and the other is Josh&#8217;s new XO. Josh and I were going to share the XO, but to date I have spend a whole 15min on it. Josh would spend a few hours a day on it if we let [...]]]></description>
			<content:encoded><![CDATA[<p>So there are two new computers in the house. One is my new MacBook Pro, and the other is Josh&#8217;s new XO. Josh and I were going to share the XO, but to date I have spend a whole 15min on it.</p>
<p><span id="more-79"></span></p>
<p>Josh would spend a few hours a day on it if we let him. The first thing he said after he turned it on was &#8216;Daddy, I need to move my sites over to my new laptop because my old machine is dying.&#8217; That is a direct quote (Kim immediately wrote it down.) Josh uses the kitchen computer which is an old gateway all-in-one imac-ish 266Mhz PIII. It&#8217;s not dying so much as it just can&#8217;t run the new flash games he likes in realtime. Sadly I can not get the plucky little laptop to run the full <a href="http://www.noggin.com/">www.noggin.com</a> site due to it&#8217;s exclusive use of flash 8. There are rumors of a way to get it to work (with both software and hardware hacks as the real problem is the memory required to run such huge flash games.) The same reason why the 266Mhz machine can&#8217;t hack it, the XO can&#8217;t. I suspected this all along, but so far he has not missed the 1/2 functional flash games.One of his favorite activities on the XO is <a href="http://wiki.laptop.org/go/Memorize">memorize</a>. It is also the one he is most frustrated with. This is basically just the old memory card game, only it could be letters, simple math, or other matchings. You can create your own match boards, but for some odd reason, the game does not randomize the card placement (and hence his frustration). I noticed this and mentioned it to my wife, and then the next day Josh came to me and said &#8216;Dad, you need to help me program this. It&#8217;s always the same.&#8217; I peeked at the source code and my eyes crossed. I can&#8217;t imagine the creators not using a randomization, so it might just be pseudo random and the lacking of a proper seed. I should mention that Josh is 4 (&#8216;4 and a half in two days dad!&#8217;) and has been using computers sence he was one and a half. He has an impressive vocabulary, and ran read many words as well. Granted they are all words which show up in flash games; things like &#8216;Start&#8217;, &#8216;Play&#8217;, &#8216;Pause&#8217;, &#8216;Done&#8217;, &#8216;Next&#8217;, &#8216;Back&#8217;, &#8216;Print&#8217;, and &#8216;Quit&#8217;. It is fun to watch him show me how things work. The only activity which has him confused is &#8216;pipi&#8217;. When I told him it was how you program the computer he lit up, but as he can&#8217;t write yet, it is more a source of frustration than anything else. The track pad is a little frustrating as there is no tactile difference between the conductive and resistive portions. Right clicking is also an oddity for him. Early on I got a small kids mouse for the kitchen computer and mapped both buttons to a left click. I did this after the fifth time he brought up the flash &#8216;properties&#8217; and got all confused. All in all, very minor issues.</p>
<p>Now to the Mac. This has been a long time in coming. I grew up on Mac machines. It was not until college that I switched to PC and linux (at the same time no less). This was because of a virus actually. I was fascinated by the <a href="http://en.wikipedia.org/wiki/Michelangelo_(virus)">Michelangelo</a> virus. this virus actually affected both Mac and PC computers. However I could hack on it on the linux boxen (running the new, easy to install, slackware distribution, installed from 104 floppy diskettes!) The computer lab I was working at ran the new Windows 3.11 environment (not yet an OS) and <a href="http://en.wikipedia.org/wiki/Wolfenstein_3D">Wolfenstein 3D</a> was the game of choice (and Civ, Doom, and Scorched Earth&#8230; good times, good times&#8230;) Well I am back to the Mac, and just in time. Less than 12 hours after getting the machine my old laptop stopped having those pesky hardware faults intermittently; preferring to have them quite consistently during the boot process.So I have been using a Mac for almost 2 weeks. Win2K and XP are now painful for me to use; and I had always thought of Win2K as the best C++ development environment; sorry Unbuntu. I tried using a different laptop the other day and the lack of two finger scrolling and other quick options drove me to distraction. The &#8216;Command&#8217; button still annoys me a little. Mainly because the control key has been moved over and not every thing has been remapped to command, so hitting control is a pain, which makes emacs less than pleasant. The other issue is the screen resolution. I am used to 1600&#215;1200, and this max out at <strike>1200&#215;900</strike> 1440&#215;900. The loss of 30% of my screen height is a real bummer for both browsing and coding. I limit my coding to 80 characters, so the extra wide screen does not gain me much, and the loss of 30% hurts a lot in viewable context. I am sure I will get used to it. To date the only software that I am missing are some obscure codecs for old fansub anime, the ability to write to NTFS volumes and dev studio; sorry there is no better C++ debugger, hate the IDE beyond that. I can live with that. I am trying out vmware to get those 3 things I am missing. The rest is currently native Mac. I can&#8217;t believe how easy it was to switch.</p>
]]></content:encoded>
			<wfw:commentRss>http://dougma.com/archives/79/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Here there be Dragons!</title>
		<link>http://dougma.com/archives/56</link>
		<comments>http://dougma.com/archives/56#comments</comments>
		<pubDate>Tue, 20 Nov 2007 00:08:40 +0000</pubDate>
		<dc:creator>doug</dc:creator>
				<category><![CDATA[dragon]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.dougma.com/archives/53</guid>
		<description><![CDATA[Well it has been a 10+ year wait for some and a ~6 year wait for me, but there is finally a Dragon Naturally Speaking TV Commercial! It&#8217;s just being shown on some basic Cable channels (CNN, Discovery, History, Fox ***). There are not that many spots purchased, but it&#8217;s something! I remember being told [...]]]></description>
			<content:encoded><![CDATA[<p>Well it has been a 10+ year wait for some and a ~6 year wait for me, but there is finally a <a href="http://www.nuance.com/naturallyspeaking/" target="_blank">Dragon Naturally Speaking</a> <a href="http://www.youtube.com/watch?v=2T4v5nfVKTk" target="_blank">TV Commercial</a>! It&#8217;s just being shown on some basic Cable channels (CNN, Discovery, History, Fox ***). There are not that many spots purchased, but it&#8217;s something! I remember being told back in 98 that we would have a commercial for VoiceXpress, never to see it happen. I was more than a little jaded when it was announced internally a year ago, that it was finally going to happen. It is a little surreal to see something that I have worked on (that is not Open Source) hawked on TV. Now I am am just hoping we can get a GOOD commercial for the product, I would settle for merely not bad at this point.</p>
<p><object width="425" height="355"><param name="movie" value="http://www.youtube.com/v/2T4v5nfVKTk&#038;rel=1"></param><param name="wmode" value="transparent"></param><embed src="http://www.youtube.com/v/2T4v5nfVKTk&#038;rel=1" type="application/x-shockwave-flash" wmode="transparent" width="425" height="355"></embed></object></p>
<p>My god, I wish I had something good to say about this piece of advertisement. Lets decompose.</p>
<p><strong> 1. The product</strong></p>
<p>They somehow took a product that is fun to watch people use, and has a very cool, minimal, embedded interface, and made it a chore to watch. Worse yet, they misrepresented it&#8217;s capabilities.  It works better, faster, and easier than is portrayed. It is a continuous speech recognizer, why is that kid talking discreetly (one word at a time, and slowly?). It looked like he was using Kurzweil Voice or Dragon Dictate from 1995! Even the executive is talking as if he is reading a cue card (which is most likely the truth of it). This software is used by stenographers in court rooms. The fastest speaker can be recognized. I thought the most precious thing in a commercial was time? Why is there so much filler?</p>
<p>They never actually show the software clearly. The dragon bar is just a yellow blurry blob on the screen. Granted you can hide the bar, but the end result is, the system appears to be very slow and unresponsive. Even the cute typing text in the add is a misrepresentation. We do not send one letter at a time to the screen; that would be annoying after 2 seconds, not to mention slow. At least the womans use of the product is a little closer to reality. Oh and yes, you have full command and control over your PC ala Star Trek, but by saying &#8216;listen to me&#8217; and &#8217;stop listening&#8217; instead of &#8216;computer&#8217;; though you would never get that impression.</p>
<p><strong>2. The script</strong></p>
<p>Wow. I mean wow. When it comes to emerging trends, technology, and what the &#8216;hip&#8217; kids are doing, our upper management is fairly on top of things. They also understand the products they sell at a level I have not seen at any other company. I guess the same can not be said of our hired marketing firm (or whoever produced this thing)! The things wrong with the script (from both a technical and substantive perspective):</p>
<ul>
<li><strong>The woman writing an e-mail</strong>:  Even my Mom knows what a signature is and how to set her mail program to include it automatically. Who in their right mind would use a voice macro to do that? There are much better ways to show off macro&#8217;s. And <a href="http://www.nuance.com/naturallyspeaking/professional/demo.html" target="_blank">form filling</a> would be better for multiple signatures. That is a killer feature!</li>
<li><strong>The history report</strong>. I know adults think teenage boys are stupid and cannot write a decent report, but this poor actor must have been choking back bile when he saw this script.</li>
<li><strong>The IM interaction</strong>. This is just abysmal: &#8216;Are you in&#8217;? Every IM client from hour 0 has had the &#8216;away&#8217; feature, so yes, he is &#8216;in&#8217;, you know that already. I guess &#8216;in&#8217; is the hip new lingo all the kids are using these days.</li>
<li><strong>The executive</strong>. This is how a teenage boy pretending to be an executive talks.</li>
</ul>
<p><strong> 3. The acting</strong></p>
<p>All I can say is I really hope we got a good deal here. In some cases I think it is the material. There is not much you can do to make it palatable, so why waste the effort. It is a cut rate commercial for a few basic cable channels after all. What is interesting is we have many commercials for our vendors and resellers, and the acting there is top notch. You can find most of those on youtube, or on our <a href="http://www.nuance.com/talk/" target="_blank">website</a> (though this is a demo, not a commercial).</p>
<p><strong>4. The Direction and Cinematography</strong></p>
<p>Ok, its a commercial. The sets were believable. the lighting spot on, the sound perfect, and the basic camera work was good.  But please show the product. You never got a good look at what was going on, on the machine. The product is supposed to be the star of the commercial. Here it appears to be a character off screen being talked about.</p>
<p><strong>What would I have done?</strong></p>
<p>Well I would do a rip off of the HP &#8216;talking hands&#8217; commercials. This was actually my idea back in 1997 when I first started working with speech. Show the apps swapping around like you do in real life, only by voice. Show the real speed of the product. show the people, but not their heads. Show their feet propped up on the desk in front of them. Put the product first, and make it interesting. Have them standing there, and the transparent screen in front doing all the cool things they do in the HP commercials, only they have their hands behind their back. Show people really using the product, not selling it. I guess that is why I am an engineer and not in marketing.</p>
<p><strong>Additional Notes:</strong></p>
<p>No you do not need to speak punctuation. There is auto-punctuation, but before it is accurate the system needs to have been used for a number of hours to collect enough statistics on how you use punctuation and inflection. Having DNS inspect past e-mails and documents can help, but gathering the inflection data is the key part. For that reason it is turned off be default. More research and data will fix that.</p>
<p>No you do not need to spend hours or even minutes training the system. You can use it right out of the box, and it will get better with time. To be honest the learning curve is more on the user end, than on the software end. Talking to a computer is just like any new means of input, like cell phone SMS tapping, or pen tablet. It takes time to become comfortable with it, but not as much time as other input devices.</p>
<p>This blog post dictated with <a href="http://www.nuance.com/talk/" target="_blank">Dragon Naturally Speaking 9.5 Professional</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://dougma.com/archives/56/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>We Are Hiring!</title>
		<link>http://dougma.com/archives/51</link>
		<comments>http://dougma.com/archives/51#comments</comments>
		<pubDate>Thu, 11 Oct 2007 19:44:34 +0000</pubDate>
		<dc:creator>doug</dc:creator>
				<category><![CDATA[c++]]></category>
		<category><![CDATA[pycon]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[work]]></category>

		<guid isPermaLink="false">http://www.dougma.com/archives/48</guid>
		<description><![CDATA[I have not had much time lately to make posts, but one issue has risen to the top. At work we are hiring. This is nothing new really, the jobs listing is 5 pages long for our Burlington Ma. office alone. What is different, is that the group I work in is hiring. That would [...]]]></description>
			<content:encoded><![CDATA[<p>I have not had much time lately to make posts, but one issue has risen to the top. At <a href="http://www.nuance.com/" target="_blank">work</a> we are <a href="http://jobs-nuance.icims.com/nuance_jobs/jobs/candidate/job.jsp?jobid=2770&amp;mode=view" target="_blank">hiring</a>.</p>
<p>This is nothing new really, the jobs listing is 5 pages long for our <a href="http://maps.google.com/maps/ms?f=q&amp;hl=en&amp;geocode=&amp;time=&amp;date=&amp;ttype=&amp;ie=UTF8&amp;msa=0&amp;msid=101366992701854199693.00043c3cd08e7e46dc9be&amp;t=k&amp;z=16&amp;om=1" target="_blank">Burlington Ma. office</a> alone. What is different, is that the group I work in is hiring.  That would be the MREC (Modular Recognizer) Group, which does the development on the core speech recognition engine for a multitude of product lines, including <a href="http://www.nuance.com/naturallyspeaking/">Dragon Naturally Speaking</a>, <a href="http://www.nuance.com/voicecontrol/" target="_blank">Mobile Solutions</a>, and <a href="http://www.nuance.com/healthcare/" target="_blank">Medical Transcription Services</a> to name just three. Yes I know the &#8216;demos&#8217; suck, you can <a href="http://www.youtube.com/results?search_query=nuance+speech&amp;search=Search" target="_blank">search youtube</a> and find better examples of the products.</p>
<p>The Core MREC team is has always been a small core group of less than 10 engineers working with research and the product groups.  Speech experience is not a requirement <em>(and is actually the exception)</em> for our group. We work with research to develop new features and productize new algorithms. The work is primarily C++ on windows and linux. The code base is the cleanest one I have ever worked on (and I have worked on over three dozen in a number of fields). The code is not old either, as we are continually rewriting parts of the engine, removing obsolete features, and adding in new ones; that is where the &#8216;M&#8217; in MREC comes into play. Python experience is a big plus. The primary research framework is all python and the core engine is instrumented in python.</p>
<p>If this sounds interesting to you, send in your resume, either via the <a href="http://jobs-nuance.icims.com/nuance_jobs/jobs/candidate/login.jsp?jobid=2770" target="_blank">official nuance form</a>, or by e-mailing me (doug at this site). You can also ask questions in comments to this post.</p>
<p>This post was dictated directly into wordpress using Dragon Naturally Speaking 9.</p>
]]></content:encoded>
			<wfw:commentRss>http://dougma.com/archives/51/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
