Will Wade
About code Archive Photos Bookmarks
  • Published our py3-tts-wrapper python library finally this week. Should power a lot of funky things. Supports all the major TTS engines online and new ones offline. Use alongside this app to see what voices are available (API here)

    → 11:26 AM, Jul 31
    Also on Bluesky
  • Iterative rapid development Part 2 (aka Judith Part 2).

    So sometimes.. just sometimes.. a couple of months works pays off. Or I guess we could frame this post around successful product development - keeping the end user the focus of your development and involving them at all stages of your development cycle is key. We are proud to shout loudly about this.

    So two months ago we saw Judith. Technology has not been successful for her before. (You can read more why in my previous posts). So we rapidly developed a SwiftUI app that was unique in its operation - you dragged from one letter to the next. Trying it with Judith we still had a problem - positional errors and missing letters were high. So we then rapidly worked on a “correct-a-sentence system” and settled on using a Azure OpenAI API (GPT3.5) and then developed our own custom model alongside this (needing something that worked offline and privacy first was key (NB: The videos is this offline model - Not GPT). A few iterations of this and its correcting as good as GPT 3.5 83% of the time.

    So two months later, what does this all mean for the end user? They can now write on technolgy to communicate for the very first time. Happy - well so far yes. But long term AAC use is complex and not just down to the technology. So lets be curious and proud at the same time. There is more to do

    • Small tweaks in the UI
    • Its correcting a bit too much at times - sometimes adding more grammar changes than I would like - and equally dangerous - losing words. To fix this we could further train our custom model.

    and then more I’m sure the next time we review this cycle of the iterative development cycle.

    What does this process look like overall?

    We follow the double-diamond development cycle. It’s a pretty common approach yet following it is not always easy.

    So lessons so far?

    • Iterate Fast: Rapid prototyping and adjustments are key to finding viable solutions for complex, unique solutions. There is a much longer tail to development for solid, reliable products but getting somewhere fast helps to see whats important.
    • User-Centric Design: Keeping the user at the heart of all stages of development ensures that the technology we develop truly meets their needs. Its not always easy but it can be done. Its important to do this as part of a team though. It can so easily be lost as to whether you are getting carried away or whether you are on track. We pride ourselves on at Ace Centre being a transdisciplinary team where never one person sees the whole picture.
    • Continuous Learning: Every challenge presents a learning opportunity, pushing us to constantly improve. And heck - its good fun too seeing big gains.
    • Passion and Persistence: A keen interest in making a difference and the drive to keep pushing forward are indispensable.
    → 9:38 PM, Mar 26
    Also on Bluesky
  • Objectified. "if we understand what the extremes are, the middle will take care of itself."

    Objectified (2009, 75 minutes) is a documentary film about our complex relationship with manufactured objects and, by extension, the people who design them. What can we learn about who we are, and who we want to be, from the objects with which we surround ourselves?

    It’s a great film (and free to watch from March 14-17!) but in particular, I love this quote from Dan Formosa, Design & Research, Smart Design, New York (around 6 minutes in)

    But really our common interest is in understanding people and what their needs are. So if you start to think, well really, what these guys do as consultants is focus on people, then it’s easy to think about what’s needed design wise in the kitchen, or in the hospital, or in the car.

    We have clients coming to us and saying here’s our average customer. For instance, she’s female, she’s 34 years old, she has 2.3 kids.

    And we listen politely and say, well that’s great, but we don’t care about that person. What we really need to do to design is look at the extremes, the weakest, or the person with arthritis, or the athlete, or the strongest, or the fastest person. Because if we understand what the extremes are, the middle will take care of itself.

    View it online at www.ohyouprettythings.com/free

    → 8:35 AM, Mar 15
    Also on Bluesky
  • I’ve been considering LLMs' potential in autocorrecting AAC input. They offer significant gains, but user-guided inference still excels. Yvette’s video demonstrates this: 32 WPM using a glide pad in Dasher. Anyone faster?! We must get a new version developed.

    youtu.be/va1WufK86…

    → 8:54 PM, Feb 20
    Also on Bluesky
  • Rapid development in swiftUI for niche problems

    Some clients we see are fantastic with paper-based solutions. But sometimes, finding powered AAC systems which give them more independence is far trickier than you may think. Consider Judith. She doesn’t lift her finger from the paper. This continuous movement is surprisingly not well supported in AAC. Your obvious thoughts are SwiftKey and Swype, but they require a lift-up at the end of a word or somewhere else. Next up, you may try a Keguard or TouchGuide. But then, for some users, this is too much of a change in user interaction. Even if you succeed, you often ask an end user to change the orientation or layout of their paper-based system.. and all in all, it’s just too much change. Abandonment is likely. The paper-based system is just more reliable.

    So what do we do? We could look at a bespoke system. But typically, it requires much thought, effort and scoping. That’s still needed, but you can draft something up far quicker these days using ChatGPT. It wrote the whole app after a 2-hour stint of prompt writing. Thats awesome. (Thanks also to Gavin, who tidied up the loose ends). So, this app can be operated by detecting a change in angular direction or detecting a small dwell on each letter. We need to now trial this with our end user and see what’s more likely to work and what’s not and work this up. We may need a way of writing without going to space (something that we see quite a lot), and I can see us implementing a really needed feature, autocorrect. This is all achievable. But for now, we have a working solution to trial a 500 lines of code app made in less than a day’s work.

    → 11:04 PM, Jan 12
    Also on Bluesky
  • Stephen Hawking's AAC setup in closeup

    At MOSI in Manchester today, I saw Stephen Hawking’s Chair and other neat things from his office in Cambridge. Note the spaghetti of cables. It’s tricky to figure out where all the leads go, but I’ll give it a wild guess. The plugs look like either mini XLR or the old PS2 Serial leads. Some questions, though; I’m unsure what the “Filter” box fits to and why is the Words+ box even used? I thought the connection with Intel meant he was using ACAT. Why is that Words+ Softkey box the parallel version when there is clearly a lot of USB kicking about, too? Why are we plugging into something behind the chair when surely the tablet has the speakers anyway? There are as many questions than answers.

    • Words+ Archive page (This is the USB version of the softkey box)
    • Case for Original Synthesiser made by David Mason at Cambridge Adaptive
    • Chair details
    → 9:03 PM, Nov 3
    Also on Bluesky
  • 🚀 Calling All AAC Testers for a new release of our Google Cloud/Azure TTS and Translation tool

    We’ve given our little Translate and Speak app for Windows a complete makeover. Our app not only translates text but also vocalizes messages from the message window using online services. We’ve introduced a user-friendly GUI to simplify configuration, extended support to include paid translation services, and here’s the grand reveal… you can now empower any Windows AAC app to leverage Google Cloud TTS or Azure TTS which massively opens up the possibility of using AAC with more languages. You can even use these services without translation - so just to speak.

    Get your hands on the early version here. BUT - just a heads up, you will need to be comfortable obtaining keys for Azure or Google Cloud. Check out our (somewhat outdated) docs for guidance. And ping me some feedback before we release it properly. Prizes for someone who can make me a nice demo video!

    → 8:48 AM, Sep 2
  • The new voice creation tool in iOS and live speech (in-built tts app from any screen) in iOS. My voice is definitely.. clunky .. but bear in mind I recorded this at like 2am in a AirBnB and didn’t want to wake the neighbours. 15 mins recording.

    → 9:58 PM, Jul 15
  • Need an AAC/AT textbook but nowhere near a library or have money? This is awesome from Internet Archive. Loan a textbook for free for an hour at a time. archive.org/details/i… (and my personal fav : archive.org/details/a…)

    → 4:25 PM, Jul 15
  • Knocked this up quickly. Type in one language. Translate within your AAC app, speak it out loud in that language and paste it back. Pretty configurable. Lag is due to using Google TTS but will work with offline TTS systems - i.e SAPI, coqui & eSpeak. code

    Quick demo video here

    → 12:27 AM, Jul 5
  • From our fab OT student today

    So are you telling me my phone can autocorrect but that’s not done on AAC Devices .. and the user always has to select their predictions?.

    Hmmm. I wonder if I was missing something (and not from the world of research like Keith Vertanen’s demos.. anyone?

    So what’s the difference between autocorrect and prediction? Prediction software has been around for years. In essence, the software displays predictions, and in some way, you have to select the predicted word or sentence. In some software the selection technique is reduced (e.g. in Microsoft 365 products now a swipe to the right allows you to select the suggested word/phrase). But you still have to actively look and seek it. More recently, autocorrection software has started to appear. If you ask me, it makes a lot of sense (some suggest it’s a terrible idea for learning language.. but for dyslexia support it looks amazing). You reduce the visual search aspect and just type. Any mistakes or typos it tries to correct. It’s not for everyone - but in AAC, it seems like a great idea. Focus on what you can get out and let the thing correct itself.

    → 5:55 PM, Jun 21
  • I am reviewing the “traffic light” system for AAC by Karen Erickson for our ATU this week. It’s similar to ideas in an Activity & Occupation Analysis - but much more reduced and focused on AAC activities in a day. Redrawn here - maybe too reduced from the original concept. Use at your peril.

    → 5:40 PM, Jun 21
  • A fascinating couple of papers that Simon Judge notes in his blog about the design and abandonment of AAC systems

    “the role of communication aids in an individuals’ communication is subtle and not a simple binary ‘used or not used’”.

    What I find really neat is Zoë’s paper and the creation of a model

    “This model consists of a communication loop – where experiences of prior communication attempts feed into decisions about whether to communicate and what method to use to communicate – each of which were influenced by considerations of the importance of the message, the time taken, who the communication partner was, the environment the communication is taking place in (physical and social) and the personal context and preferences of the individual

    The “choice” of when and how much to use an AAC device is down to the user. We shouldn’t see this as abandonment.

    → 10:04 AM, Jun 20
  • Just watched Edward Changs talk about their BRAVO project at #bci2023. It’s next level. They are not the only team doing this (e.g. Frank Willets) , but they are one of the few making significant improvement over current AAC solutions.. (even if its for n=2). The video of Ann writing by thought alone at this rate. wow.

    This was pre-publication. Watch their page for updates changlab.ucsf.edu/overview

    → 2:25 PM, Jun 7
  • Silent Speech and Sub-Vocal Communication. The next big technology breakthrough for AAC?

    Silent Speech & Sub-Vocal research is picking up. EMG can detect speech since the 70s but It’s been hard to make it useful. Now though? There are even instructions for making your own . Check out some papers and AlterEgo from MIT for a fancy demo. It’s AI aka “Applied Statistics” making this possible - and I feel that it’s this aiding access that will be the biggest impact on our field than areas of language.

    → 12:02 AM, Jun 6
  • Sebastian Pape has been doing a ton of work on the original Dasher code base for his research on Dasher in VR. It’s pretty awesome. Some of the output can be seen here (and watch the video) - you can also watch a 3D initial from our meeting here. dasher.acecentre.net

    → 5:09 PM, May 25
  • Over the next few weeks I’m fortunate to be representing Ace Centre at two international conferences; BCI meeting and ISAAC talking about our audit on text entry rate in AAC and a lot about Dasher. Hope to see you there if you are going too!

    → 3:05 PM, May 25
  • Last week we released TextAloud on the AppStore. You can read our blog for the entire details as to what it’s all about and why but in brief, it’s v1 of a more extensive app we want to create to support people better in long streams of TTS. We have several ideas for this - but most importantly, we are putting users at the heart of the design process along all stages (using the double diamond approach). Get in touch if you want to be part of the focus group. One idea, though, is using SSML to help markup a speech. You can see one implementation idea below.

    There’s a much longer post due from me about why SSML hasn’t been used in AAC, but in short - the time is overdue.

    → 7:00 AM, May 21
  • RSS
  • JSON Feed