Many have rightfully praised the movie Her for its foresight in predicting the emotional and societal impact of artificial intelligence. But when it comes to AI in the workplace, it’s Iron Man’s J.A.R.V.I.S.- the AI that seamlessly manages Tony Stark’s schedule, home, lab, and Iron Man suit – that feels more aligned with what we’re seeing emerge today. It’s less about companionship and more about an intelligent, always-on co-pilot for getting things done.
Within the startup ecosystem, there are glimpses of what the first versions of J.A.R.V.I.S. might look like. A few companies have started building personal AI assistants or AI-augmented virtual assistants that can operate with high fidelity. We believe this is the start of building highly personalized AI assistants that can predict and complete tasks on behalf of individuals. The world’s best executive assistant, available to everyone. A few key trends are buttressing our excitement in the space:
We have identified two distinct ways in which companies are building towards this J.A.R.V.I.S. future:
We aren’t yet sure which path is better, but we’re excited about both. A few of our key questions as we continue to dive deeper into this opportunity are:
With LLMs being able to reasonably mimic the diction of an average person and task-executing agents seemingly right on the horizon, AI assistants are gaining popularity. As more of our internal communications and corporate data are exposed via streamlined APIs, these assistants can be prompted to access key data to perform (at first) basic tasks on behalf of users.
There are two main strategies we have seen to date. First, there are AI-native assistants that start with basic tasks, such as scheduling meetings. These products are largely activated via email prompts and are, for the time being, more limited in scope in what they can do for users. Second, there are virtual assistants who leverage AI tools to perform a greater range of tasks more quickly. While both strategies start from different places, we believe they share similar goals: to gather idiosyncratic data on users’ preferences to ultimately offer them a highly tailored AI assistant that can act as an extension of the user themselves.
The two sets of strategies also target different demographics. People who are accustomed to interacting with assistants and want white-glove service might opt for the “safer” human-in-the-loop option (Athena charges thousands of dollars a month). Those who have historically not had access to an assistant or are more price-sensitive might opt for the fully automated offerings (Skej charges $10 a month).
Ultimately, we think that these businesses can be powerful conduits for the maturation of agent technology. Tool connectivity will shift the power towards companies who directly own the customer relationship, something that stands to benefit Task-Executing Assistants. As consumers become accustomed to AI assistants completing tasks that are feasible with today’s architecture, these assistants can eventually “earn the right” to handle more extensive tasks in the future, such as booking travel, ordering gifts, or other ancillary tasks that people may want to offload.
Advancements in transcription and voice AI have given rise to a new class of tools that process audio data from meetings, calls, and conversations. These tools primarily focus on summarization and transcription, making spoken content more accessible and searchable.
Companies like Gladia are developing infrastructure to support these capabilities, offering real-time, multilingual transcription services for various applications. Similarly, Hume AI is exploring the integration of emotional intelligence into voice interfaces, aiming to make interactions with AI more natural and personalized.
In the realm of verticalized solutions, Winn.AI provides a sales-focused assistant that automates CRM updates and offers real-time guidance during meetings. On the other hand, Amie emphasizes design and user experience, integrating calendars and to-do lists into a seamless interface.
While these developments are promising, it’s important to note that the field is still in its early stages. The majority of tools have not yet moved beyond enhanced (and pretty awesome) summarization and transcription. The potential for richer, context-aware systems exists, but realizing this vision will require further innovation and user adoption.
We’re incredibly excited about what the future of AI-enabled assistants has in store. The products that exist today are the worst they’ll ever be, and yet are already incredibly powerful. If you are building anything that intersects with the theses we outlined above, please reach out! Molly, Nick, Naseem